N.B.: For a version of these notes with clickable issues with tooltips, see releases.
See the improved changelog-detailed.md for a detailed list of semantic changes in the EDAM ontology.
- 20 formats added (listed in changelog-detailed.md)
- Added
repository
attribute for Formats - Structural clean-up of various areas of the Data sub-ontology, especially around Identifier and Report
- Major clean-up of the hierarchy of Accessions
- Clean-up of some Operations (e.g. around Plotting i.e. Visualisation)
- Major improvements of the EDAM documentation
- Improved edam2json generator
- 26 concepts added (3 Data, 20 Format, 2 Operation, 1 Topic)
- 13 concepts deprecated (11 Data, 2 Operation)
- 315 concepts have changed relations|hierarchy (297 Data, of which 282 Identifier; 9 Format, 9 Operation)
- 25 issues fixed and closed (#67, #190, #206, #233, #244, #253, #271, #288, #338, #339, #340, #341, #342, #343, #344, #345, #347, #348, #349, #351, #364, edamontology/edamontologyDocs#1, edamontology/edamontologyDocs#18, edamontology/edam2json#1, edamontology/edam2json#2)
- 4 issues partially fixed but left open for further improvements (#15, #237, #293, #295)
- 0 invalid issues closed ()
- 5 issues closed, wrong project (#174, #299, #334, #354, #355)
- 0 issues closed, won't fix ()
- 2 issues closed, duplicate (#291, #322)
See the improved changelog-detailed.md for the detailed list of semantic changes in the EDAM ontology.
- Numerous Formats added (27, listed in changelog-detailed.md)
- Added Topics Agricultural Science and Metagenomic sequencing
- Improved a number of synonyms and labels (especially among Operations and Topics)
- Fixed a couple of bugs in syntax and
subset
- Binary format defined as strictly disjoint with HTML, XML, and Textual format
- Pre-parsing EDAM into "unrolled" JSON tree structure by edam2json (edamontology/edamontology#326)
- Additional CI validations by edamxpathvalidator:
label
,definition
,consider
,replacedBy
- Added a browsable latest-stable version of EDAM to WebProtégé as EDAM latest stable permalink
- Major improvements of the changelog-detailed.md file listing detailed semantic changes between stable versions of EDAM
- 29 concepts added (27 Format, 2 Topic)
- 7 concepts deprecated (3 Data, 1 Format, 3 Operation)
- 7 concepts have changed relations|hierarchy (3 Format, 4 Operation)
- 22 issues fixed and closed (#122, #143, #183, #197, #198, #199, #200, #211, #234, #261, #262, #272, #285, #286, #304, #327, #329, #330, #331, #332, #333, #337)
- 5 issues partially fixed but left open for further improvements (#128, #164, #265, #271, #326)
- 1 invalid issue closed (#176)
- 0 issues closed, wrong project ()
- 3 issues closed, won't fix (#69, #219, #296)
- 4 issues closed, duplicate (#128, #166, #193, #297)
- Various deprecations, synonyms, and rearrangements in the Operation sub-ontology
- Clean-up of technical artifacts: most newlines and all tab characters removed (from definitions etc.), corrections of wrong
created_in
-s, corrections of deprecated concepts (subset
-s, relations, replacement concepts, etc.) - Clean-up of the git repo
- Changes and additions of concepts related to electron microscopy
- Other updates (e.g. among Identifiers and Formats)
- 5 concepts added (4 Data, 1 Operation)
- 28 concepts deprecated (1 Data, 27 Operation)
- 71 concepts have changed relations|hierarchy (1 Data, 70 Operation)
See changelog-detailed.md for the list of additions, deprecations, and changes in relations between concepts.
- 22 issues fixed and closed (#266, #267, #277, #300, #301, #303, #304, #307, #308, #309, #310, #311, #312, #313, #314, #315, #316, #317, #318, #320, #323, #324)
- 2 issues partially fixed but left open for further improvements (#257, #268)
- 4 invalid issues closed (#208, #209 #298, #325)
- 0 issues closed, wrong project ()
- 0 issues closed, won't fix ()
- 3 issues closed, duplicate (#205, #212, #320)
See changelog-detailed.md for additions, deprecations, and changes in relations between concepts.
EDAM_1.18 includes:
- EDAM is now available in 2 additional formats: CSV and TSV
- http://edamontology.org/EDAM.csv is optimised for use in spreadsheet apps (e.g. Excel)
- http://edamontology.org/EDAM.tsv is optimised for scripting
- Available for EDAM versions 1.16 and newer
- More details in #268
- EDAM versions are now graphically browsable online in WebProtégé
- Free registration required
- Available for EDAM versions 1.16 and newer, including the unstable development version
- Links to the different EDAM versions are at http://edamontology.org/page#Viewing and in README.md
- EDAM releases now have DOIs
- DOI representing all released versions, resolving to the latest: 10.5281/zenodo.822690
- DOI of EDAM version 1.18: 10.5281/zenodo.822691
- various refactoring including concepts deprecations within the Operation sub-ontology to make this simpler and improve usability
- new attribute to provide tips e.g. in bio.tools UI indicating "organisational concepts", i.e. higher-level concepts which primarily structure the hierarchy and are not normally recommended for annotation
- LICENSE added to the EDAM repo (in .md and plain text) for immediate recognition
- 37 concepts have changed place in the hierarchy (37 Operation)
- 16 concepts deprecated (16 Operation)
- 6 issues fixed and closed (#269, #270, #273, #275, #276, #280)
- 3 issues partially fixed but left open for further improvements (#265, #268, #277)
- 0 invalid issues closed ()
- 0 issues closed, wrong project ()
- 1 issue closed, won't fix (#264)
- 0 issues closed, duplicate ()
- 0 issues reopened ()
See the detailed change log for additions, deprecations, and changes in relations between concepts.
EDAM_1.17 includes:
- addition of concepts for sequencing terms from SEQWiki
- addition of Spectral library search
- miscellaneous term requests, bug fixes and other changes received via GitHub
- deprecation of multiple Topics (especially removing organism types from under Model organisms), and Operations
- simplification to Operation branch for sequence feature detection concepts
- new attributes for better provenance on deprecated concepts
oldParent
: attribute for URI of erstwhile parents of now deprecated conceptsdeprecation_comment
: comment as to why the concept was deprecated
- multiple typo and other minor fixes
- 14 concepts added (11 Operation, 2 Topic, 1 Format)
- 24 concepts changed (20 Operation, 4 Topic; minor edits or relational|synonym changes)
- 15 concepts deprecated (8 Topic, 7 Operation)
- 15 issues fixed and closed (#118, #241, #243, #247, #248, #250, #251, #252, #254, #255, #256, #258, #259, #260, edamontology/edamxpathvalidator#4)
- 2 issues partially fixed but left open for further improvements (#128, #268)
- 1 invalid issue closed (#203)
- 1 issue closed, wrong project (#240)
- 1 issue closed, won't fix (#185)
- 2 issues closed, duplicate (#242 fixed in #241, #249 fixed in #118)
- 2 issues reopened (#118 fixed, #130)
See the detailed change log for exact details of changes.
EDAM_1.16 includes:
- concept and term updates and additions, as requested by EDAM users (mostly Bio.Tools registrants), including ones for text mining and natural language processing (BioNLP), and gene expression
- structural improvements and fixes, improvements of synonyms, new attributes for formats
- 40 concepts added (23 Format, 9 Data, 8 Operation)
- 32 concepts changed relations (12 Data, 12 Operation, 7 Format, 1 Topic)
- Added synonyms, updated primary terms and synonyms of various concepts
- Added Wikipedia links to some concepts
- Added documentation, examples, citations, media types, file extensions, information standards, organisations, and used ontologies of a couple of Format concepts
- 7 concepts deprecated (6 Operation, 1 Data)
- 32 issues fixed and closed (#99, #139, #177, #178, #179, #180, #181, #182, #186, #187, #188, #189, #191, #192, #194, #195, #196, #201, #202, #217, #220, #221, #222, #223, #224, #225, #227, #228, #230, #231, #232, #238)
- 4 issues partially fixed and left open for further improvements (#6, #120, #143, #237)
- 0 invalid issues closed ()
- 4 issues closed, wrong project (#161, #170, #192, #229)
- 1 issue closed, won't fix (#226)
- 3 issues closed, duplicate (#126, #167, #175)
See the detailed change log for exact details of changes.
EDAM_1.15 includes:
- new concepts and terms requested by the community, including ones for environmental omics and biodiversity
- structural improvements and fixes, including a clean-up of synonyms (automation of this in the pipeline), updates of formats, and simplification of Visualisation concepts
- 20 concepts added (9 Data, 6 Operation, 5 Format)
- 24 concepts changed (9 Data, 5 Format, 9 Operation, 1 Topic)
- Added citations, media types, and/or file extensions of some Format concepts
- Added and/or cleaned-up synonyms of various concepts
- 5 concepts deprecated (2 Data, 3 Operation)
- 26 issues fixed and closed (#119, #121, #130, #131, #132, #133, #134, #135, #136, #141, #142, #144, #145, #146, #150, #151, #152, #156, #157, #158, #159, #160, #165, #168, #169, #172)
- 3 issues fixed and left open for eventual further discussion (#120, #143, #166)
- 1 invalid issue closed (#149)
- 2 issues closed, wrong project (#153, #154)
- 4 issues closed, won't fix (#125, #147, #148, #155)
- 2 other issues closed (#124, #140)
See the detailed change log for exact details of changes.
EDAM_14 includes:
- many new terms or changes requested by the community (directly on GitHub, or during the last hackathons).
- a new CI process that will be extended over time to monitor and improve the quality of the ontology.
- 14 concepts changed
- 28 concepts added, mainly mass Format amd Data concepts for mass spectrometry, plus some hihg-level concepts for biodiversity.
- 3 concepts deprecated (overly-specific Data concecpts)
See the detailed change log for exact details of changes.
The main focus of EDAM_1.13.owl is:
- a Topic branch simplification in response to requests for a smaller, more usable and thus also more sustainable set of topics
- addition of new concepts requested via GitHub, prioritising addition of new formats from recent de.NBI/EDAM hackathon
- additions and changes for NGS tools packages within Debian Med but not included in SEQanswers Wiki (SEQwiki) (work in progress)
- 23 new concepts (mostly in Format branch) added
- 105 concepts changed (excluding changes/additions to synonyms)
- topic branch restructured for easier navigation
- all deprecated classes are now child of SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
- 60 concepts were deprecated, mostly to greatly simplify the Topics branch
- removal of some overly specialised Operation concepts (work in progress)
- NB: terms, synonyms and comments on deprecated concepts were generally preserved in the parent concepts
- all deprecated concepts now have a suggestion (either consider or replacedBy) for an alternative
- all suggested alternatives for deprecated concepts are now to active (i.e. non-deprecated) concepts
- various other miscellaneous fixes as requested via GitHub
- new 'isdebtags' annotation defined on concepts to annotate a concept is a candidate for tagging Debian Med packages, following the recent Debian Med sprint
See the detailed change log for exact details of changes.
56 new concepts were added and 190 concepts changed.
- 56 new concepts added
- new concepts for mass spec from analysis of msutils.org
- new concepts for NGS from analysis of SEQanswers Wiki
- misc. additions arising from the recent hackathons in Brno, CZ and Amsterdam, NL
- multiple new synonyms
- reorganisation of top-level Operation concepts to make this branch more usable
- reorganisation of top-level Data concepts to make this branch more usable
- 72 concepts were deprecated
- removal of overly-specific Topic concepts that were overlapping with operations
- removal of overly-specific Data and Operation concepts
- removal of some obscure organisational classes (e.g.
<Operation (typed)>
)
- 44 new formats have been added, based on the needs of the Galaxy (http://usegalaxy.org), ReGaTE (https://github.com/bioinfo-center-pasteur-fr/ReGaTE), and Common Workflow Language (https://github.com/common-workflow-language) projects, as part of the BOSC Codefest 2015 (http://open-bio.org/wiki/Codefest-2015.html).
- hasDBXref class annotations added to Topic concepts to provide mapping to all VT Scientific Disciplines in branches 1.1 Mathematics, 1.2 Computer sciences, 1.3 Information sciences, 1.5 Biological sciences, 1.7 Chemical sciences, 3. Medical and Health Sciences, 3.2 Clinical medicine, 3.3 Health sciences and 3.4 Medical biotechnology.
- 9 new Topic concepts from mapping to VT Scientific Disciplines.
- 3 new Format concepts and 2 new Data concepts.
- 'Topic:Informatics' undeprecated and used as placeholder for various information science-related terms.
- 'Topic:Data management' and 'Topic:Computer science" siblings rearranged for conceptual clarity.
- Multiple duplications of synonyms and labels in Topics branch.
Style of Topic concept definitions changed, removing "Topic concerning ...", to make them more usable.
- 20 new concepts in preparation for the ELIXIR Tools and Data Services Registry
- 1 concept deprecation
- Various minor changes (synonyms etc.)
- Revision to provide comprehensive coverage of EBI Tool Topics, Data and Operations
- Removal of fine-grained report (human-readable data) concepts from the Data branch
- Rooting all report concepts under "Data->Report"
- Removal of operation-like concepts from the Topics branch
- Biological concepts (sequence feature-related, pathways and networks, experimental techniques) that were previously modeled under as reports within Data, are now given under Topic
- Simplification of key Data concepts concerning sequences, alignments and signatures (motifs/profiles)
- Many other additions and minor changes
- 107 concept deprecations
- 53 new concepts
-
Additions and changes following from the recent ELIXIR Registry Hackathon (tinyurl.com/RegistryHackathon).
-
About 50 new concepts added
-
9 concept deprecations
-
Many minor changes (new synonyms, minor structural changes etc.)
Bug fixes
-
Fixed synonyms that had URIs as values (1)
(1) for any synonyms that had a URI as value, that URI is now given as a seeAlso annotation instead. It was also necessary to remove all owl:annotatedProperty statements that defined a synyonm, from all "annotations on annotations", i.e. where comments had been added to an annotation on a class, via an owl:Axiom statement.
- A major revision of the EDAM Operation branch to simplify it and improve usability.
- 64 EDAM Operation concept deprecations.
- Top-level Operations now correspond to tool types in the ELIXIR Tools & Data Services Registry: Analysis, Query and retrieval, Visualisation, Deposition, Utility operation.
- Removal of excessively fine-grained Operation concepts.
- Removed "bioinformatics" subset and all corresponding annotations
- Removal of unnecessary "organisational" classes.
- Renaming of concepts (terms) to reflect the common terms in use.
-
A major revision of the EDAM Data branch aiming for simplification and ease of use.
-
117 EDAM Data concept deprecations
-
simplification of Data hierarchy
-
removal of excessively fine-grained Dat concepts
-
removal of out-of-scope Data concepts
-
removal of unnecessary "organisational" classes (near top of Data hierarchy)
-
renaming of concepts (terms) to reflect the common terms in use
-
addition of Data synonyms
Bug fixes
-
fixed many references to deprecated concepts
-
A major revision of the "Topic" sub-ontology expanding this into medical concepts (~60 new topics), following an effort led by Cath Brooksbank with major input from partners from EMTRAIN (European Medicines research TRAINing network) and partners from related ESFRI (European Strategy Forum on Research Infrastructures) projects.
-
Fixing many minor bugs (mostly overlapping or bad synonyms) within topics, and other clean-ups.
-
Removed the lowest tier of the "Topic" branch (mostly by moving terms up a level).
-
Removed all
oboOther:namespace
and some subsets; removed mostoboInOwl:inSubset
for deprecated concepts and added subset 'obsolete'. -
New forms of UniProt identifiers added (regex).
-
Examples of IANA and chemical media types added.
-
A couple of file-/data-handling concepts added (operations and an identifier).
-
An OBO-format version of EDAM has been omitted. We will only resume providing OBO format in case of substantial demand or full automation of the conversion.
-
Documentation files have been substantially updated, e.g. specifying channels for the most welcome community contributions.
And most importantly:
-
EDAM is now being developed at GitHub!!!
Highlights of changes:
- Greatly simplified "Topic" branch.
- Many new terms added for annotating tools in the BioToolsRegistry.
This is the first version of EDAM now that is maintained in OWL format. The OBO-format version is generated from it by processing the OWL file.
Highlights of changes:
-
New references to MeSH.
-
Edits to synonyms.
-
About a dozen new formats.
-
Clean-ups for cleaner viewing in Protege and OLS:
-
Removed problematic "has input" and "has output" axioms.
-
Cleaner annotations on the ontology itself.
Many additions (mostly in "Operation" and some in "Topic" branches) for "next generation" sequencing analysis. EDAM now provides complete coverage of biological domains and bioinformatics methods from SeqWiki. SeqWiki "biological domains" map to EDAM "Topic", SeqWiki "bioinformatics methods" map to EDAM "Operation".
The first release proper.
General changes
-
New style for concept IDs: 4 digit number, subontology namespace / subset("operation", "topic" etc) e.g. "EDAM_operation:0004" (new style) instead of "EDAM:0000004" (old style).
-
New relations ("has function", "is function of") are defined for use by annotators (they are not used in EDAM itself).
-
Synonyms are defined that define related or relevant concepts in many other ontologies and systems. Synonyms are added throughout but especially on top-level concepts ("Operation", "Data", "Format" and "Topic") and relations ("has input", "is input of", "has output", "is output of", "has topic", "is topic of", "has format", "is format of", "has function", "is function of").
-
New concept attributes and modifiers have been added, most importantly:
"{note}" for comments on synonyms and other attributes, e.g.
synonym: "assembly" NARROW [SO:0001248] {note="Perhaps surprisingly, the definition of 'SO:assembly' is narrower than the 'SO:sequence\_assembly'."}
.
"{since}" for annotation of version information, e.g. data of creation or obsoletion of a concept id:
id: EDAM\_data:3165 {since=1.0}
or
is\_obsolete: true {since=1.0}
.
"Format" branch
- 10 new formats.
General changes
- "Identifier" branch moved from top-level to beneath "Data". The "identifier" namespace / definitions have been kept!
- Extensive revision of "Data", "Operation" and "Topic" branches to reduce clutter and ease navigation.
- Bottom-up clean up removing terms that are too fine-grained. Top-down clean up to add or remove terms to aid navigation.
- has_topic (defined on "Data" and "Operation") replaces in_topic.
- Duplicated relationships (child terms erroneously restating the inherited relationships of their parents) have been removed.
"Data" branch
- All "Data" concepts now organised into 4 sub-concepts:
- "Core data" - Data that typically are the primary input or output of a tool or which correspond to entries from the primary (e.g. sequence or structural) biological databases.
- "Identifier" - A short numerical or textual label that identifies (typically uniquely) something such as data, a resource or a biological entity.
- "Parameter" - Typically a simple numerical or string value that controls the operation of a tool.
- "Report" - A human-readable collection of information that is distinct from primary (e.g. sequence or structural) biological data, including free text, annotation about biological entities and phenomena, computer-generated reports of analysis of primary data and metadata.
- "Report" concepts for sequences correspond better (without duplicating) established sequence feature keys.
"Operation" branch
- Fewer concepts, simpler is_a hierarchy
- "has_input" and "has_input" relations defined (on nearly all terms)
"Format" branch
- "is_format_of" relations defined (for nearly all terms)
"Topic" branch
- Improved term names and is_a hierarchy, reflecting whether topics concern a type of data, operation or are more general.
- New "Biological data resources" sub-branch includes common data resource concepts.
- Major revision! Too much to mention, so take a look :)
General changes
- OBO subset definitions added
- Sub-ontologies / namespaces / subsets now are "topic", "data", "format", "identifier", "operation"
- Relation types now are "in_topic", "has_input", "has_output", "is_format_of", "is_identifier_of"
- Many edits (to concepts and "is_a" relations) to improve navigability in all sub-ontologies
New "Identifier" sub-ontology
- Containing concepts which were under Data<-Identifier
- For fine-grained annotation of identifiers of data
"Resource" sub-ontology obsoleted
- Most concepts merged into "Topic" sub-ontology (see below)
- All remaining concepts in "resource" namespace obsoleted
Major revisions to "Topic" sub-ontology
- Concepts redefined as "...general bioinformatics subject or category, such as a field of study, data, processing, analysis or technology."
- For coarse-grained annotation of diverse resources
- Subsumes concepts from old "resource" sub-ontology (see above)
EDAM-specific relations
- Many new relations added (most term statements which should define relations now do)
- Relations defined on parent only (not duplicated in children)
"Format" sub-ontology
- About 50 new formats added
-
Entire "Entity" branch (all terms) made obsolete
-
Root term of "resource" namespace ("Data resource") renamed to "Resource"
-
Root term of "format" namespace ("Data format") renamed to "Format"
-
Corrections (2) removing duplicate IDs
Major revision of "Operation" branch
- immensely simplified top level
- better hierarchy
Major revision of "Data" branch
- simpler top-level
- better hierarchy
- new branches for "Protein data", "Nucleic acid data"
- new terms to aid navigation
- clean up "annotation" and "metadata" concepts
Major revision of "Data format" branch
- better hierarchy
- children of "HTML format" are now (mostly) obsolete
- many new formats added
Simplification of "Topic" branch
- concepts are now more strictly "fields of study"
General changes
- term relations are now defined in one direction only
- more consistent usage of words in term names
- more intuitive term names (child names follow parent in style where possible)
- many term additions and deletions