Existing ontologies
Pointers to existing ontologies
Just what is Ontology, Anyway
Ontology Design Patterns
Generic
Dublin Core
http://dublincore.org/documents/usageguide/http://dublincore.org/documents/usageguide/elements.shtml
http://rdfs.org/ns/void-guide voiD is a vocabulary and a set of instructions that enables the discovery and usage of linked datasets. A dataset is a collection of data, published and maintained by a single provider, available as RDF, and accessible, for example, through dereferenceable HTTP URIs or a SPARQL endpoint.
Chemistry, biology
The Open Biomedical Ontologies
The OBO Foundry is a collaborative experiment involving developers of science-based ontologies who are establishing a set of principles forontology development with the goal of creating a suite of orthogonal interoperable reference ontologies in the biomedical domain.
The Ontology Lookup Service
http://www.ebi.ac.uk/ontology-lookup/
The OLS provides a web service interface to query multiple ontologies from a single location with a unified output format. The OLS can integrate any ontology available in the Open Biomedical Ontology (OBO) format.
NCBO BioPortal
http://bioportal.bioontology.org/
Use BioPortal to access and share ontologies that are actively used in biomedical communities.
NCI Enterprise Vocabulary Services (EVS)
http://www.obofoundry.org/cgi-bin/detail.cgi?id=ncithesaurus
NCI EVS provides a set of services and resources, including NCI Thesaurus and NCI Metathesaurus, that facilitate the standardization of terminology across the Institute and the larger biomedical community.
The NCI Thesaurus is an ontology-like vocabulary that includes broad coverage of the cancer domain, including cancer related diseases, findings and abnormalities; anatomy; agents, drugs and chemicals; genes and gene products and so on. In certain areas, like cancer diseases and combination chemotherapies, it provides the most granular and consistent terminology available. It combines terminology from numerous cancer research related domains, and provides a way to integrate or link these kinds of information together through semantic relationships. The Thesaurus currently contains over 34,000 concepts, structured into 20 taxonomic trees.
NTP Pathology Code Tables
MESH - Medical subjects headings
High throughput data
MGED Society standards
http://www.mged.org/
MicroArray and Gene Expression - MAGE
http://www.mged.org/Workgroups/MAGE/mage.html
EMTREE Thesaurus
EMTREE, the Life Science Thesaurus, is a sophisticated, hierarchically structured controlled vocabulary arranged in a cascading "tree" and used for subject indexing in EMBASE, EMBASE.com, and customer-specific solutions. The printed EMTREE includes more than 46,000 drug and medical terms, 10,000 numeric codes and over 200,000 synonyms. It incorporates MeSH terms used by the National Library of Medicine, as well as almost 20,000 CAS registry numbers assigned by the Chemical Abstracts Service. In 2004 a significant number of general biological terms have been added. This three-volume set is an indispensable tool for both novice and experienced searchers. The 2004 edition of the EMTREE Life Science Thesaurus incorporates several enhancements, including over 1,000 new general biological terms.
Chemical Entities of Biological Interest (ChEBI)
Freely available dictionary of molecular entities focused on ‘small’ chemical compounds. The term ‘molecular entity’ refers to any constitutionally or isotopically distinct atom, molecule, ion, ion pair, radical, radical ion, complex, conformer, etc., identifiable as a separately distinguishable entity. The molecular entities in question are either products of nature or synthetic products used to intervene in the processes of living organisms.
ChEBI encompasses an ontological classification, whereby the relationships between molecular entities or classes of entities and their parents and/or children are specified.
ChEBI uses nomenclature, symbolism and terminology endorsed by the following international scientific bodies:
- International Union of Pure and Applied Chemistry (IUPAC)
- Nomenclature Committee of the International Union of Biochemistry and Molecular Biology (NC-IUBMB)
ChemAxiom
A set of extensible ontology modules for the domain of chemistry. Modules of potential interest to OpenTox :
ChemAxiomDomain : Fundamental concepts in the chemical domain, like substance, molecule;
ChemAxiomDescriptor : An ontology of descriptors for computational chemistry and QSAR/QSPR;
ChemAxiomProp : An ontology of chemical and materials properties. It contains a list of over 150 properties togethe with definitions of their symbols
Nico Adams, Edward O. Cannon & Peter Murray-Rust, ChemAxiom – An Ontological Framework for Chemistry in Science
QSAR Descriptors and Algorithms
Blue Obelisk descriptor dictionary
Statistics,data mining
The Statistical Core Vocabulary (scovo)
http://sw.joanneum.at/scovo/schema.html
[RDF-Schema] vocabulary for representing statistical data on the Web
The Statistical Core Vocabulary (scovo) lets you describe statistical information on the Web of Data. Three basic concepts are used:
Data Mining
Data mining ontology (DMO_ http://www.e-lico.eu/?q=dmo
OntoDM http://kt.ijs.si/panovp/OntoDM
KDDOnto KDDONTO: An Ontology for Discovery and Composition of KDD Algorithms . Ontology-driven KDD Process Composition
Measurement units
https://forge.morfeo-project.org/wiki_en/index.php/Measurement_Units_Ontology
Formats, available software
Software
- Protege ontology editor
- Tabulator - Firefox plugin allowing to browse RDF data on the web
- Ontology browser (online) OwlSight
- Manchester Ontology Browser
- Redland is a set of free software C libraries that provide support for the Resource Description Framework (RDF).
- Raptor RDF Parser Library Language bindings to Perl, PHP, Python and Ruby when used via Redland
- http://esw.w3.org/topic/SemanticWebTools
The JSR 275 Java packages for modeling and working with measurement units. The JSR-275 specifies one or more Java packages for the programmatic handling of physical quantities and their expression as numbers of units.
- http://rdfstats.sourceforge.net/ RDFStats generates statistics for datasets behind SPARQL-endpoints and RDF documents.
Linked Data
Tim Berners-Lee outlined four principles of Linked Data in his Design Issues: Linked Data note, paraphrased along the following lines:
- Use URIs to identify things.
- Use HTTP URIs so that these things can be referred to and looked up ("dereference") by people and user agents.
- Provide useful information (i.e., a structured description — metadata) about the thing when its URI is dereferenced.
- Include links to other, related URIs in the exposed data to improve discovery of other related information on the Web.
Tim Berners-Lee gave a presentation on Linked Data at the TED 2009 conference.
- Linked Data at Wikipedia
- Linked Data - Connect Distributed Data across the Web http://linkeddata.org/
- Interlinking Web data http://sameas.org/