Skip to content

Glossary, Taxonomy, Ontology

drasil-bot edited this page Jun 13, 2024 · 7 revisions

Warning: Wiki should not be edited directly. Edit the files in the ./wiki/ folder instead and make a PR.

To assist with progress on Drasil, we need to know the best practices for capturing and documenting (scientific and computing) knowledge. We should also take advantage of any previous classifications and organizations of (scientific) knowledge.

Glossary, Dictionary, Taxonomy and Ontology

We should have a good understanding of the definitions, and differences, between glossary, dictionary, taxonomy and ontology. The Data Maven has an interesting take on the topic; another take is the paper Clarity in the usage of terms Ontology, Taxonomy and Classification. The headers below link to the appropriate page on Wikipedia (or W3C), and the definition is adapted from both those sources:

Glossary
A glossary, also known as a vocabulary or clavis, is an alphabetical list of terms in a particular domain of knowledge with the definitions for those terms. Traditionally, a glossary appears at the end of a book and includes terms within that book that are either newly introduced, uncommon, or specialized.
Dictionary
A dictionary is a collection of words in one or more specific languages, often arranged alphabetically, which may include information on definitions, usage, etymologies, pronunciations, translation, etc. It is a lexicographical product which shows inter-relationships among the data.

A broad distinction is made between general and specialized dictionaries. Specialized dictionaries include words in specialist fields, rather than a complete range of words in the language. Lexical items that describe concepts in specific fields are usually called terms instead of words, although there is no consensus whether lexicology and terminology are two different fields of study. Specialized dictionaries are supposed to be onomasiological, first identifying concepts and then establishing the terms used to designate them.

There is also a contrast between prescriptive or descriptive dictionaries; the former reflect what is seen as correct use of the language while the latter reflect recorded actual use.

Taxonomy
Taxonomy is the practice and science of classification. A nice example: a [taxonomy for static analysis](https://www.researchgate.net/publication/251940397_Taxonomy_of_static_code_analysis_tools).
Ontology (philosophy)
Ontology is the philosophical study of being. More broadly, it studies concepts that directly relate to being, in particular becoming, existence, reality, as well as the basic categories of being and their relations. Traditionally listed as a part of the major branch of philosophy known as metaphysics, ontology often deals with questions concerning what entities exist or may be said to exist and how such entities may be grouped, related within a hierarchy, and subdivided according to similarities and differences.
Ontology (computing)
An ontology, is a formal naming and definitions of the types, properties and interrelationships of the entities that fundamentally exist for a particular domain of discourse. It is a practical application of philosophical ontology, with a taxonomy. An ontology compartmentalizes the variables needed for some set of computations and establishes the relationships between them. In the semantic web, ontologies can then be used for inference.

Lists of Ontologies

Literature Review

(to do: give proper names to the links below)

Possibly Relevant

  1. https://pdfs.semanticscholar.org/ecd5/fd6495da16f6b09a063c16029877151c9466.pdf
  2. https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6832486
  3. https://ai.ia.agh.edu.pl/wiki/_media/pl:miw:2009:brankevaluationsikdd2005.pdf
  4. https://pdfs.semanticscholar.org/7753/2e4ecc6ab2f0ff38922d5dc39413a370ad13.pdf
  5. Natalya F Noy and Deborah L McGuinness. Ontology development 101: A guide to creating your first ontology, 2001
  6. http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.41.7834&rep=rep1&type=pdf

Research (Work In Progress)

VanRess2008 (Supplement to the link provided above)

Defines, compares, and discusses the three terms Taxonomy, Ontology, and Classification within the context of a formal paper.

Classification

  • Arrangement in groups or categories

Taxonomy

  • Core property is that it possesses a hierarchical structure.

Ontology

  • Provides a set of concepts from a certain domain that is well specified.
  • [Gruber 1993] provides the definition “a specification of a conceptualization”

Attempts to establish a set of fundamental definitions and notions to the fields of computer science and information technology. This is, in general, an attempt to clarify the relation of computer science and information technology with other scientific fields/disciplines.

This is done with three separate frameworks: a formula, a diagram, and the tetrachotomy (theory, method, language, and tool)

Formula

  • Two fundamental notions which refer to the real world.

Definition. A machine is a physical object which has been made to fulfill a well-defined purpose.

Definition. A property is a physical phenomenon which can be ascribed to a physical object.

  • Two supplementary notions which refer to the fundamental notions.

Definition. A specification is a statement of properties, in some suitable language. It is said to state a property to be obeyed or fulfilled.

Definition. A schema is a description of the composition of part of which specifications are given. (i.e. to develop a solution)

  • Problems accociated with the notions provided.
  1. adequacy is the problem of whether a machine has all desired properties.
  2. meaning is the problem of whether a specification states certain properties.
  3. acceptance is the problem of whether a machine fulfills a specification.
  4. structure is the problem of whether a machine is an appropriate relisation of the schema.
  5. correctness is the problem of whether a schema satisfies a specification

Diagram

Refer to the top of page 6 of http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.41.7834&rep=rep1&type=pdf

Definition. To formalize to write specification that states the exact desired properties.

Definition. To validate to establish that a specification states the desired properties

  • Methods to ensure validation:
  1. insight, when the specification is clear
  2. prototyping, where the prototype is tested or measured
  3. reasoning, where properties are mathematically proven

Definition. To implement the creation of a machine which fulfills the given specification.

Definition. To verify a machine against a specification to ensure that a machine fulfills its given specification.

Definition. To realize the creation of a machine which is the realization of a given schema.

Definition. To verify a machine against a schema to ensure that a machine is the realization of its given schema.

Definition. To design finding a suitable schema which satisfies a given specification.

Definition. To verify a schema against a specification to prove that a schema satisfies its given specification.

Tetrachotomy: a computer scientist's principal task

Separates complex results into the following classes:

Definition. A theory, is a collection of scientific laws/theorems.

Definition. A method, is a collection of notions, rules, and procedures which help to systematically find a solution for a given problem.

Definition. A language or formalism, is a collection of terms with well-defined syntax and semantics.

Definition. A tool, is a computer system by which parts of the activities can be automated.

Clone this wiki locally