Types of Vocabularies

Metadata vocabularies described by their form (category) and function (type)

As a managed list of acceptable terms, a controlled vocabulary helps to articulate, clarify, and narrow terminology. The managing body will generally choose to present the controlled vocabulary in a specific format, which is both documented and available for wider use. This documentation is of paramount importance for data managers who are looking to implement a given controlled vocabulary. It facilitates understanding of the structure of the controlled vocabulary and thus enables implementation. The documentation might provide additional information about each value, how the valuesMetadata values are the content connected to metadata labels in a metadata element. For example, if the metadata label is "date", the metadata value could be "May 13, 2007". Related Guide relate to one another, and how to use a vocabulary. With this information, data managers can appropriately interpret the managed list of acceptable terms.

Categories of Controlled Vocabularies

There are three broad categories of controlled vocabularies: flat, multi-level and relational vocabulariesManaged list of acceptable terms that makes use of relationships between metadata terms. Relational vocabularies include thesauri, semantic networks and ontologies. Related Guide. Within these three broad categories, there are a variety of specific types of controlled vocabularies - each of which serves a different function within different domains, and among a variety of metadataData about data. Metadata provides a context for research findings, ideally in a machine-readable format. It enables discovery of data via an electronic interface, and correct use and attribution of findings. Related Guide projects. Ultimately, all of these tools provide additional information about the set of terminology employed in a project. As such, they represent an invaluable resource for data gatherers, system managers, and metadata providers.

One word of caution: It is important to understand that the different types of controlled vocabularies rarely exist in a vacuum. In the "real metadata world", the differences between the various controlled vocabularies are often muddied- both intentionally and unintentionally. One controlled vocabulary might fit the definition of more than one type of controlled vocabulary. For example, an ontologyA type of relational controlled vocabulary, which provides for categories, relationships, rules and axioms among metadata elements. Typically a hierarchy of classes and terms, an ontology is a machine-readable way of relating metadata terminology. Related Guide might also have many of the characteristics of a dictionaryIn the context of metadata, a dictionary is a type of controlled flat vocabulary, which provides a list of metadata terms, definitions and additional information within a specific domain. Related Guide. Because of these "gray" areas, the different types of controlled vocabularies are typically just called "vocabularies" or "controlled vocabularies". The vocabulary descriptions that follow are merely tools to help implement, utilize and understand a managed list of acceptable terms.

Types of Controlled Vocabularies

Controlled vocabularies are classified not only by their category (structure, characteristics), but by their type. So, for example, a controlled vocabulary that includes terms used to discover data can be distinguished from a controlled vocabulary that includes terms used to understand and thus use the data. Some controlled vocabularies are inherently designed for computer use (syntactic vocabularies), while others are created for human consumption (semantic vocabularies). It should be noted that sometimes a combination of "type" and "category" of vocabularies are used to refer to a specific controlled vocabulary.

Suggested Citation

, , , , 2009. "Types of Vocabularies." In The MMI Guides: Navigating the World of Marine Metadata. http://marinemetadata.org/guides/vocabs/voctypes. Accessed: 03/18/2010