The Importance of Metadata Standards
MetadataData about data. Metadata provides a context for research findings, ideally in a machine-readable format. It enables discovery of data via an electronic interface, and correct use and attribution of findings. Related Guide Standards are sets of topic-specific norms and definitions that guide the collection and documentation of metadata so that the result has consistent collection criteria, nomenclature, and structure. This consistency defines formal metadataMetadata that conforms to a specific standard, with consistent collection criteria, terminology and structure. Related Guide. Creation of formal metadata through adherence to standards is essential for sharing data, for data management within the project, and for future funding.
Formal metadata standards enable interoperabilityThe ability of two or more information systems to exchange metadata with minimal loss of information. Related Guide among similarly formatted databases on local and global scales, greatly enhancing the reach of scientific research through data sharing. In order for data to be fully interoperable, consistent ontologiesA type of relational controlled vocabulary, which provides for categories, relationships, rules and axioms among metadata elements. Typically a hierarchy of classes and terms, an ontology is a machine-readable way of relating metadata terminology. Related Guide must be used throughout data sets—metadata should use the same terms for describing a particular type of scientific inquiry, natural phenomenon, or analytical method. Data sets that do not have consistent terminology between them may still be made interoperable by using ontologies to associate terms with common definitions. (For more information, please see the Ontologies section.)
Interoperability also involves standards of database structure and organization. The combination of vocabularyA set of terms (e.g., words) that are used in a specific community. Related Guide and database standards enables reliable queries during data searches and the use of common analytical models, allowing for collaboration between projects and individuals that are studying similar concerns.
In addition to collaborative benefits, metadata standards also provide specific advantages to the individual project. A good metadata standard results from the broad consultation of experienced researchers in a particular fieldIndividual instance of a metadata label and value pair. For example, "creator: John Doe" is a metadata element. Related Guide. When combined with robust scientific design, field protocolsA strategy for transmitting data between systems. A protocol can be used not only over the internet, between computers, but also between applications running anywhere. Examples: FTP, SNMP, SSH. , and data processing methods, the standard ensures quality data collection and documentation.
An established standard provides the definition of the geographic, environmental, and equipment-related information that should be recorded while in the field. Standards also provide direction for the documentation of data processing, file naming conventions and formats, and a glossaryA type of flat controlled vocabulary containing a list of terms in a particular domain of knowledge with the definitions for those terms. Related Guide of precise definitions for applicable terms. These provisions allow the researcher to maximize the use of tools designed to facilitate data interoperability through quality data collection and organization.
Data collected in accordance with the quality and organizational guidelines set out in an established metadata standard are not only more easily shared, the studies that rely on such data are more easily funded. An increasingly common way to share and seek data on a particular topic is through the use of data clearinghouses or geo-portals that serve as cyber-depots for the sharing of data and collaboration between scientists of related interests.
The consistency that is provided by the use of standards is essential for such sites to function, and adherence to a standard is most likely a prerequisite for participation. Additionally, many funding agencies are now requiring the use of metadata standards in general, or a particular standard, in projects that they support to assure that the data from studies that they fund can be easily integrated into existing databases and GISGeographic Information System projects.