All Topics  
Dublin Core

 

   Email Print
   Bookmark   Link






 

Dublin Core



 
 
The Dublin Core metadata
Metadata

Metadata is "data about other data", of any sort in any media. An item of metadata may describe an individual datum, or content item, or a collection of data including multiple content items and hierarchical levels, for example a database schema....
 element set is a standard for cross-domain information resource
Resource (Web)

The concept of resource is primitive in the World Wide Web architecture, and is used in the definition of its fundamental elements. The term was first introduced to refer to targets of Uniform Resource Locators , but its definition has been further extended to include the referent of any Uniform Resource Identifier , or Internationalized Reso...
 description. It provides a simple and standardised set of conventions for describing things online in ways that make them easier to find. Dublin Core is widely used to describe digital materials such as video, sound, image, text, and composite media like web pages. Implementations of Dublin Core typically make use of XML and are Resource Description Framework
Resource Description Framework

The Resource Description Framework is a family of World Wide Web Consortium specifications originally designed as a metadata data model. It has come to be used as a general method for conceptual description or modeling, of information that is implemented in web resources; using a variety of syntax formats....
 based.






Discussion
Ask a question about 'Dublin Core'
Start a new discussion about 'Dublin Core'
Answer questions from other users
Full Discussion Forum



Encyclopedia


The Dublin Core metadata
Metadata

Metadata is "data about other data", of any sort in any media. An item of metadata may describe an individual datum, or content item, or a collection of data including multiple content items and hierarchical levels, for example a database schema....
 element set is a standard for cross-domain information resource
Resource (Web)

The concept of resource is primitive in the World Wide Web architecture, and is used in the definition of its fundamental elements. The term was first introduced to refer to targets of Uniform Resource Locators , but its definition has been further extended to include the referent of any Uniform Resource Identifier , or Internationalized Reso...
 description. It provides a simple and standardised set of conventions for describing things online in ways that make them easier to find. Dublin Core is widely used to describe digital materials such as video, sound, image, text, and composite media like web pages. Implementations of Dublin Core typically make use of XML and are Resource Description Framework
Resource Description Framework

The Resource Description Framework is a family of World Wide Web Consortium specifications originally designed as a metadata data model. It has come to be used as a general method for conceptual description or modeling, of information that is implemented in web resources; using a variety of syntax formats....
 based. Dublin Core is defined by ISO in 2003 , and .

Background

The "Dublin" in the name refers to Dublin, Ohio
Dublin, Ohio

Dublin is a city in Franklin County, Ohio, Delaware County, Ohio, and Union County, Ohio counties in the U.S. state of Ohio. The population was 31,392 at the United States Census 2000....
, U.S.
United States

The United States of America is a Federal government constitutional republic comprising U.S. state and a federal district. The country is situated mostly in central North America, where its Contiguous United States and Washington, D.C., the Capital districts and territories, lie between the Pacific Ocean and Atlantic Oceans, Borders of the U...
, where the work originated from an invitational workshop (the "OCLC/NCSA Metadata Workshop") hosted in 1995 by OCLC
OCLC

OCLC Online Computer Library Center, Inc. is a "nonprofit, membership, computer library service and research organization dedicated to the public purpose of furthering access to the world's information and reducing information costs", according to its ....
, a library consortium that is based there. (NCSA is the National Center for Supercomputing Applications
National Center for Supercomputing Applications

The National Center for Supercomputing Applications is a state-federal partnership to develop and deploy national-scale cyberinfrastructure that advances science and engineering....
.) The "Core" refers to the fact that the metadata element set is a basic but expandable "core" list.

The semantics of Dublin Core were established and are maintained by an international, cross-disciplinary group of professionals from librarianship, computer science
Computer science

Computer science is the study of the theoretical foundations of information and computation, and of practical techniques for their implementation and application in computer systems....
, text encoding, museum
Museum

A museum is a "permanent institution in the service of society and of its development, open to the public, which acquires, conserves, researches, communicates and exhibits the tangible and intangible heritage of humanity and its environment, for the purposes of education, study, and entertainment", as defined by the International Coun...
s, and other related fields of scholarship and practice.

The Dublin Core Metadata Initiative () is an organization providing an open forum for the development of interoperable online metadata standards that support a broad range of purposes and business models. DCMI's activities include consensus-driven working groups, global conferences and workshops, standards liaison, and educational efforts to promote widespread acceptance of metadata standards and practices.

Levels of the standard

The Dublin Core standard includes two levels: Simple and Qualified. Simple Dublin Core comprises fifteen elements; Qualified Dublin Core includes three additional elements (Audience, Provenance and RightsHolder), as well as a group of element refinements (also called qualifiers) that refine the semantics of the elements in ways that may be useful in resource discovery.

Simple Dublin Core

The Simple Dublin Core Metadata Element Set (DCMES) consists of 15 metadata elements:

  1. Title
  2. Creator
  3. Subject
  4. Description
  5. Publisher
  6. Contributor
  7. Date
  8. Type
  9. Format
  10. Identifier
  11. Source
  12. Language
  13. Relation
  14. Coverage
  15. Rights


Each Dublin Core element is optional and may be repeated. The DCMI has established standard ways to refine elements and encourage the use of encoding and vocabulary schemes. There is no prescribed order in Dublin Core for presenting or using the elements.

Full information on element definitions and term relationships can be found in the Dublin Core Metadata Registry.

Qualified Dublin Core

Subsequent to the specification of the original 15 elements, an ongoing process to develop exemplary terms extending or refining the Dublin Core Metadata Element Set (DCMES) was begun. The additional terms were identified, generally in working groups of the Dublin Core Metadata Initiative, and judged by the DCMI Usage Board to be in conformance with principles of good practice for the qualification of Dublin Core metadata elements.

Elements refinements make the meaning of an element narrower or more specific. A refined element shares the meaning of the unqualified element, but with a more restricted scope. The guiding principle for the qualification of Dublin Core elements, colloquially known as the Dumb-Down Principle, states that an application that does not understand a specific element refinement term should be able to ignore the qualifier and treat the metadata value as if it were an unqualified (broader) element. While this may result in some loss of specificity, the remaining element value (without the qualifier) should continue to be generally correct and useful for discovery.

In addition to element refinements, Qualified Dublin Core includes a set of recommended encoding schemes, designed to aid in the interpretation of an element value. These schemes include controlled vocabularies and formal notations or parsing rules. A value expressed using an encoding scheme may thus be a token selected from a controlled vocabulary (e.g., a term from a classification system or set of subject headings) or a string formatted in accordance with a formal notation (e.g., "2000-12-31" as the standard expression of a date). If an encoding scheme is not understood by an application, the value may still be useful to a human reader.

DCMI also maintains a small, general vocabulary recommended for use within the element Type. This vocabulary currently consists of 12 terms:

Syntaxes

Syntax choices for DC metadata depend on a number of variables, and "one size fits all" prescriptions rarely apply. When considering an appropriate syntax, it is important to note that Dublin Core concepts and semantics are designed to be syntax independent, are equally applicable in a variety of contexts, as long as the metadata is in a form suitable for interpretation both by machines and by human beings.

The Dublin Core Abstract Model provides a reference model against which particular DC encoding guidelines can be compared, independent of any particular encoding syntax. Such a reference model allows implementors to gain a better understanding of the kinds of descriptions they are trying to encode and facilitates the development of better mappings and translations between different syntaxes.

Some applications

One Document Type Definition
Document Type Definition

Document Type Definition is one of several SGML and XML schema languages, and is also the term used to describe a document or portion thereof that is authored in the DTD language....
 based on Dublin Core is the (OMF) specification. OMF is in turn used by ScrollKeeper
ScrollKeeper

ScrollKeeper is a document cataloging system. It manages documentation metadata, as specified by the Open Source Metadata Framework . ScrollKeeper was used by the GNOME desktop help browser, Yelp, but has since been replaced by Rarian....
, which is used by the GNOME
Gnome

A gnome is a mythical creature characterized by its extremely small size and wiktionary:subterranean lifestyle. The word gnome is derived from the New Latin gnomus....
 desktop and KDE
KDE

KDE is a free software project based around its flagship product, a desktop environment for Unix-like systems. The goal of the project is to provide basic desktop functions and applications for daily needs as well as tools and documentation for developers to write stand-alone applications for the system....
 help browsers and the ScrollServer documentation server. PBCore
PBCore

The PBCore was created by the public broadcasting community in the United States of America for use by public broadcasters and related communities....
 is also based on Dublin Core. The Zope
Zope

Zope is a free software and open source software, object-oriented web application server written in the Python programming language. Zope stands for "Z Object Publishing Environment." It can be almost fully managed with a web-based user interface....
 CMF
Zope Content Management Framework

The Zope Content Management Framework is a series of free software tools for Zope that forms a framework providing many of the key services a content management system would need....
's Metadata products, used by the Plone
Plone (content management system)

Plone is a free software and open source software content management system built on top of the Zope application server. It is suited for an intranet or may be used as a server on the Internet, playing such roles as a document management system and collaborative software collaboration tool....
, ERP5
ERP5

ERP5 is a Open Source Enterprise resource planning system based on the Zope application server. It is mainly developed in the Python and the source code is freely available under the GNU General Public License....
, the Nuxeo CPS
Nuxeo CPS

Nuxeo Collaborative Portal Server is a Free software and Open source content management system written in the Python programming language.It can be used as an intranet and extranet server, as a document publishing system, and as a groupware tool for collaboration between separately located entities....
 Content management system
Content management system

A content management system is a computer application used to create, edit, manage, search and publish various kinds of Content . CMSs are frequently used for storing, controlling, versioning, and publishing industry-specific documentation such as news articles, operators' manuals, technical manuals, sales guides, and marketing brochures....
s, and FedoraCommons also implement Dublin Core.

DCMI also maintains a list of projects using Dublin Core on its website.

See also

  • Metadata registry
    Metadata registry

    A metadata registry is a central location in an organization where metadata definitions are stored and maintained in a controlled method....
  • Metadata Object Description Schema
  • Semantic Web
    Semantic Web

    The Semantic Web is an evolving extension of the World Wide Web in which the semantics of information and services on the web is defined, making it possible for the web to understand and satisfy the requests of people and machines to use the web content....
  • Ontology (computer science)
    Ontology (computer science)

    In computer science and information science, an ontology is a formal representation of a set of concepts within a Domain of discourse and the relationships between those concepts....
  • Open Archives Initiative
    Open Archives Initiative

    The Open Archives Initiative is an attempt to build a "low-barrier interoperability framework" for archives containing digital content . It allows people to harvest Metadata ....
  • Controlled vocabulary
    Controlled vocabulary

    Controlled vocabularies provide a way to organize knowledge for subsequent retrieval. They are used in subject indexing schemes, subject headings, thesauri and taxonomies....
  • Interoperability
    Interoperability

    Interoperability is a property referring to the ability of diverse systems and organizations to work together . The term is often used in a technical systems engineering sense, or alternatively in a broad sense, taking into account social, political, and organizational factors that impact system to system performance....
  • METS
    METS

    The Metadata Encoding and Transmission Standard schema is a standard for encoding descriptive, administrative, and structural metadata regarding objects within a digital library, expressed using the XML schema language of the World Wide Web Consortium....
    , a metadata standard maintained by the Library of Congress
    Library of Congress

    The Library of Congress is the de facto national library of the United States and the research arm of the United States Congress. Located in three buildings in Washington, D.C., it is the largest library in the world by shelf space and holds the largest number of books....
     for the
  • Preservation Metadata: Implementation Strategies (PREMIS)
    Preservation Metadata: Implementation Strategies (PREMIS)

    PREMIS is an international working group concerned with developing metadata for use in digital preservation.In 2003 the Online Computer Library Center and Research Libraries Group established the PREMIS working group, which consisted of a multi-national roster of more than thirty representatives from the cultural, government, and private...


External links

  • (Cover Pages, March 2005)
  • [https://web.inf.unibz.it/courses/images/stories/2005_2006/Digital_Libraries/dini-less-5-6.ppt "Lecture slides about Dublin Core"], by Professor Luca Dini, lecturer at the Free University of Bolzano.