Extensible Metadata Platform



 
 
The Adobe Extensible Metadata Platform (XMP) is a standard, created by Adobe Systems Inc., for processing and storing standardized and proprietary metadata.

XMP standardizes the definition, creation, and processing of extensible metadata. Serialized XMP can be embedded into a significant number of popular file formats without breaking their readability by non-XMP-aware applications. Embedding metadata ("the truth is in the file") avoids many problems that occur when metadata is stored separately.
XMP is used in PDF, photography, and photo editing applications.

XMP Data Model

XMP defines a metadata model that can be used with any defined set of metadata items. XMP also defines particular schemas for basic properties useful for recording the history of a resource as it passes through multiple processing steps, from being photographed, scanned, or authored as text, through photo editing steps (such as cropping or color adjustment), to assembly into a final image. XMP allows each software program or device along the way to add its own information to a digital resource, which can then be retained in the final digital file.

XMP is most commonly serialized and stored using a subset of the W3C Resource Description Framework (RDF), which is in turn expressed in XML.
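
As a rough illustration of the RDF/XML serialization described above, the following sketch parses a small hand-written packet and collects its Dublin Core properties using only the Python standard library. The sample packet, its values, and the function name `read_dublin_core` are made up for this example; a real packet is normally also wrapped in `<?xpacket ...?>` processing instructions, which are omitted here.

```python
import xml.etree.ElementTree as ET

# Namespace URIs for the xmpmeta wrapper, RDF, and Dublin Core.
NS = {
    "x": "adobe:ns:meta/",
    "rdf": "http://www.w3.org/1999/02/22-rdf-syntax-ns#",
    "dc": "http://purl.org/dc/elements/1.1/",
}

# Hypothetical sample packet, written by hand for illustration only.
SAMPLE_PACKET = """<x:xmpmeta xmlns:x="adobe:ns:meta/">
  <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
    <rdf:Description rdf:about=""
        xmlns:dc="http://purl.org/dc/elements/1.1/">
      <dc:format>image/jpeg</dc:format>
      <dc:creator>
        <rdf:Seq>
          <rdf:li>Jane Example</rdf:li>
        </rdf:Seq>
      </dc:creator>
      <dc:title>
        <rdf:Alt>
          <rdf:li xml:lang="x-default">Harbour at dusk</rdf:li>
        </rdf:Alt>
      </dc:title>
    </rdf:Description>
  </rdf:RDF>
</x:xmpmeta>"""


def read_dublin_core(packet: str) -> dict:
    """Return {property name: list of values} for Dublin Core elements."""
    root = ET.fromstring(packet)
    dc_prefix = "{" + NS["dc"] + "}"
    result = {}
    for description in root.iterfind(".//rdf:Description", NS):
        for prop in description:
            if not prop.tag.startswith(dc_prefix):
                continue  # skip properties from other schemas
            name = prop.tag[len(dc_prefix):]
            # Array-valued properties hold their items in rdf:li elements;
            # simple properties carry the value as element text.
            items = [li.text for li in prop.iterfind(".//rdf:li", NS)]
            result[name] = items if items else [prop.text]
    return result


if __name__ == "__main__":
    for name, values in read_dublin_core(SAMPLE_PACKET).items():
        print(name, values)
```

This sketch only handles properties written as child elements; RDF also allows simple properties to be written as attributes of `rdf:Description`, which a fuller reader would need to cover.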

Serialization of XMP

Embedding metadata in files allows easy sharing and transfer of files across products, vendors, and platforms without metadata getting lost; embedding avoids a multitude of problems that arise from proprietary, vendor-specific metadata databases.

The most common metadata tags recorded in XMP data are those from the Dublin Core Metadata Initiative, which include title, description, creator, and so on. The standard is designed to be extensible, allowing users to add their own custom types of metadata into the XMP data. XMP generally does not allow binary data types to be embedded. This means that any binary data one wants to carry in XMP, such as thumbnail images, must be encoded in some XML-friendly format, such as Base64.
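
The Base64 step can be sketched as follows. The byte string stands in for real thumbnail data, and the element name and namespace (`http://example.com/ns/thumb/`) are hypothetical placeholders rather than part of any XMP schema; the point is only that the binary payload becomes plain ASCII text that is safe inside XML.

```python
import base64
import xml.etree.ElementTree as ET


def encode_for_xmp(data: bytes) -> str:
    """Encode raw bytes as Base64 text that can sit inside an XML element."""
    return base64.b64encode(data).decode("ascii")


def decode_from_xmp(text: str) -> bytes:
    """Recover the original bytes from Base64 text."""
    return base64.b64decode(text)


# Stand-in bytes; a real application would read an actual thumbnail image.
thumbnail_bytes = b"\xff\xd8\xff\xe0 not a real JPEG \xff\xd9"

# Hypothetical element name and namespace, used only for this example.
element = ET.Element("{http://example.com/ns/thumb/}image")
element.text = encode_for_xmp(thumbnail_bytes)

serialized = ET.tostring(element, encoding="unicode")
```

Base64 output is roughly a third larger than the input, which is one reason embedded thumbnails are usually kept small.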

XMP can be used in PDF and in other formats, such as JPEG, JPEG 2000, GIF, PNG, HTML, TIFF, Adobe Illustrator, PSD, PostScript, and Encapsulated PostScript. In a typical edited JPEG file, XMP information is included alongside Exif and IPTC Information Interchange Model data.

XMP metadata can describe a document as a whole (the "main" metadata), but can also describe parts of a document, such as pages or included images. This architecture makes it possible to retain authorship and rights information about, for example, images included in a published document. Similarly, it permits documents created from several smaller documents to retain the original metadata associated with the parts.

XMP Support and Acceptance


XMP Toolkit

The XMP Toolkit implements metadata handling in two libraries:
  • XMPCore for creation and manipulation of metadata that follows the XMP Data Model
  • XMPFiles for embedding serialized metadata in files, and for retrieving embedded metadata.


Adobe provides the XMP Toolkit free of charge under a BSD license. The Toolkit includes specification and usage documents (PDFs), API documentation (doxygen/javadoc), C++ source code (XMPCore and XMPFiles), and Java source code (currently only XMPCore). XMPFiles is currently available as a C++ implementation on Windows and Mac OS only (not on Unix/Linux, and not in Java).

Various XMP Tools (read/write support)


  • ACDSee Pro - can read/write XMP information for DNG, GIF, JPEG, PNG and TIFF files.
  • - A digital photo system primarily dealing with photo data, which reads/writes/edits IPTC/XMP data in a large variety of image formats (PSD, DNG, JPEG, most RAW formats, PNG, TIFF, plus others). (Windows, .NET 2.0).
  • - Open Source library for manipulating XMP, built on the Adobe XMP SDK
  • by Phil Harvey, open source Perl module or command-line tool.
  • Exiv2 - a C++ library and command-line utility for managing image metadata. It provides fast and easy read and write access to the Exif, IPTC and XMP metadata of images in various formats.
  • - Image viewer (Windows) with XMP embedding and/or sidecar files creation (xmp:Rating, xmp:Label, photoshop:Urgency)
  • - Linux/Gnome/Ubuntu photo manager/editor
  • - can read/write and search XMP in images.
  • - includes extensive XMP support, with XMP editor, ratings, display and extensions provide a mapping from/to XMP
  • - tags and searches for JPEG, RAW, TIFF, PNG and AVI files containing IPTC & XMP headers. Exposes a simpler view of metadata. (Windows, .NET 3.5)
  • iText - Open Source Java library that can read and write XMP embedded in PDF files.
  • - IPTC/XMP editor for JPEG image files, indexing digital photos based on metadata (Windows)
  • - EXIF, IPTC and XMP metadata analysis tool.
  • - Digital photo management program.
  • - Camera importer tool.
  • - Photo Gallery saves tags to XMP (Windows)
  • - Geocoding and XMP support (Windows, .NET 3.0)
  • - Open Source Python library wrapping the Exempi C API.
  • - Open Source Digital Asset Management System that includes support for reading/writing XMP to assets
  • Windows Imaging Component - Microsoft library for working with and processing digital images and image metadata (Windows)
  • Windows Live Photo Gallery - a photo management and sharing application released as a part of Microsoft's Windows Live initiative. It is an upgraded version of Windows Photo Gallery, which is a part of Windows Vista.
  • - DAM (Digital Asset Management) system that allows read/write of XMP across all supported file types.
  • - A GUI (Linux/GNOME) to write XMP metadata from the File Manager


The mainstream IPTC editing tools also support editing of XMP data.

Licensing

Adobe has a trademark on XMP, and retains control over the specification.

Initially, Adobe released source code for the XMP SDK under a license called the ADOBE SYSTEMS INCORPORATED - OPEN SOURCE LICENSE. The compatibility of this license with the GNU General Public License has been questioned. The license is not on the list maintained by the Open Source Initiative and is different from the licenses for most of Adobe's other open source software.

On May 14, 2007, Adobe released the XMP Toolkit SDK under a standard BSD license.

On August 28, 2008, Adobe posted a public patent license for the XMP specification.

History

XMP was first introduced by Adobe in April 2001 as part of the Adobe Acrobat 5.0 software product.

On June 21, 2004, Adobe announced its collaboration with the International Press Telecommunications Council. In July 2004, a working group led by Adobe Systems' Gunar Penikis and IPTC's Michael Steidl was set up, and volunteers were recruited from AFP (Agence France-Presse), Associated Press, ControlledVocabulary.com, IDEAlliance, Mainichi Shimbun, Reuters, and others, to develop the new schema.

The "IPTC Core Schema for XMP" version 1.0 specification was released publicly on March 21, 2005. A set of custom panels for Adobe Photoshop CS can be downloaded from the IPTC. The package includes a User's Guide, example photos with embedded XMP information, the specification document, and an implementation guide for developers. The "User's Guide to the IPTC Core" goes into detail about how each of the fields should be used and is also available directly as a PDF (see external links below). The next version of the Adobe Creative Suite (CS2) included these custom panels as part of its default set.

The Windows Photo Gallery, released with Windows Vista, offers support for the XMP standard, the first time Microsoft has released metadata compatibility beyond Exif.

Location in File Types


For more details you may want to look up the XMP specification which has details on embedding (listed below).
  • TIFF - Tag 700
  • JPEG - Application segment 1 (0xFFE1) with segment header "http://ns.adobe.com/xap/1.0/\x00"
  • JPEG 2000 - 'uuid' atom with UID of 0xBE7ACFCB97A942E89C71999491E3AFAC
  • PNG - inside an 'iTXt' text block with the keyword 'XML:com.adobe.xmp'
  • GIF - as an Application Extension
  • PDF - embedded in a metadata stream contained in a PDF object
  • For file formats that have no support for embedded XMP data, this data can be stored in external .xmp sidecar files.
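
To make the JPEG case concrete, the sketch below walks the segments at the start of a JPEG byte stream and returns the payload of the first APP1 segment that begins with the XMP header given above. The function and helper names are made up for this example, the sample file is synthetic, and the code is a simplified reader: it stops at the start-of-scan marker and does not handle padding bytes or XMP split across several segments.

```python
import struct
from typing import Optional

# Segment header that identifies an APP1 segment as XMP (from the list above).
XMP_HEADER = b"http://ns.adobe.com/xap/1.0/\x00"


def extract_xmp_from_jpeg(data: bytes) -> Optional[bytes]:
    """Return the raw XMP packet bytes from a JPEG stream, or None."""
    if data[:2] != b"\xff\xd8":  # every JPEG starts with the SOI marker
        return None
    pos = 2
    while pos + 4 <= len(data):
        if data[pos] != 0xFF:
            return None  # not at a marker; give up in this simple sketch
        marker = data[pos + 1]
        if marker in (0xDA, 0xD9):  # start of scan or end of image
            return None
        # The 2-byte big-endian length counts itself but not the marker.
        (length,) = struct.unpack(">H", data[pos + 2:pos + 4])
        payload = data[pos + 4:pos + 2 + length]
        if marker == 0xE1 and payload.startswith(XMP_HEADER):
            return payload[len(XMP_HEADER):]
        pos += 2 + length
    return None


def make_segment(marker: int, payload: bytes) -> bytes:
    """Build one JPEG marker segment (helper for the synthetic sample)."""
    return b"\xff" + bytes([marker]) + struct.pack(">H", len(payload) + 2) + payload


# Synthetic sample: SOI, an Exif-style APP1, an XMP APP1, then EOI.
packet = b"<x:xmpmeta xmlns:x='adobe:ns:meta/'/>"
sample = (
    b"\xff\xd8"
    + make_segment(0xE1, b"Exif\x00\x00" + b"\x00" * 8)
    + make_segment(0xE1, XMP_HEADER + packet)
    + b"\xff\xd9"
)
```

Checking the header rather than just the marker matters because Exif data also lives in an APP1 segment, as the sample shows.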


See also

  • IPTC Information Interchange Model
  • Resource Description Framework (RDF)


External links