Tag (metadata)



 
 


A tag is a non-hierarchical keyword or term assigned to a piece of information (such as an internet bookmark, digital image, or computer file). This kind of metadata helps describe an item and allows it to be found again by browsing or searching. Tags are chosen informally and personally by the item's creator or by its viewer, depending on the system. On a website in which many users tag many items, this collection of tags becomes a folksonomy.

Tagging was popularized by websites associated with Web 2.0 and is an important feature of many Web 2.0 services. It is now also part of some desktop software.

History and context


The use of keywords predates the internet and carried over to early websites as a way for publishers to help users find content. In 2003, the social bookmarking website Delicious provided a way for its users to add "tags" to their bookmarks (as a way to help find them later); Delicious also provided browseable aggregated views of the bookmarks of all users featuring a particular tag. Flickr allowed its users to add free-form tags to each of their pictures, constructing flexible and easy metadata that made the pictures highly searchable. The success of Flickr and the influence of Delicious popularized the concept, and other social software websites, such as YouTube, Technorati, and Last.fm, also implemented tagging. "Labels" in Gmail are similar to tags.

Websites that include tags often display collections of tags as tag clouds. A user's tags are useful both to them and to the larger community of the website's users. This collective set of tags is known as a folksonomy.

Tags are a "bottom-up" type of classification, compared to hierarchies, which are "top-down". In a traditional hierarchical system (taxonomy), the designer sets out a limited number of terms to use for classification, and there is one correct way to classify each item. In a tagging system, there are an unlimited number of ways to classify an item, and there is no "wrong" choice. Instead of belonging to one category, an item may have several different tags.

Examples


Within a blog

Many blog systems allow authors to add free-form tags to a post, along with (or instead of) placing the post into categories. For example, a post may display that it has been tagged with baseball and tickets. Each of those tags is usually a web link leading to an index page listing all of the posts associated with that tag. The blog may have a sidebar listing all the tags in use on that blog, with each tag leading to an index page. To reclassify a post, an author edits its list of tags. All connections between posts are automatically tracked and updated by the blog software; there is no need to relocate the page within a complex hierarchy of categories.
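
The per-tag index pages described above behave like an inverted index from tags to posts. The following is a minimal sketch in Python, not the implementation of any particular blog system; the post titles, the posts dictionary, and the function build_tag_index are hypothetical names chosen for illustration.

```python
from collections import defaultdict


def build_tag_index(posts):
    """Map each tag to the sorted list of post titles that carry it."""
    index = defaultdict(set)
    for title, tags in posts.items():
        for tag in tags:
            index[tag].add(title)
    return {tag: sorted(titles) for tag, titles in index.items()}


# Hypothetical posts; each post carries a free-form set of tags.
posts = {
    "Opening day": {"baseball", "tickets"},
    "Season preview": {"baseball"},
    "Concert plans": {"tickets", "music"},
}

tag_index = build_tag_index(posts)
print(tag_index["baseball"])  # ['Opening day', 'Season preview']

# Reclassifying a post only means editing its tag set and rebuilding the index.
posts["Concert plans"] = {"music"}
tag_index = build_tag_index(posts)
print(tag_index["tickets"])  # ['Opening day']
```

Because a post is never stored "inside" a category, changing its tags does not move it anywhere; only the derived index changes.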

For an event

An official tag is a keyword adopted by events and conferences for participants to use in their web publications, such as blog entries, photos of the event, and presentation slides. Search engines can then index them to make relevant materials related to the event searchable in a uniform way. In this case, the tag is part of a controlled vocabulary.

Special types


Triple tags


A triple tag or machine tag uses a special syntax to define extra information about the tag, making it easier or more meaningful for interpretation by a computer program. Triple tags comprise three parts: a namespace, a predicate, and a value. For example, "geo:long=50.123456" is a tag for the geographical longitude coordinate whose value is 50.123456.
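
To illustrate how a program might interpret the namespace:predicate=value form, here is a small Python sketch. It assumes only the shape shown in the example above; the exact grammar accepted by real services (allowed characters, empty values, quoting) is not specified here, and the names TripleTag and parse_triple_tag are hypothetical.

```python
from typing import NamedTuple, Optional


class TripleTag(NamedTuple):
    namespace: str
    predicate: str
    value: str


def parse_triple_tag(tag: str) -> Optional[TripleTag]:
    """Split 'namespace:predicate=value' into parts; return None for plain tags."""
    namespace, colon, rest = tag.partition(":")
    predicate, equals, value = rest.partition("=")
    # Assumption: both separators and non-empty namespace/predicate are required.
    if not (colon and equals and namespace and predicate):
        return None
    return TripleTag(namespace, predicate, value)


tag = parse_triple_tag("geo:long=50.123456")
print(tag)  # TripleTag(namespace='geo', predicate='long', value='50.123456')
print(parse_triple_tag("baseball"))  # None
```

The value is kept as a string; a consumer that knows the geo namespace could then convert it with float(tag.value).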

The triple tag format was first devised for geolicious in November 2004, to map del.icio.us bookmarks, and gained wider acceptance after its adoption by mappr and GeoBloggers to map Flickr photos. In January 2007, Aaron Straup Cope at Flickr introduced the term machine tag as an alternative name for the triple tag, adding some questions and answers on purpose, syntax, and use.

Hash tags

Short messages on services such as Twitter may be tagged by including one or more hashtags: words or phrases prefixed with a hash symbol (#), such as those in:

#pilsner is my favourite kind of #beer
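
As a rough illustration, the hashtags in the message above can be extracted with a regular expression. This Python sketch assumes that a hashtag is simply "#" followed by one or more word characters; real services apply their own, more detailed rules, and the function name extract_hashtags is hypothetical.

```python
import re
from typing import List

# Assumption: a hashtag is '#' followed by one or more word characters.
HASHTAG = re.compile(r"#(\w+)")


def extract_hashtags(message: str) -> List[str]:
    """Return the hashtag words found in a message, without the leading '#'."""
    return HASHTAG.findall(message)


print(extract_hashtags("#pilsner is my favourite kind of #beer"))
# ['pilsner', 'beer']
```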


Advantages and disadvantages


In a tagging system, typically there is no information about the meaning or semantics of each tag. For example, the tag "orange" might refer to the fruit or the color, and this lack of semantic distinction can lead to inappropriate connections between items.

People often select different tags to describe the same item: for example, items related to a version of Apple's operating system may be tagged "Mac OS X", "Leopard", "software", and a variety of other terms. This flexibility allows people to classify their collections of items in the way that they find useful, but the personalized variety of terms can make it difficult for people to find comprehensive information about a subject; in order to catch every relevant item, they may have to search several times using different keywords. Users also have to decide whether each tagged item is actually relevant to what they're looking for.

Larger-scale folksonomies address some of the problems of tagging, as users of tagging systems tend to notice the current use of "tag terms" within these systems, and thus use existing tags in order to easily form connections to related items. In this way, folksonomies collectively develop a partial set of tagging conventions.

One common challenge in tagging systems is that people use both singular and plural forms of words as tags. A user could tag an object with "teacher" or with "teachers", which can make finding similar objects more difficult for both that user and other users in the system.
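
One way a system could surface such near-duplicate tags is to group them under a normalized key. The Python sketch below uses a deliberately naive heuristic (lowercasing and dropping a single trailing "s"); it is an assumption made for illustration, not a technique attributed to any particular tagging system, and the names crude_key and group_variants are hypothetical.

```python
from collections import defaultdict


def crude_key(tag: str) -> str:
    """Very rough normalization: lowercase and drop a single trailing 's'.

    This is a naive heuristic, not real stemming; it mishandles irregular
    plurals ("children") and words that merely end in 's' ("series").
    """
    key = tag.strip().lower()
    if len(key) > 3 and key.endswith("s") and not key.endswith("ss"):
        key = key[:-1]
    return key


def group_variants(tags):
    """Group tags sharing a crude key; keep only groups with several spellings."""
    groups = defaultdict(set)
    for tag in tags:
        groups[crude_key(tag)].add(tag)
    return {key: sorted(variants) for key, variants in groups.items() if len(variants) > 1}


print(group_variants(["teacher", "teachers", "Teachers", "class", "school"]))
# {'teacher': ['Teachers', 'teacher', 'teachers']}
```

A system could use such groups to suggest an existing tag while the user types, rather than silently rewriting what the user entered.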

Syntax

Some tagging systems provide a single text box to enter textual tags. To be able to tokenize the string, a separator must be used. A popular separator is the space character. To enable the use of separators in the tags, a system may allow for higher-level separators (such as quotation marks) or escape characters. Systems can avoid the use of separators by allowing only one tag to be added to each input widget at a time, although this makes adding multiple tags more time-consuming.
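
The tokenizing rules described above (space as separator, quotation marks or escape characters to protect spaces inside a tag) can be sketched with Python's standard shlex module. Using shell-style splitting is an assumption made here for brevity; actual tagging systems define their own rules, and the function name tokenize_tags is hypothetical.

```python
import shlex
from typing import List


def tokenize_tags(text: str) -> List[str]:
    """Split on whitespace, honouring quotation marks and backslash escapes."""
    # shlex.split follows POSIX shell rules; it raises ValueError on an
    # unclosed quotation mark, and it also treats single quotes as quotes.
    return shlex.split(text)


print(tokenize_tags('baseball tickets "new york"'))
# ['baseball', 'tickets', 'new york']
print(tokenize_tags(r'baseball new\ york'))
# ['baseball', 'new york']
```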

Another syntax for use within HTML is to use the attribute rel="tag" to indicate that the linked-to page acts as a tag for the current context. More detail is available in the .
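
As a sketch of how software might read this markup, the following Python example collects the targets of links whose rel attribute contains the token "tag", using the standard html.parser module. The markup, the example.org URLs, and the class name RelTagParser are hypothetical.

```python
from html.parser import HTMLParser


class RelTagParser(HTMLParser):
    """Collect the href of every <a> element whose rel attribute contains 'tag'."""

    def __init__(self):
        super().__init__()
        self.tag_links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        attributes = dict(attrs)
        # rel holds a space-separated list of tokens, e.g. rel="tag nofollow".
        rel_values = (attributes.get("rel") or "").lower().split()
        if "tag" in rel_values and attributes.get("href"):
            self.tag_links.append(attributes["href"])


# Hypothetical markup; the URLs are placeholders.
html = """
<p>Filed under
  <a href="https://example.org/tags/baseball" rel="tag">baseball</a> and
  <a href="https://example.org/tags/tickets" rel="tag nofollow">tickets</a>.
  See also <a href="https://example.org/about">about</a>.
</p>
"""

parser = RelTagParser()
parser.feed(html)
print(parser.tag_links)
# ['https://example.org/tags/baseball', 'https://example.org/tags/tickets']
```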

See also

  • Geotagging
  • Ontology
  • Resource Description Framework (RDF)
  • Social network service
  • Tag editor

