All Topics  
Folksonomy

 

   Email Print
   Bookmark   Link






 

Folksonomy



 
 
Folksonomy (also known as collaborative tagging, social classification, social indexing, and social tagging) is the practice and method of collaboratively creating and managing tags
Tag (metadata)

A tag is a non-hierarchical index term assigned to a piece of information . This kind of metadata helps describe an item and allows it to be found again by browsing or searching....
 to annotate and categorize
Categorization

Categorization is the process in which ideas and objects are recognition, difference and understanding. Categorization implies that objects are grouped into categories, usually for some specific purpose....
 content
Content (media and publishing)

In media production and publishing, content is information and experiences that may provide value for an end-user/audience in specific contexts....
. Folksonomy describes the bottom-up classification systems that emerge from social tagging. In contrast to traditional subject indexing
Subject indexing

Subject indexing is the act of describing a document by keyword to indicate what the document is about or to summarize its content . Indices are constructed, separately, on three distinct levels: terms in a document such as a book; objects in a collection such as a library; and documents within a field of knowledge....
, metadata
Metadata

Metadata is "data about other data", of any sort in any media. An item of metadata may describe an individual datum, or content item, or a collection of data including multiple content items and hierarchical levels, for example a database schema....
 is generated not only by experts but also by creators and consumers of the content.






Discussion
Ask a question about 'Folksonomy'
Start a new discussion about 'Folksonomy'
Answer questions from other users
Full Discussion Forum



Recent Posts









Encyclopedia


Folksonomy (also known as collaborative tagging, social classification, social indexing, and social tagging) is the practice and method of collaboratively creating and managing tags
Tag (metadata)

A tag is a non-hierarchical index term assigned to a piece of information . This kind of metadata helps describe an item and allows it to be found again by browsing or searching....
 to annotate and categorize
Categorization

Categorization is the process in which ideas and objects are recognition, difference and understanding. Categorization implies that objects are grouped into categories, usually for some specific purpose....
 content
Content (media and publishing)

In media production and publishing, content is information and experiences that may provide value for an end-user/audience in specific contexts....
. Folksonomy describes the bottom-up classification systems that emerge from social tagging. In contrast to traditional subject indexing
Subject indexing

Subject indexing is the act of describing a document by keyword to indicate what the document is about or to summarize its content . Indices are constructed, separately, on three distinct levels: terms in a document such as a book; objects in a collection such as a library; and documents within a field of knowledge....
, metadata
Metadata

Metadata is "data about other data", of any sort in any media. An item of metadata may describe an individual datum, or content item, or a collection of data including multiple content items and hierarchical levels, for example a database schema....
 is generated not only by experts but also by creators and consumers of the content. Usually, freely chosen keyword
Keyword

'Keyword' may refer to:* Keyword * Keyword * Keyword * Keyword * ...
s are used instead of a controlled vocabulary
Controlled vocabulary

Controlled vocabularies provide a way to organize knowledge for subsequent retrieval. They are used in subject indexing schemes, subject headings, thesauri and taxonomies....
. Folksonomy (from folk + taxonomy
Taxonomy

Taxonomy is the practice and science of classification. The word comes from the Greek language ', taxis and ', nomos .Taxonomies, or taxonomic schemes, are composed of taxonomic units known as taxa , or kinds of things that are arranged frequently in a hierarchical structure....
) is a user-generated
User-generated content

User-generated content , also known as Consumer generated media or user-created content , refers to various kinds of media content, publicly available, that are produced by end-users....
 taxonomy.

Folksonomies became popular on the Web
World Wide Web

The World Wide Web is a very large set of interlinked hypertext documents accessed via the Internet. With a Web browser, one can view Web pages that may contain writing, s, videos, and other multimedia and navigate between them using hyperlinks....
 around 2004 as part of social software
Social software

Social software encompasses a range of software systems that allow users to interact and share data. This computer-mediated communication has become very popular with social sites like MySpace and Facebook, media sites like Flickr and YouTube, and commercial sites like Amazon.com and eBay....
 applications including social bookmarking
Social bookmarking

Social bookmarking is a method for Internet users to store, organize, search, and manage Internet bookmark of web pages on the Internet with the help of metadata....
 and annotating photographs. Tagging, which is characteristic of Web 2.0
Web 2.0

The term "Web 2.0" refers to a perceived second generation of web development and web design, that aims to facilitate communication, secure information sharing, interoperability, and collaboration on the World Wide Web....
 services, allows non-expert users to collectively classify and find information. Some websites include tag cloud
Tag cloud

A tag cloud or word cloud is a visual depiction of user-generated tag , or simply the word content of a site, used typically to describe the content of web sites....
s as a way to visualize tags in a folksonomy.

Typically, folksonomies are Internet
Internet

The Internet is a global network of interconnected computers, enabling users to share information along multiple channels. Typically, a computer that connects to the Internet can access information from a vast array of available server and other computers by moving information from them to the computer's local memory....
-based, although they are also used in other contexts. Aggregating the tags of many users creates a folksonomy. Aggregation
Aggregation

Aggregation may refer to:* Link aggregation, using multiple Ethernet network cables/ports in parallel to increase link speed* Purchasing aggregation, combining multiple users of a specific material or service to increase the purchasing power of the combined group....
 is the pulling together of all of the tags in an automated way. Folksonomic tagging is intended to make a body of information increasingly easy to search, discover, and navigate over time. A well-developed folksonomy is ideally accessible as a shared vocabulary that is both originated by, and familiar to, its primary users. Two widely cited examples of websites using folksonomic tagging are Flickr
Flickr

Flickr is an and video hosting service website, web services suite, and online community platform. In addition to being a popular Web site for users to share personal photographs, the service is widely used by bloggers as a photo repository....
 and Delicious, although Flickr may not be a good example of folksonomy.

As folksonomies develop in Internet-mediated social environments, users can discover who used a given tag and see the other tags that this person has used. In this way, folksonomy users can discover the tag sets of another user who tends to interpret and tag content in a way that makes sense to them. The result can be a rewarding gain in the user's capacity to find related content (a practice known as "pivot browsing"). Part of the appeal of folksonomy is its inherent subversiveness: when faced with the choice of the search tools that Web sites provide, folksonomies can be seen as a rejection of the search engine
Search engine

A search engine is an information retrieval designed to help find information stored on a computer system. The search results are usually presented in a list and are commonly called hits....
 status quo in favor of tools that are created by the community.

Folksonomy creation and searching tools are not part of the underlying World Wide Web
World Wide Web

The World Wide Web is a very large set of interlinked hypertext documents accessed via the Internet. With a Web browser, one can view Web pages that may contain writing, s, videos, and other multimedia and navigate between them using hyperlinks....
 protocols. Folksonomies arise in Web-based communities where provisions are made at the site level for creating and using tags. These communities are established to enable Web users to label and share user-generated content, such as photographs, or to collaboratively label existing content, such as Web sites, books, works in the scientific and scholarly literatures, and blog entries.

Practical evaluation

Folksonomy is criticized because its lack of terminological control causes it to be more likely to produce unreliable and inconsistent results. If tags are freely chosen (instead of taken from a given vocabulary), synonym
Synonym

Synonyms are different words with identical or very similar meanings. Words that are synonyms are said to be synonymous, and the state of being a synonym is called synonymy....
s (multiple tags for the same concept), homonymy (same tag used with different meaning), and polysemy
Polysemy

Polysemy is the capacity for a sign or signs to have multiple meanings , i.e. a large semantic field. This is a pivotal concept within social sciences, such as media studies and linguistics....
 (same tag with multiple related meanings) are likely to arise, lowering the efficiency of content indexing and searching. Other reasons for meta noise are the lack of stemming
Stemming

Stemming is the process for reducing inflected words to their Word stem, base or root form – generally a written word form. The stem need not be identical to the morphological root of the word; it is usually sufficient that related words map to the same stem, even if this stem is not in itself a valid root....
 (normalization of word inflection
Inflection

In grammar, inflection or inflexion is the way language handles grammatical relations and relational categories such as grammatical tense, grammatical mood, grammatical voice, grammatical aspect, grammatical person, grammatical number, grammatical gender, grammatical case....
s) and the heterogeneity of users and contexts.

Classification systems have several problems: they can be slow to change, they reflect (and reinforce) a particular worldview, they are rooted in the culture and era that created them, and they can be absurd at times. Idiosyncratic folksonomic classification within a clique can especially reinforce pre-existing viewpoints. Folksonomies are routinely generated by people who have spent a great deal of time interacting with the content they tag, and may not properly identify the content's relationship to external items.

For example, items tagged as "Web 2.0" represent seemingly inconsistent and contradictory resources. The lack of a hierarchical or systematic structure for the tagging system makes the terms relevant to what they are describing, but often fails to show their relevancy or relationship to other objects of the same or similar type.

Origin


The term folksonomy is generally attributed to Thomas Vander Wal
Thomas Vander Wal

Thomas Vander Wal is an information architect best known for coining the term "folksonomy". He is also known for initiating the term "Information cloud"....
. It is a portmanteau of the words folk
Folk

English Folk "people" is derived from a Germanic languages noun *fulka meaning "people" or "army" . The English word folk has cognates in most of the other Germanic languages....
 (or folks) and taxonomy
Taxonomy

Taxonomy is the practice and science of classification. The word comes from the Greek language ', taxis and ', nomos .Taxonomies, or taxonomic schemes, are composed of taxonomic units known as taxa , or kinds of things that are arranged frequently in a hierarchical structure....
 that specifically refers to subject indexing systems created within Internet communities. Folksonomy has little to do with taxonomy
Taxonomy

Taxonomy is the practice and science of classification. The word comes from the Greek language ', taxis and ', nomos .Taxonomies, or taxonomic schemes, are composed of taxonomic units known as taxa , or kinds of things that are arranged frequently in a hierarchical structure....
—the latter refers to an ontological, hierarchical way of categorizing, while folksonomy establishes categories (each tag is a category) that are theoretically "equal" to each other (i.e., there is no hierarchy, or parent-child relation between different tags).

Early attempts and experiments include the World Wide Web Consortium
World Wide Web Consortium

The World Wide Web Consortium is the main international standards organization for the World Wide Web . It is arranged as a consortium where member organizations maintain full-time staff for the purpose of working together in the development of standards for the World Wide Web....
's Annotea
Annotea

In metadata, Annotea is an Resource Description Framework standard sponsored by the W3C to enhance document-based collaboration via shared document metadata based on tags, Internet bookmark, and other annotations....
 project with user-generated tags in 2002. According to Vander Wal, a folksonomy is "tagging that works".

Folksonomy is unrelated to folk taxonomy
Folk taxonomy

A folk taxonomy is a vernacular name, and can be contrasted with taxonomy. Folk biological classification is the way peoples make sense of and organize their natural surroundings/the world around them, typically making generous use of form taxa like "shrubs", "bug s", "ducks", "ungulates" and the likes....
, a cultural practice that has been widely documented in anthropological and folkloristic
Folkloristics

Folkloristics is the formal academic study of folklore. What actually constitutes folklore is disputed even within the discipline, but generally folklore focuses on the forms of artistic expression communicated within groups....
 work. Folk taxonomies are culturally supplied, intergenerationally transmitted, and relatively stable classification systems that people in a given culture use to make sense of the entire world around them (not just the Internet
Internet

The Internet is a global network of interconnected computers, enabling users to share information along multiple channels. Typically, a computer that connects to the Internet can access information from a vast array of available server and other computers by moving information from them to the computer's local memory....
).

Folksonomy and the Semantic Web


Folksonomy may hold the key to developing a Semantic Web
Semantic Web

The Semantic Web is an evolving extension of the World Wide Web in which the semantics of information and services on the web is defined, making it possible for the web to understand and satisfy the requests of people and machines to use the web content....
, in which every Web page contains machine-readable metadata
Metadata

Metadata is "data about other data", of any sort in any media. An item of metadata may describe an individual datum, or content item, or a collection of data including multiple content items and hierarchical levels, for example a database schema....
 that describes its content. Such metadata would dramatically improve the precision (the percentage of relevant documents) in search engine
Search engine

A search engine is an information retrieval designed to help find information stored on a computer system. The search results are usually presented in a list and are commonly called hits....
 retrieval lists. However, it is difficult to see how the large and varied community of Web page authors could be persuaded to add metadata
Metadata

Metadata is "data about other data", of any sort in any media. An item of metadata may describe an individual datum, or content item, or a collection of data including multiple content items and hierarchical levels, for example a database schema....
 to their pages in a consistent, reliable way; web authors who wish to do so experience high entry costs because metadata systems are time-consuming to learn and use. For this reason, few Web authors make use of the simple Dublin Core
Dublin Core

The Dublin Core metadata element set is a standard for cross-domain information Resource description. It provides a simple and standardised set of conventions for describing things online in ways that make them easier to find....
 metadata standard, even though the use of Dublin Core meta-tags
Tag (metadata)

A tag is a non-hierarchical index term assigned to a piece of information . This kind of metadata helps describe an item and allows it to be found again by browsing or searching....
 could increase their pages' prominence in search engine retrieval lists
Search engine optimization

Search engine optimization is the process of improving the volume and quality of traffic to a web site from search engines via "natural" Search engine results page....
. In contrast to more formalized, top-down classifications using controlled vocabularies
Controlled vocabulary

Controlled vocabularies provide a way to organize knowledge for subsequent retrieval. They are used in subject indexing schemes, subject headings, thesauri and taxonomies....
, folksonomy is a distributed classification system with low entry costs.

Enterprise


Since folksonomies are user-generated and therefore inexpensive to implement, advocates of folksonomy believe that it provides a useful low-cost alternative to more traditional, institutionally supported taxonomies
Taxonomy

Taxonomy is the practice and science of classification. The word comes from the Greek language ', taxis and ', nomos .Taxonomies, or taxonomic schemes, are composed of taxonomic units known as taxa , or kinds of things that are arranged frequently in a hierarchical structure....
 or controlled vocabularies
Controlled vocabulary

Controlled vocabularies provide a way to organize knowledge for subsequent retrieval. They are used in subject indexing schemes, subject headings, thesauri and taxonomies....
. An employee-generated folksonomy could therefore be seen as an "emergent enterprise taxonomy". Some folksonomy advocates believe that it is useful in facilitating workplace democracy
Workplace democracy

Workplace democracy is the application of democracy in all its forms to the workplace.It usually involves or requires more use of lateral methods like arbitration when workplace disputes arise....
 and the distribution of management
Management

Management in business and human organization activity is simply the act of getting people together to accomplish desired goals. Management comprises planning, organizing, staffing, leadership or directing, and Control an organization or effort for the purpose of accomplishing a goal....
 tasks among people actually doing the work.

However, workplace democracy
Workplace democracy

Workplace democracy is the application of democracy in all its forms to the workplace.It usually involves or requires more use of lateral methods like arbitration when workplace disputes arise....
 may be perceived as a utopian concept at odds with the governing reality of the enterprise, the majority of which exist and thrive as hierarchically-structured corporations not especially aligned to democratically informed governance and decision-making. Also, as a distribution method, the folksonomy may, indeed, facilitate workflow, but it does not guarantee that the information worker will tag and, then, tag consistently, in an unbiased way, and without intentional malice directed at the enterprise.

Folksonomy and top-down taxonomies


Commentators and information architects have contrasted the hierarchical approach of top-down taxonomies with the folksonomy approach. The former approach is prevalent and represented by many practical examples.

One such example is Yahoo!
Yahoo!

Yahoo! Inc. is an United States public company corporation with headquarters in Sunnyvale, California, , and provides Internet services worldwide....
, one of the earliest general directories for content on the Web. Yahoo! and other similar sites organized and presented links under a fixed hierarchy. This approach imposed one set of tags and one sort order, although hyperlinking enabled at least a limited ability to traverse distant nodes in the hierarchy based on related subject matter. Clay Shirky
Clay Shirky

Clay Shirky is an United States writer, consultant and teacher on the social and economic effects of Internet technologies. He teaches New Media as an adjunct professor at New York University's graduate Interactive Telecommunications Program ....
 is one commentator who has offered explanations for why this approach is limited.

Compromise with top-down taxonomies


The differences between taxonomies and folksonomies may have been overestimated. A possible solution to the shortcomings of folksonomies and controlled vocabulary is a collabulary, which can be conceptualized as a compromise between the two: a team of classification experts collaborates with content consumers to create rich, but more systematic content tagging systems. A collabulary arises much the way a folksonomy does, but it is developed in a spirit of collaboration with experts in the field. The result is a system that combines the benefits of folksonomies—low entry costs, a rich vocabulary that is broadly shared and comprehensible by the user base, and the capacity to respond quickly to language change—without the errors that inevitably arise in naive, unsupervised folksonomies.

The ability to group tags, such as that provided by Delicious's "bundles", provides one way for taxonomists to work with an underlying folksonomy. This allows structure to be added without the need for direct collaboration between classification experts and content consumers.

Another possible solution is a taxonomy-directed-folksonomy, which relies on the user interfaces to suggest tags from a formal taxonomy, but allows many users to use their own tags.

Main problems of folksonomy tagging


Four main problems of folksonomy tagging are plurals, polysemy
Polysemy

Polysemy is the capacity for a sign or signs to have multiple meanings , i.e. a large semantic field. This is a pivotal concept within social sciences, such as media studies and linguistics....
, synonymy, and depth (specificity) of tagging. Folksonomy-based systems can employ optional authority control of subject keywords, place, personal, or corporate names and resource titles, by connecting the system to established authority control files or controlled vocabularies using new techniques. A folksonomy-based system needs a controlled vocabulary and a suggestion-based system.

See also

  • Semantic similarity
    Semantic similarity

    Semantic similarity is a concept whereby a set of documents or terms within term lists are assigned a metric space based on the likeness of their Meaning / semantic content....
  • Steve.museum
    Steve.museum

    The steve.museum project is a collaborative effort to improve public access to, and engagement with, art museum collections. To do so, it is exploring the possibilities of user-generated descriptions of works of art, also known as folksonomy....
  • Thesaurus
    Thesaurus

    A thesaurus is a work that contains synonyms and sometimes antonyms, in contrast to a dictionary, which contains definitions and pronunciations....
  • Weak ontology
    Weak ontology

    The term weak ontology has unrelated meanings in computer science and political theory....


External links

  • - Folksonomies as a tool for professional scientific databases.
  • , The New York Times
    The New York Times

    The New York Times is an American daily newspaper published in New York City. The largest metropolitan newspaper in the United States, "The Gray Lady"?named for its staid appearance and style?is regarded as a national newspaper of record....
    , 2005-12-11
  • , Wired News
    Wired News

    Wired News is an online technology news website, formerly known as HotWired, that split off from Wired magazine when the magazine was purchased by Cond? Nast Publishing in the 1990s....
    , 2005-02-01