Semantic Wiki
Encyclopedia
A semantic wiki is a wiki
Wiki
A wiki is a website that allows the creation and editing of any number of interlinked web pages via a web browser using a simplified markup language or a WYSIWYG text editor. Wikis are typically powered by wiki software and are often used collaboratively by multiple users. Examples include...

 that has an underlying model of the knowledge described in its pages. Regular, or syntactic, wikis have structured text and untyped hyperlink
Hyperlink
In computing, a hyperlink is a reference to data that the reader can directly follow, or that is followed automatically. A hyperlink points to a whole document or to a specific element within a document. Hypertext is text with hyperlinks...

s. Semantic wikis, on the other hand, provide the ability to capture or identify information about the data within pages, and the relationships between pages, in ways that can be queried or exported like a database
Database
A database is an organized collection of data for one or more purposes, usually in digital form. The data are typically organized to model relevant aspects of reality , in a way that supports processes requiring this information...

.

Semantic wikis were first proposed in the early 2000s, and began to be implemented seriously around 2005. As of 2011, the best-known semantic wiki software may be Semantic MediaWiki
Semantic MediaWiki
Semantic MediaWiki is an extension to MediaWiki that allows for annotating semantic data within wiki pages, thus turning a wiki that incorporates the extension into a semantic wiki...

, while the best-known standalone semantic wiki may be Freebase
Freebase (database)
Freebase is a large collaborative knowledge base consisting of metadata composed mainly by its community members. It is an online collection of structured data harvested from many sources, including individual 'wiki' contributions. Freebase aims to create a global resource which allows people to...

.

Formal notation

The knowledge model found in a semantic wiki is typically available in a formal language
Formal language
A formal language is a set of words—that is, finite strings of letters, symbols, or tokens that are defined in the language. The set from which these letters are taken is the alphabet over which the language is defined. A formal language is often defined by means of a formal grammar...

, so that machines can process it into an entity-relationship
Entity-relationship model
In software engineering, an entity-relationship model is an abstract and conceptual representation of data. Entity-relationship modeling is a database modeling method, used to produce a type of conceptual schema or semantic data model of a system, often a relational database, and its requirements...

 or relational database
Relational database
A relational database is a database that conforms to relational model theory. The software used in a relational database is called a relational database management system . Colloquial use of the term "relational database" may refer to the RDBMS software, or the relational database itself...

.

The formal notation may be included in the pages themselves by users, as in Semantic MediaWiki
Semantic MediaWiki
Semantic MediaWiki is an extension to MediaWiki that allows for annotating semantic data within wiki pages, thus turning a wiki that incorporates the extension into a semantic wiki...

, or it may be derived from the pages or the page names or the means of linking. For instance, using a specific alternative page name might indicate that a specific type of link was intended. This is especially common in wikis devoted to code projects. It should be easy to examine and fix, e.g. to identify problems in parsing and conventions introduced by newer users.

In either case, providing information through a formal notation allows machines to calculate new facts (e.g. relations between pages) from the facts represented in the knowledge model.

Semantic Web compatibility

The technologies developed by the Semantic Web
Semantic Web
The Semantic Web is a collaborative movement led by the World Wide Web Consortium that promotes common formats for data on the World Wide Web. By encouraging the inclusion of semantic content in web pages, the Semantic Web aims at converting the current web of unstructured documents into a "web of...

 community provide one basis for formal reasoning about the knowledge model that is developed by importing this data. However, there are also a wide array of technologies that work on ERD
Entity-relationship model
In software engineering, an entity-relationship model is an abstract and conceptual representation of data. Entity-relationship modeling is a database modeling method, used to produce a type of conceptual schema or semantic data model of a system, often a relational database, and its requirements...

 or relational data.

Example

Imagine a semantic wiki devoted to food
Food
Food is any substance consumed to provide nutritional support for the body. It is usually of plant or animal origin, and contains essential nutrients, such as carbohydrates, fats, proteins, vitamins, or minerals...

. The page for an apple
Apple
The apple is the pomaceous fruit of the apple tree, species Malus domestica in the rose family . It is one of the most widely cultivated tree fruits, and the most widely known of the many members of genus Malus that are used by humans. Apple grow on small, deciduous trees that blossom in the spring...

 would contain, in addition to standard text information, some machine-readable or at least machine-intuitable semantic data. The most basic kind of data would be that an apple is a kind of fruit
Fruit
In broad terms, a fruit is a structure of a plant that contains its seeds.The term has different meanings dependent on context. In non-technical usage, such as food preparation, fruit normally means the fleshy seed-associated structures of certain plants that are sweet and edible in the raw state,...

 - what's known as an inheritance
Inheritance (computer science)
In object-oriented programming , inheritance is a way to reuse code of existing objects, establish a subtype from an existing object, or both, depending upon programming language support...

 relationship. The wiki would thus be able to automatically generate a list of fruits, simply by listing all pages that are tagged as being of type "fruit." Further semantic tags in the "apple" page could indicate other data about apples, including their possible colors and sizes, nutritional information and serving suggestions, and so on.

If the wiki exports all this data in RDF
Resource Description Framework
The Resource Description Framework is a family of World Wide Web Consortium specifications originally designed as a metadata data model...

 or a similar format, it can then be queried in a similar way to a database - so that an external user or site could, for instance, request a list of all fruits that are red and can be baked in a pie.

History

In the 1980s, before the Web began, there were several technologies to process typed links between collectively maintained hypertext pages, such as NoteCards
NoteCards
NoteCards was a hypertext personal knowledge basesystem developed at Xerox PARC by Randall Trigg, Frank Halasz and Thomas Moran in 1984. NoteCards developed after Trigg became the first to write a Ph.D. thesis on hypertext while at the University of Maryland College Park in 1983...

, KMS
KMS (hypertext)
KMS,‭ ‬an abbreviation of Knowledge Management System,‭ ‬was a commercial second generation hypermedia system, originally created as a successor for the early hypermedia system ZOG...

 and gIBIS. Extensive research was published on these tools by the collaboration software, computer-mediated communication
Computer-mediated communication
Computer-mediated communication is defined as any communicative transaction that occurs through the use of two or more networked computers...

, hypertext
Hypertext
Hypertext is text displayed on a computer or other electronic device with references to other text that the reader can immediately access, usually by a mouse click or keypress sequence. Apart from running text, hypertext may contain tables, images and other presentational devices. Hypertext is the...

, and computer supported cooperative work
Computer supported cooperative work
The term computer-supported cooperative work was first coined by Irene Greif and Paul M. Cashman in 1984, at a workshop attended by individuals interested in using technology to support people in their work. At about this same time, in 1987 Dr...

 communities.

The first known usage of the term "Semantic Wiki" was a Usenet
Usenet
Usenet is a worldwide distributed Internet discussion system. It developed from the general purpose UUCP architecture of the same name.Duke University graduate students Tom Truscott and Jim Ellis conceived the idea in 1979 and it was established in 1980...

 posting by Andy Dingley in January 2001. Its first known appearance in a technical paper was in a 2003 paper by Austrian researcher Leo Sauermann.

Many of the existing semantic wiki applications were started in the mid-2000s, including Semantic MediaWiki (2005), Freebase
Freebase (database)
Freebase is a large collaborative knowledge base consisting of metadata composed mainly by its community members. It is an online collection of structured data harvested from many sources, including individual 'wiki' contributions. Freebase aims to create a global resource which allows people to...

 (2005) and OntoWiki
OntoWiki
OntoWiki is a free, open-source semantic wiki application, meant to serve as an ontology editor and a knowledge acquisition system. It is a web-based application written in PHP and using either a MySQL database or a Virtuoso triple store...

 (2006).

June 2006 saw the first meeting dedicated to semantic wikis, "SemWiki2006", co-located with the European Semantic Web Conference in Montenegro
Montenegro
Montenegro Montenegrin: Crna Gora Црна Гора , meaning "Black Mountain") is a country located in Southeastern Europe. It has a coast on the Adriatic Sea to the south-west and is bordered by Croatia to the west, Bosnia and Herzegovina to the northwest, Serbia to the northeast and Albania to the...

.

The site DBpedia
DBpedia
DBpedia is a project aiming to extract structured content from the information created as part of the Wikipedia project. This structured information is then made available on the World Wide Web. DBpedia allows users to query relationships and properties associated with Wikipedia resources,...

, launched in 2007, though not a semantic wiki, publishes structured data from Wikipedia
Wikipedia
Wikipedia is a free, web-based, collaborative, multilingual encyclopedia project supported by the non-profit Wikimedia Foundation. Its 20 million articles have been written collaboratively by volunteers around the world. Almost all of its articles can be edited by anyone with access to the site,...

 in RDF form, which enables semantic querying of Wikipedia's data.

In March 2008, Wikia
Wikia
Wikia is a free web hosting service for wikis . It is normally free of charge for readers and editors, deriving most of its income from advertising, and publishes all user-provided text under copyleft licenses. Wikia hosts several hundred thousand wikis using the open-source wiki software MediaWiki...

, the world's largest wiki farm
Wiki farm
A wiki hosting service or wiki farm is a server or an array of servers that offer users tools to simplify the creation and development of individual, independent wikis...

, made the use of Semantic MediaWiki available for all their wikis, thus allowing all the wikis they hosted to function as semantic wikis.

In July 2010, Google
Google
Google Inc. is an American multinational public corporation invested in Internet search, cloud computing, and advertising technologies. Google hosts and develops a number of Internet-based services and products, and generates profit primarily from advertising through its AdWords program...

 purchased Metaweb
Metaweb
Metaweb Technologies, Inc. was a United States company based in San Francisco that developed Freebase, described as an "open, shared database of the world's knowledge". The company was founded by Danny Hillis in July, 2005, and operated in stealth mode until 2007. Metaweb was acquired by Google in...

, the company behind Freebase.

Semantic wiki software

There are a number of wiki applications that provide semantic functionality. Some standalone semantic wiki applications exist, including OntoWiki
OntoWiki
OntoWiki is a free, open-source semantic wiki application, meant to serve as an ontology editor and a knowledge acquisition system. It is a web-based application written in PHP and using either a MySQL database or a Virtuoso triple store...

 (written in PHP
PHP
PHP is a general-purpose server-side scripting language originally designed for web development to produce dynamic web pages. For this purpose, PHP code is embedded into the HTML source document and interpreted by a web server with a PHP processor module, which generates the web page document...

) and Wagn (written in Ruby on Rails
Ruby on Rails
Ruby on Rails, often shortened to Rails or RoR, is an open source web application framework for the Ruby programming language.-History:...

).

Other semantic wiki software is structured as extensions or plugins to standard wiki software. The best-known of these is Semantic MediaWiki
Semantic MediaWiki
Semantic MediaWiki is an extension to MediaWiki that allows for annotating semantic data within wiki pages, thus turning a wiki that incorporates the extension into a semantic wiki...

, an extension to MediaWiki
MediaWiki
MediaWiki is a popular free web-based wiki software application. Developed by the Wikimedia Foundation, it is used to run all of its projects, including Wikipedia, Wiktionary and Wikinews. Numerous other wikis around the world also use it to power their websites...

. SMW+
SMW+
SMW+ is an open source software bundle composed of the wiki application MediaWiki along with a number of its extensions. It is produced by the German software company Ontoprise GmbH. SMW+'s extensions include, most notably, Semantic MediaWiki and the...

 is a notable software package that includes Semantic MediaWiki.

Some standard wiki engines also include the ability to add typed, semantic links to pages, including PhpWiki
PhpWiki
PhpWiki is a web-based wiki software application.It began as a clone of WikiWikiWeb and was the first wiki written in PHP.PhpWiki has been used to edit and format paper books for publication.-History:...

 and Tiki Wiki CMS Groupware.

Freebase
Freebase
Freebase is a large collaborative knowledge base consisting of metadata composed mainly by its community members. It is an online collection of structured data harvested from many sources, including individual 'wiki' contributions. Freebase aims to create a global resource which allows people to...

, though not billed as a wiki engine, is a web database with semantic-wiki-like properties.

Classification

Although adding link-types to a wiki is straightforward, the number of link types can often be quite large. The Cyc
Cyc
Cyc is an artificial intelligence project that attempts to assemble a comprehensive ontology and knowledge base of everyday common sense knowledge, with the goal of enabling AI applications to perform human-like reasoning....

 system has over 15,000 different types of links. In order to create the right type of link a set of questions is often used to create the correct link type. Rules can also be added to check that the destination page is appropriate for that link type. For example a link of "capital_of" might only be appropriate when linking a city to a region or a country.

Argument

Most early wiki-like technologies, specifically gIBIS
Issue-Based Information System
Issue-Based Information System was invented by Werner Kunz and Horst Rittel as an argumentation-based approach to tackling wicked problems - complex, ill-defined problems that involve multiple stakeholders....

 and NoteCards
NoteCards
NoteCards was a hypertext personal knowledge basesystem developed at Xerox PARC by Randall Trigg, Frank Halasz and Thomas Moran in 1984. NoteCards developed after Trigg became the first to write a Ph.D. thesis on hypertext while at the University of Maryland College Park in 1983...

, had direct support for online deliberation
Online deliberation
Online deliberation is a term associated with an emerging body of practice, research, and software dedicated to fostering serious, purposive discussion over the Internet...

, often including argumentation frameworks such as logic trees or more elaborated issue/position/argument structures that were flexible enough to support meeting agendas and support actual online meetings.

In these uses, the link types are deliberately limited to simplify presentation and also to avoid anyone gaining advantage in the debate or meeting by knowing the types better (often thought to be a primary reason why ordinary users strongly resist typed links). For instance, a link to an assertion that "contradicts" another or which "supports" it.

The challenge is to create new adversarial process
Adversarial process
An adversarial process is one that supports conflicting one-sided positions held by individuals, groups or entire societies, as inputs into the conflict resolution situation, typically with rewards for prevailing in the outcome...

 designs that deal with the power imbalances and rapid pace of change in online forums, and support these with new tools. The open politics theory, for instance, developed some of these for use in politics.

Common features

Semantic wikis vary in their degree of formalization. Semantics may be either included in, or placed separately from, the wiki markup. Users may be supported when adding this content, using forms or autocompletion, or more complex proposal generation or consistency checks. The representation language may be wiki syntax, a standard language like RDF
Resource Description Framework
The Resource Description Framework is a family of World Wide Web Consortium specifications originally designed as a metadata data model...

 or OWL
Web Ontology Language
The Web Ontology Language is a family of knowledge representation languages for authoring ontologies.The languages are characterised by formal semantics and RDF/XML-based serializations for the Semantic Web...

, or some database directly populated by the tool that withdraws the semantics from the raw data.
Separate versioning support or correction editing for the formalized content may also be provided. Provenance support for the formalized content, that is, tagging the author of the data separately from the data itself, varies.

What data can get formalized also varies. One may be able to specify types for pages, categories or paragraphs or sentences (the latter features were more common in pre-web systems). Links are usually also typed. The source, property and target may be determined by some defaults, e.g. in Semantic MediaWiki the source is always the current page.

Reflexivity also varies. More reflexive user interfaces provide strong ontology support from within the wiki, and allow it to be loaded, saved, created and changed.

Some wikis inherit their ontology entirely from a pre-existing strong ontology like Cyc or SKOS
SKOS
Simple Knowledge Organization System is a family of formal languages designed for representation of thesauri, classification schemes, taxonomies, subject-heading systems, or any other type of structured controlled vocabulary. SKOS is built upon RDF and RDFS, and its main objective is to enable...

, while, on the other extreme, in other semantic wikis the entire ontology is generated by users.

Conventional, non-semantic wikis typically still have ways for users to express data and metadata, typically by tagging, categorizing and using namespace
Namespace
In general, a namespace is a container that provides context for the identifiers it holds, and allows the disambiguation of homonym identifiers residing in different namespaces....

s. In semantic wikis, these features still typically exist, but integrated these with other semantic declarations, and sometimes with their use restricted.

Some semantic wikis provide reasoning
Semantic reasoner
A semantic reasoner, reasoning engine, rules engine, or simply a reasoner, is a piece of software able to infer logical consequences from a set of asserted facts or axioms. The notion of a semantic reasoner generalizes that of an inference engine, by providing a richer set of mechanisms to work with...

 support, using a variety of engines. Such reasoning may require that all instance data comply with the ontologies.

Most semantic wikis have simple querying support (such as searching for all triples with a certain subject, predicate, object), but the degree of advanced query support varies; some semantic wikis provide querying in standard languages like SPARQL
SPARQL
SPARQL is an RDF query language; its name is an acronym that stands for SPARQL Protocol and RDF Query Language. It was made a standard by the RDF Data Access Working Group of the World Wide Web Consortium, and considered as one of the key technologies of semantic web...

, while others instead provide a custom language. User interface support to construct these also varies. Visualization of the links especially may be supported.

Many semantic wikis can display the relationships between pages, or other data such as dates, geographical coordinates and number values, in various formats, such as graphs, tables, charts, calendars and maps.

See also

  • Microformats
    Microformats
    A microformat is a web-based approach to semantic markup which seeks to re-use existing HTML/XHTML tags to convey metadata and other attributes in web pages and other contexts that support HTML, such as RSS...

  • Ontology
    Ontology (computer science)
    In computer science and information science, an ontology formally represents knowledge as a set of concepts within a domain, and the relationships between those concepts. It can be used to reason about the entities within that domain and may be used to describe the domain.In theory, an ontology is...

  • RDF
    Resource Description Framework
    The Resource Description Framework is a family of World Wide Web Consortium specifications originally designed as a metadata data model...

    , RDFS, OWL
    Web Ontology Language
    The Web Ontology Language is a family of knowledge representation languages for authoring ontologies.The languages are characterised by formal semantics and RDF/XML-based serializations for the Semantic Web...

    , SPARQL
    SPARQL
    SPARQL is an RDF query language; its name is an acronym that stands for SPARQL Protocol and RDF Query Language. It was made a standard by the RDF Data Access Working Group of the World Wide Web Consortium, and considered as one of the key technologies of semantic web...

  • Business Intelligence 2.0
    Business Intelligence 2.0
    Business Intelligence 2.0 is a term that refers to new tools and software for business intelligence, beginning in the mid-2000s, that enable, among other things, dynamic querying of real-time corporate data by employees, and a more web- and browser-based approached to such data, as opposed to the...

     (BI 2.0)
  • Wikis:
    • Familypedia
      Familypedia
      Familypedia is a free-to-use public wiki on family history and genealogy. It is hosted by Wikia, a wiki farm. Familypedia is a collaborative effort by amateur genealogists and family historians, with over 30,000 unique people having their own pages among 100,000 articles , with over 114,000 other...

    • Freebase
      Freebase
      Freebase is a large collaborative knowledge base consisting of metadata composed mainly by its community members. It is an online collection of structured data harvested from many sources, including individual 'wiki' contributions. Freebase aims to create a global resource which allows people to...

    • Gardenology.org
      Gardenology.org
      Gardenology.org is a wiki, launched in 2007, meant to serve as a free, "complete plant and garden wiki encyclopedia." There are over 19,000 articles on the site, and a plant search box...

    • Math Images Project
      Math Images Project
      The Math Images Project is a wiki collaboration between Swarthmore College, the Math Forum at Drexel University, and the National Science Digital Library. The project aims to introduce the public to mathematics through beautiful and intriguing images found throughout the fields of math...

    • Metavid
      Metavid
      Metavid is a free-software wiki-based community archive project for audio video media. The site hosts public domain US legislative footage. It was started as a Digital Arts/New Media MFA thesis project of Michael Dale and Abram Stern under the advisement of Professor Warren Sack in late 2005 at the...

    • NeuroLex
      NeuroLex
      NeuroLex is a dynamic lexicon of neuroscience concepts. It is a structured as a semantic wiki, using Semantic MediaWiki. NeuroLex is supported by the Neuroscience Information Framework project.- Overview :...

    • OpenEI
      OpenEI
      Open Energy Information is an online platform that "links energy communities and decision makers with valuable energy data, information, analyses, tools, images, maps, and other resources." It was launched by the United States Department of Energy on December 9,...

    • SKYbrary
      SKYbrary
      SKYbrary is a wiki created by the European Organisation for the Safety of Air Navigation, International Civil Aviation Organization, and the Flight Safety Foundation to create a comprehensive source of aviation safety information freely available online. It was launched in May 2008 on a platform...

    • SNPedia
      SNPedia
      SNPedia is a wiki-based bioinformatics web site that serves as a database of single nucleotide polymorphisms . Each article on a SNP provides a short description, links to scientific articles and personal genomics web sites, as well as microarray information about that SNP...


External links

  • Semantic wiki article at SemanticWeb.org
  • Semantic wiki projects - contains a list of active, defunct and proposed semantic wiki applications
  • SemWiki.org – homepage of the semantic wiki community and workshop series
  • SemanticWiki mini-series - homepage of the virtual mini-series jointly organized by FZI Karlsruhe, Mayo Clinic, Ontolog, RPI Tetherless World Constellation and Salzburg Research, Austria between Sep-2008 and Mar-2009.
The source of this article is wikipedia, the free encyclopedia.  The text of this article is licensed under the GFDL.
 
x
OK