All Topics  
Uniform Resource Locator

 

   Email Print
   Bookmark   Link






 

Uniform Resource Locator



 
 
In computing
Information technology

Information technology , as defined by the Information Technology Association of America , is "the study, design, development, implementation, support or management of computer-based information systems, particularly software applications and computer hardware." IT deals with the use of electronic computers and computer software to data conv...
, a Uniform Resource Locator (URL) is a type of Uniform Resource Identifier
Uniform Resource Identifier

In Information technology, a Uniform Resource Identifier is a Character string of Character s used to Identifier or name a Resource on the Internet....
 (URI) that specifies where an identified resource is available and the mechanism for retrieving it. In popular usage and in many technical documents and verbal discussions it is often, imprecisely and confusingly, used as a synonym
Synonym

Synonyms are different words with identical or very similar meanings. Words that are synonyms are said to be synonymous, and the state of being a synonym is called synonymy....
 for uniform resource identifier. The confusion in usage stems from historically different interpretations of the semantics of the terms involved.






Discussion
Ask a question about 'Uniform Resource Locator'
Start a new discussion about 'Uniform Resource Locator'
Answer questions from other users
Full Discussion Forum



Encyclopedia


In computing
Information technology

Information technology , as defined by the Information Technology Association of America , is "the study, design, development, implementation, support or management of computer-based information systems, particularly software applications and computer hardware." IT deals with the use of electronic computers and computer software to data conv...
, a Uniform Resource Locator (URL) is a type of Uniform Resource Identifier
Uniform Resource Identifier

In Information technology, a Uniform Resource Identifier is a Character string of Character s used to Identifier or name a Resource on the Internet....
 (URI) that specifies where an identified resource is available and the mechanism for retrieving it. In popular usage and in many technical documents and verbal discussions it is often, imprecisely and confusingly, used as a synonym
Synonym

Synonyms are different words with identical or very similar meanings. Words that are synonyms are said to be synonymous, and the state of being a synonym is called synonymy....
 for uniform resource identifier. The confusion in usage stems from historically different interpretations of the semantics of the terms involved. In popular language, a URL is also referred to as a Web address.

History

The URL was created in 1990 by Tim Berners-Lee
Tim Berners-Lee

Sir Timothy John Berners-Lee, Order of Merit, Order of the British Empire, Royal Society, Royal Academy of Engineering, Royal Society of Arts is an English people computer scientist and MIT professor credited with inventing the World Wide Web....
 as part of the URI. He regrets the format of the URL. Instead of being divided into the route to the server
Server (computing)

A server is a computer program that provides services to other computer programs , in the same or other computer. The physical computer that runs a server program is also often referred to as server....
, separated by dots, and the file path, separated by slashes, he would have liked it to be one coherent hierarchical path. For example, http://www.serverroute.com/path/to/file.html would look like http://com/serverroute/path/to/file.html.

Syntax


Every URL is made up of some combination of the following: The scheme name
URI scheme

In the field of computer networking, a URI scheme is the top level of the Uniform Resource Identifier naming structure. All URIs and absolute URI references are formed with a scheme name, followed by a Colon , and the remainder of the URI called the scheme-specific part....
 or resource type, a registered domain name
Domain name

The term domain name has multiple related meanings:* A hostname that identifies a computer or computers on the Internet. These names appear as a component of a Web site's Uniform Resource Locator, e.g....
 or internet protocol address, the port number, the pathname of the file to be fetched or the program to be run, and with html
HTML

HTML, an Acronym and initialism of HyperText Markup Language, is the predominant markup language for Web pages. It provides a means to describe the structure of text-based information in a document?by denoting certain text as links, headings, paragraphs, lists, and so on?and to supplement that text with interactive forms, embedded '...
 files, an anchor
HTML element

In computing, an HTML element indicates structure in an HTML document and a way of hierarchically arranging content. More specifically, an HTML element is an Standard Generalized Markup Language element that meets the requirements of one or more of the HTML Document Type Definitions ....
 for where the page should be displayed at .

The combined syntax will look similar to this: resource_type://domain:port/filepathname#anchor

  • The scheme name, or resource type, defines its namespace, purpose, and the syntax of the remaining part of the URL. Most Web-enabled programs will try to dereference
    Uniform Resource Identifier

    In Information technology, a Uniform Resource Identifier is a Character string of Character s used to Identifier or name a Resource on the Internet....
     a URL according to the semantics of its scheme and a context-vbn. For example, a Web browser
    Web browser

    A Web browser is a application software which enables a user to display and interact with text, images, videos, music, games and other information typically located on a Web page at a website on the World Wide Web or a local area network....
     will usually dereference the URL http://example.org:80 by performing an HTTP request to the host example.org, at the port number 80. Dereferencing the URL mailto:bob@example.com will usually start an e-mail
    E-mail

    Electronic mail, often abbreviated as e-mail, email, E-Mail, or eMail, is any method of creating, transmitting, or storing primarily text-based human communications with digital communications systems....
     composer with the address bob@example.com in the To field.
    • Other examples of scheme names include https
      Https

      Hypertext Transfer Protocol Secure is a combination of the Hypertext Transfer Protocol and a secure communication protocol.HTTP operates at the highest layer of the TCP/IP model, the Application layer; but the security protocol operates at lower sublayer, encrypting an HTTP message prior to transmission and decrypting a message upon arriva...
      : gopher:, wais:, ftp:. URLs that specify https as a scheme (such as https://example.com/) normally denote a secure website.


  • The registered domain name or IP address gives the destination location for the URL. The domain google.com, or its IP address 72.14.207.99, directs you to where Google's website resides.


  • The hostname and domain name portion of a URL are case insensitive since the DNS
    Domain name system

    The Domain Name System is a hierarchical naming system for computers, services, or any resource participating in the Internet. It associates various information with domain names assigned to such participants....
     is specified to ignore case. http://en.wikipedia.org/ and HTTP://EN.WIKIPEDIA.ORG/ will both open same page.


  • The port number is optional. If it is not provided, the default for the scheme will be used. For example, in your browser
    Web browser

    A Web browser is a application software which enables a user to display and interact with text, images, videos, music, games and other information typically located on a Web page at a website on the World Wide Web or a local area network....
     you could type http://google.com:80 which would bring you to google.com on port 80. If you left out port 80, your browser would navigate to the same location because port 80 is the default for HTTP.


  • The file path name is the destination on the server for where to access the file or program they are looking for. It may be treated as case insensitive by some servers, especially those that are based on Microsoft Windows
    Windows Server

    Windows Server is a brand name for server operating systems released by Microsoft Corporation. This name has been used for the following software releases:...
    . For example:
    • http://en.wikipedia.org/wiki/URL is correct, but http://en.wikipedia.org/WIKI/URL/ will result in an HTTP 404
      HTTP 404

      The 404 or Not Found error message is a Hypertext Transfer Protocol List of HTTP status codes indicating that the Web Browser was able to communicate with the server but either the server could not find what was requested, or it was configured not to fulfill the request and did not reveal the reason why....
       error page.


  • The anchor part when used with HTTP allows you to be directed to a specific location on the page after you have navigated there. For example, http://en.wikipedia.org/wiki/URL#Syntax would bring you to the beginning of the Syntax section of this page.


Absolute vs Relative URLs

An absolute URL is one that points to the exact location of a file. It is unique meaning that if two URLs are identical, they point to the same file. An example of this would be: http://en.wikipedia.org/wiki/File:Raster_to_Vector_Mechanical_Example.jpg

A relative URL points to the location of a file from a point of reference. This reference is usually the directory
Directory (file systems)

In computing, a directory, folder, catalog, or drawer is a virtual container within a digital file system, in which groups of files and other directories can be kept and organized....
 beneath the file. It is preceded by two dots (../directory_path/file.txt) for the directory below, one dot (./directory_path/file.txt) for the current directory or without the beginning slash( directory_path/file.txt), which is also the current directory.

URLs as locators

In its current strict technical meaning, a URL is a URI that, “in addition to identifying a resource
Resource (Web)

The concept of resource is primitive in the World Wide Web architecture, and is used in the definition of its fundamental elements. The term was first introduced to refer to targets of Uniform Resource Locators , but its definition has been further extended to include the referent of any Uniform Resource Identifier , or Internationalized Reso...
, [provides] a means of locating the resource by describing its primary access mechanism (e.g., its network ‘location’).”

Internet hostnames

On the Internet
Internet

The Internet is a global network of interconnected computers, enabling users to share information along multiple channels. Typically, a computer that connects to the Internet can access information from a vast array of available server and other computers by moving information from them to the computer's local memory....
, a hostname is a domain name
Domain name

The term domain name has multiple related meanings:* A hostname that identifies a computer or computers on the Internet. These names appear as a component of a Web site's Uniform Resource Locator, e.g....
 assigned to a host computer. This is usually a combination of the host's local name with its parent domain's name. For example, "en.wikipedia.org" consists of a local hostname ("en") and the domain name "wikipedia.org". This kind of hostname is translated into an IP address
IP address

An Internet Protocol address is a numerical identification that is assigned to devices participating in a computer network utilizing the Internet Protocol for communication between its nodes....
 via the local hosts file
Hosts file

The hosts file is a computer computer file used to store information on where to find a Node on a computer network. This file maps hostnames to IP addresses....
, or the Domain Name System
Domain name system

The Domain Name System is a hierarchical naming system for computers, services, or any resource participating in the Internet. It associates various information with domain names assigned to such participants....
 (DNS) resolver. It is possible for a single host computer to have several hostnames; but generally the operating system
Operating system

An operating system is an interface between hardware and applications; it is responsible for the management and coordination of activities and the sharing of the limited resources of the computer....
 of the host prefers to have one hostname that the host uses for itself.

Any domain name can also be a hostname, as long as the restrictions mentioned below are followed. So, for example, both "en.wikimedia.org" and "wikimedia.org" are hostnames because they both have IP address
IP address

An Internet Protocol address is a numerical identification that is assigned to devices participating in a computer network utilizing the Internet Protocol for communication between its nodes....
es assigned to them. The domain name "pmtpa.wikimedia.org" is not a hostname since it does not have an IP address, but "rr.pmtpa.wikimedia.org" is a hostname. All hostnames are domain names, but not all domain names are hostnames.

See also

  • CURIE
    Curie

    The curie is a unit of Radioactive decay, defined asThis is roughly the activity of 1 gram of the radium isotope 226Ra, a substance studied by the pioneers of radiology, Marie Curie and Pierre Curie....
     (Compact URI)
  • Extensible Resource Identifier
    Extensible Resource Identifier

    Extensible Resource Identifier is a scheme and resolution protocol for abstract identifiers compatible with Uniform Resource Identifiers and Internationalized Resource Identifiers, developed by the at OASIS ....
     (XRI)
  • Internationalized Resource Identifier
    Internationalized Resource Identifier

    On the Internet, the Internationalized Resource Identifier is a generalization of the Uniform Resource Identifier , which is in turn a generalization of the Uniform Resource Locator ....
     (IRI)
  • Uniform Resource Identifier
    Uniform Resource Identifier

    In Information technology, a Uniform Resource Identifier is a Character string of Character s used to Identifier or name a Resource on the Internet....
     (URI)
  • URL normalization
    URL normalization

    URL normalization is the process by which Uniform Resource Locator are modified and standardized in a consistent manner. The goal of the normalization process is to transform a URL into a normalized or canonical URL so it is possible to determine if two syntactically different URLs are equivalent....
  • URI scheme
    URI scheme

    In the field of computer networking, a URI scheme is the top level of the Uniform Resource Identifier naming structure. All URIs and absolute URI references are formed with a scheme name, followed by a Colon , and the remainder of the URI called the scheme-specific part....


External links

  • RFC 3986 Uniform Resource Identifier (URI): Generic Syntax ] ]