Translate Toolkit
Encyclopedia
The Translate Toolkit is a localization and translation toolkit. It provides a set of tools for working with localization
Localization
Localization or localisation, and represented as a numeronym as L10n, may refer to:* Language localization, the process of translating a product into different languages or adapting a product for a specific country or region...

 file formats and files that might need localization. The toolkit also provides an API on which to develop other localization tools.

The toolkit is written in the Python programming language
Python (programming language)
Python is a general-purpose, high-level programming language whose design philosophy emphasizes code readability. Python claims to "[combine] remarkable power with very clear syntax", and its standard library is large and comprehensive...

. It is free software
Free software
Free software, software libre or libre software is software that can be used, studied, and modified without restriction, and which can be copied and redistributed in modified or unmodified form either without restriction, or with restrictions that only ensure that further recipients can also do...

 originally developed and released by Translate.org.za in 2002 and is now maintained by Translate.org.za and community developers.

History

The toolkit was originally developed as the mozpotools by David Fraser for Translate.org.za. Translate.org.za had focused on translating KDE which used Gettext
Gettext
In computing, gettext is an internationalization and localization system commonly used for writing multilingual programs on Unix-like computer operating systems. The most commonly-used implementation of gettext is GNU gettext, released by the GNU Project in 1995.- History :gettext was originally...

 PO files for localization. With an internal change to focus on end-user, cross-platform, OSS software, the organisation decided to localize the Mozilla Suite. This required using new tools and new formats that were not as rich as Gettext PO. Thus mozpotools was created to convert the Mozilla DTD and .properties files to Gettext PO.

Various tools were developed as needed, including pocount, a tool to count source text words to allow correct estimations for work, pogrep, to search through translations, and pofilter, to check for various quality issues.

When Translate.org.za began translating OpenOffice.org it was only natural that the Translate Toolkit would be adapted to handle the OpenOffice.org internal file format. Translating OpenOffice.org using PO files is now the default translation method.

As part of the WordForge project the work received a major boost and the toolkit was further extended to manage XLIFF files alongside PO files. Further funded development has added other features including the ability to convert Open Document Format to XLIFF and the management of placeholders (Variables, acronyms, terminology, etc.).

Design goals

The main aim of the toolkit is to increase the quality of localisation and translation. This is achieved by firstly, focusing on good localisation formats thus the toolkit makes use of the PO and XLIFF localisation formats. This has the benefit that it stops the proliferation of localization formats and allows localizers to work with one good localization tool. For the toolkit this means building converters that can transform files to be translated into these two basic formats.

Secondly, to build tools that allow localizers to increase the general quality of their localization. These tools allow for the extraction of terminology and for checking for the consistent use of terminology. The tools allow for checking for various technical errors such as the correct use of variables.

Lastly, the toolkit provides a powerful localisation API that acts as a base on which to build other localization related tools.

Users

Many translators use the toolkit directly, to do quality checks and to transform files for translation. Further there are several indirect users of the Translate Toolkit API:
  • Pootle
    Pootle
    Pootle is an online translation management tool with translation interface. It is written in the Python programming language using the Django framework and is free software originally developed and released by Translate.org.za in 2004...

     - an online translation tool
  • open-tran - providing translation memory lookup
  • Wordforge (old name Pootling) - an offline translation tool for Windows and Linux
  • Rosetta - free translation web service offered by LaunchPad. It is used mainly by the Ubuntu community translation tool. See it in action in Launchpad Translations
  • OpenOffice.org
    OpenOffice.org
    OpenOffice.org, commonly known as OOo or OpenOffice, is an open-source application suite whose main components are for word processing, spreadsheets, presentations, graphics, and databases. OpenOffice is available for a number of different computer operating systems, is distributed as free software...

     - most community localization is done through PO files produced by the toolkit
  • Virtaal
    Virtaal
    Virtaal is a computer-assisted translation tool written in the Python programming language. It is free software developed and maintained by Translate.org.za....

     - a localisation and translation tool
  • Translatewiki.net

Supported Document Formats

  • Primary Localization Formats
    • Gettext
      Gettext
      In computing, gettext is an internationalization and localization system commonly used for writing multilingual programs on Unix-like computer operating systems. The most commonly-used implementation of gettext is GNU gettext, released by the GNU Project in 1995.- History :gettext was originally...

       PO
    • XLIFF
      XLIFF
      XLIFF is an XML-based format created to standardize localization. XLIFF was standardized by OASIS in 2002. Its current specification is v1.2 released on Feb-1-2008....

       (Normal and PO representations)
  • Other Localization Related Formats
    • TBX
    • Java .properties
      .properties
      .properties is a file extension for files mainly used in Java related technologies to store the configurable parameters of an application. They can also be used for storing strings for Internationalization and localization; these are known as Property Resource Bundles.Each parameter is stored as a...

    • Qt
      Qt (toolkit)
      Qt is a cross-platform application framework that is widely used for developing application software with a graphical user interface , and also used for developing non-GUI programs such as command-line tools and consoles for servers...

       .ts, .qm and .qph (Qt Phrase Book)
    • Gettext
      Gettext
      In computing, gettext is an internationalization and localization system commonly used for writing multilingual programs on Unix-like computer operating systems. The most commonly-used implementation of gettext is GNU gettext, released by the GNU Project in 1995.- History :gettext was originally...

       .mo
    • OmegaT
      OmegaT
      OmegaT is a computer-assisted translation tool written in the Java programming language. It is free software originally developed by Keith Godfrey in 2000, and is currently developed by a team led by Didier Briel. The name OmegaT is a registered trademark in Germany.OmegaT is intended for...

       glossaries
    • Haiku
      Haiku
      ' , plural haiku, is a very short form of Japanese poetry typically characterised by three qualities:* The essence of haiku is "cutting"...

       catkeys files
  • Other Formats
    • OpenDocument Format
    • Plain Text
    • Wiki: DokuWiki
      DokuWiki
      DokuWiki is a wiki application aimed at small companies’ documentation needs. DokuWiki is licensed under GPL 2 and written in the programming language PHP. It works on plain text files and thus needs no database. Its syntax is similar to the one used by MediaWiki.-History:DokuWiki was created by...

       and MediaWiki
      MediaWiki
      MediaWiki is a popular free web-based wiki software application. Developed by the Wikimedia Foundation, it is used to run all of its projects, including Wikipedia, Wiktionary and Wikinews. Numerous other wikis around the world also use it to power their websites...

       syntax
    • Mozilla DTD
    • OpenOffice.org SDF
    • PHP strings
    • OS X strings
    • Adobe Flex
      Adobe Flex
      Adobe Flex is a software development kit released by Adobe Systems for the development and deployment of cross-platform rich Internet applications based on the Adobe Flash platform...

       files
    • INI file
    • Windows / Wine .rc files
    • iCalendar
      ICalendar
      iCalendar is a computer file format which allows Internet users to send meeting requests and tasks to other Internet users, via email, or sharing files with an extension of .ics...

    • Symbian
      Symbian
      Symbian is a mobile operating system and computing platform designed for smartphones and currently maintained by Accenture. The Symbian platform is the successor to Symbian OS and Nokia Series 60; unlike Symbian OS, which needed an additional user interface system, Symbian includes a user...

       localization files
    • Subtitles
  • Translation Memory Formats
    • TMX
      Translation Memory eXchange
      TMX is an open XML standard for the exchange of translation memory data created by computer-aided translation and localization tools....

    • Wordfast
      Wordfast
      Wordfast is a provider of translation memory software. Wordfast provides platform-independent TM solutions for freelance translators, language service providers, and multi-national corporations.- History :...

       TM

OpenDocument Format support

Work was started in June 2008 to incorporate OpenDocument Format support. This work is funded by the NLnet
NLnet
NLnets history started in April 1982 with the announcement of a major initiative to develop and provide network services in Europe under the name EUnet.Stichting NLnet was formally established as a "stichting" in February 1989...

 Foundation and is a collaboration between Translate.org.za and Itaapy

See also

  • Computer-assisted Translation
    Computer-assisted translation
    Computer-assisted translation, computer-aided translation, or CAT is a form of translation wherein a human translator translates texts using computer software designed to support and facilitate the translation process....

  • Translation Memory
    Translation memory
    A translation memory, or TM, is a database that stores so-called "segments", which can be sentences or sentence-like units that have previously been translated. A translation memory system stores the words, phrases and paragraphs that have already been translated, in order to aid human translators...


External links

The source of this article is wikipedia, the free encyclopedia.  The text of this article is licensed under the GFDL.
 
x
OK