Free/open-source platform for language technology

Apertium develops a free/open-source platform for machine translation and language technology. It also develops linguistic data for many languages, focusing on lesser-resourced and marginalised languages while also covering larger ones. The platform, which includes data for tens of language pairs, a translation engine and auxiliary tools, is being developed around the world, in universities and companies (e.g. Prompsit Language Engineering) and by a growing number of independent free-software developers. There are currently 43 published language pairs within the project, including a number of "firsts" such as Spanish–Occitan, Breton–French, Basque–Spanish, North Sámi–Norwegian Bokmål, Italian–Sardinian and Kazakh–Tatar, and many more are in development.

Primary Open Source License: GNU General Public License version 3.0 (GPL-3.0)

Programming Languages:

  • c++
  • python
  • xml
  • xslt
  • html


Topics:

  • machine translation
  • machine learning
  • linguistics
  • natural language
  • languages