Skip to content

Localisation (English)

Tomasz Borek edited this page Dec 8, 2015 · 4 revisions

Localisation files

Localisation files are located in separate folders:

  • loc/ru — Russian localisation files
  • loc/en — English localisation files
  • etc

Format of files (for example sections.txt):

# Sections l19n

control-character  : Control character
basic-latin        : Basic Latin
latin-1-supplement : Latin-1 Supplement
latin-extended-A   : Latin extended A
latin-extended-B   : Latin extended B
ipa-extensions     : IPA Extensions

First the section key (see Main data files), then the translation.

Lines beginning with a # are comments and are ignored. Empty lines are ignored as well.

Inheritance from English

In the absence of translation any language (for example, in /loc/ru/sections.txt is no basic-latin), the name is taken from English.

The list of files with the names of objects

  • sections.txt — section names
  • sets.txt — sets names
  • types.txt — types of sections
  • languages.txt — language names
  • countries.txt — country names

The names of characters

The names of characters are located in the symbols folder, divided into 256 characters per file.

The file format is the same: hexadecimal Unicode number : name

2A00 : N-ary circled dot operator
2A01 : N-ary circled plus operator
2A02 : N-ary circled times operator
2A03 : N-ary union operator with dot
2A04 : N-ary union operator with plus
2A05 : N-ary square intersection operator
2A06 : N-ary square union operator
2A07 : Two logical and operator

For example, the character with the Unicode number 1F25 should be located in symbols/1F00.txt.

You can specify synonyms of the items in this file as well. A synonym is separated from the header by a colon:

number : name : synonym

Descriptions of sections

Text descriptions of sections (displayed on a section page) are located in the folder sections. Each section has its own file. For example, loc/de/sections/cyrillic.txt - the description of cyrillic in German.

Descriptions of characters

These are similar to the descriptions of sections.

loc/ru/symbols-desc/0185.txt — description of the character with the hexadecimal code 0185 Russian.

Other

Moreover, there are files described in different languages in other localisation folders

  • Data for Unicode-Table.com

Documentation in English

Clone this wiki locally