Encodings
Supported by WorldNames TLD Registry Systems
WorldNames' Multilingual
Web Address Services is founded on the UNICODE
- ISO-10646 standard, which supports all the characters (writing scripts)
and computer encodings required to represent practically all known languages
worldwide in an Internet Web Address. This includes: |
| Unicode/ISO-10646: |
UTF-5, UTF-7, UTF-8,
RACE, SGML ( syntax - escaped HTML) |
| ISO-8859-X: |
ISO-8859-1 (Latin 1) Western European
ISO-8859-2 (Latin 2) non-Cyrillic Central European
ISO-8859-3 (Latin 3) Esperanto, Galician, Maltese, Turkish ISO-8859-4
(Latin 4) Baltic Rim
ISO-8859-5 (Cyrillic)
ISO-8859-6 (Arabic)
ISO-8859-7 (Greek)
ISO-8859-8 (Hebrew)
ISO-8859-9 (Latin 5) Improved Turkish
ISO-8859-10 (Latin 6) Inuit, Lappish
ISO-8859-13 (Latin 7) Improved Baltic Rim
ISO-8859-14 (Latin 8) Celtic
ISO-8859-15 (Latin 9) Improved Latin 1, a.k.a. Latin0
|
| Chinese: |
GB2312 (mainland Guojia
Biaozhun simplified) (a.k.a. EUC-CN, ISO-IR-58)
HZ-GB-2312 a.k.a. HZ (7 bit mail-safe GB encapsulation)
GBK a.k.a. CP936 (extension of GB2312)
GB12345 (traditional variant of GB2312)
BIG5 (traditional/Taiwanese)
CP950 (Microsoft extension/variant of BIG5)
EUC-TW (Taiwanese, encapsulates CNS11643)
ISO-2022-CN (mail safe encapsulation of GB2312 and the two initial planes
of CNS11643)
ISO-2022-CN-EXT (ISO-2022-CN plus encapsulation of GB12345, BIG5, and the
rest of CNS11643)
ISO-646-CN a.k.a. GB_1988-80 |
| Japanese: |
EUC-JP (8 bit encapsulation
of JIS X 0201/0208/0212)
ISO-2022-JP (encapsulates JIS X 0201-1976 (Roman),
JIS X 0208-1978/1983)
ISO-2022-JP-2 (encapsulates JIS X 0201-1976 (Roman),
JIS X 0208-1978/1983,
JIS X 0212-1990) Shift-JIS a.k.a. SJIS, S-JIS (encapsulates JIS X 0201 (one
byte per character) and JIS X 0208)
ISO-646-JP a.k.a. JIS_C6220-1969-RO,
ISO-IR-14 ISO-646-JP-OCR-B a.k.a. JIS_C6229-1984-B,
ISO-IR-92,
JP-OCR-B CP932 (extension of Shift-JIS, uses ASCII instead of JIS X 0201-1976)
|
| Korean: |
EUC-KR (encapsulation
of KSC5601)
CP949 (EUC-KR extended with UHC (Unified Hangul Code)
ISO-2022-KR (mail safe encapsulation of KSC5601) |
| Thai |
TIS-620 a.k.a. ISO-IR-166
CP874 a.k.a. WINDOWS-874 MacThai |
| Lao |
MuleLao-1 CP1133 |
| Vietnamese |
VISCII
TCVN |
| Microsoft
CodePages: |
CP708 (MS-DOS Arabic)
CP850 (MS-DOS Multilingual Latin 1)
CP866 (MS-DOS Russian)
CP1250 (Latin2)
CP1251 (Cyrillic)
CP1252 (Latin1)
CP1253 (Greek)
CP1254 (Turkish)
CP1255 (Hebrew)
CP1256 (Arabic)
CP1257 (Baltic)
CP1258 (Vietnamese) |
| Apple: |
MacRoman
MacCentralEurope
MacIceland
MacCroatian
MacRomania
MacCyrillic
MacUkraine
MacGreek
MacTurkish
Macintosh
MacHebrew
MacArabic |
| Various:
|
KOI8-R (Russian)
KOI8-U (Ukrainian)
KOI8-RU (Russian/Ukrainian)
JUS_I.B1.002 a.k.a. ISO-IR-141,
ISO646-YU (languages of Yugoslavia)
ARMSCII-8 (Armenian)
Georgian-Academy
Georgian-PS |