CSets: Supplemental Unicode Mapping Tables
- Mapping tables for various languages.
- UTF-8 is an ASCII-preserving encoding method for Unicode (ISO 10646), the Universal Character Set (UCS). The
UCS encodes most of the world's writing systems in a single character set, allowing you to mix languages and scripts
within a document without needing any tricks for switching character sets. This web page is encoded directly in
Test page for UNICODE UTF-8 encoding.
- UNICODE is a really great 2-byte-encoding that gives every letter of any writing system in the world a unique code
number. Therefore, on one page you can write in various writing systems - this was impossible before UNICODE
existed, because you had to switch to various codepages depending on language, like "Latin 1", "Latin 2" or "Cyrillic".
Now you can have it all in one single web page! Our example shows Latin letters with diacritical marks of Central
European languages and Cyrillic characters.
Does Your Browser Support Multi-language ? [reference sheets]
- Unicode is the World's standard for encoding text. Most all of the characters used in modern writing systems have
already been assigned to unique code positions and work is under way to add some fairly exotic modern scripts as
well as provide standardized encoding for ancient scripts. If your browser has multilingual capabilities, it probably
uses Unicode to address the various letters, characters, and symbols shown on your screen.
Unicode Character Charts [Katakana] [Greek]
- The charts in this list are arranged in code point order.