User:Dcljr/Scripts
Quick index redesign
The redesigned Main Page has now gone live; see this page's history for older versions of the "quick index".
Browse by script in Unicode order
Working on this...
- Basic Latin (0000–007F)
- Special characters
- Latin-1 Supplement (0080–00FF)
- Latin Extended-A (0100–017F)
- Latin Extended-B (0180–024F)
- IPA Extensions (0250–02AF)
- Spacing Modifier Letters (02B0–02FF)
- Combining Diacritical Marks (0300–036F)
- Greek and Coptic (0370–03FF)
- Cyrillic (0400–04FF)
- Cyrillic Supplement (0500–052F)
- Armenian (0530–058F)
- Hebrew (0590–05FF)
- Arabic (0600–06FF)
- Syriac (0700–074F)
- Arabic Supplement (0750–077F)
- Thaana (0780–07BF)
- Indic scripts:
- Thai (0E00–0E7F)
- Lao (0E80–0EFF)
- Tibetan (0F00–0FFF)
- Burmese (1000–109F)
- Georgian (10A0–10FF)
- Hangul Jamo (1100–11FF)
- Ethiopic (1200–137F)
- Ethiopic Supplement (1380–139F)
- Cherokee (13A0–13FF)
- Unified Canadian Aboriginal Syllabics (1400–167F)
- Ogham (1680–169F)
- Runic (16A0–16FF)
- Filipino scripts:
- Khmer (1780–17FF)
- Mongolian (1800–18AF)
- Limbu (1900–194F)
- Tai Le (1950–197F)
- New Tai Lue (1980–19DF)
- Khmer Symbols (19E0–19FF)
- Buginese (1A00–1A1F)
- Phonetic Extensions (1D00–1D7F)
- Phonetic Extensions Supplement (1D80–1DBF)
- Combining Diacritical Marks Supplement (1DC0–1DFF)
- Latin Extended Additional (1E00–1EFF)
- Greek Extended (1F00–1FFF)
- Symbols:
- General Punctuation (2000–206F)
- Superscripts and Subscripts (2070–209F)
- Currency Symbols (20A0–20CF)
- Combining Diacritical Marks for Symbols (20D0–20FF)
- Letterlike Symbols (2100–214F)
- Number Forms (2150–218F)
- Arrows (2190–21FF)
- Mathematical Operators (2200–22FF)
- Miscellaneous Technical (2300–23FF)
- Control Pictures (2400–243F)
- Optical Character Recognition (2440–245F)
- Enclosed Alphanumerics (2460–24FF)
- Box Drawing (2500–257F)
- Block Elements (2580–259F)
- Geometric Shapes (25A0–25FF)
- Miscellaneous Symbols (2600–26FF)
- Dingbats (2700–27BF)
- Miscellaneous Mathematical Symbols-A (27C0–27EF)
- Supplemental Arrows-A (27F0–27FF)
- Braille Patterns (2800–28FF)
- Supplemental Arrows-B (2900–297F)
- Miscellaneous Mathematical Symbols-B (2980–29FF)
- Supplemental Mathematical Operators (2A00–2AFF)
- Miscellaneous Symbols and Arrows (2B00–2BFF)
- Glagolitic (2C00–2C5F)
- Coptic (2C80–2CFF)
- Georgian Supplement (2D00–2D2F)
- Tifinagh (2D30–2D7F)
- Ethiopic Extended (2D80–2DDF)
- Supplemental Punctuation (2E00–2E7F)
- CJK Radicals Supplement (2E80–2EFF)
- Kangxi Radicals (2F00–2FDF)
- Ideographic Description Characters (2FF0–2FFF)
- CJK Symbols and Punctuation (3000–303F)
- Hiragana (3040–309F)
- Katakana (30A0–30FF)
- Bopomofo (3100–312F)
- Hangul Compatibility Jamo (3130–318F)
- Kanbun (3190–319F)
- Bopomofo Extended (31A0–31BF)
- CJK Strokes (31C0–31EF)
- Katakana Phonetic Extensions (31F0–31FF)
- Enclosed CJK Letters and Months (3200–32FF)
- CJK Compatibility (3300–33FF)
- CJK Unified Ideographs Extension A (3400–4DBF)
- Yijing Hexagram Symbols (4DC0–4DFF)
- CJK Unified Ideographs (4E00–9FFF)
- Yi Syllables (A000–A48F)
- Yi Radicals (A490–A4CF)
- Modifier Tone Letters (A700–A71F)
- Syloti Nagri (A800–A82F)
- Hangul Syllables (AC00–D7AF)
- High Surrogates (D800–DB7F)
- High Private Use Surrogates (DB80–DBFF)
- Low Surrogates (DC00–DFFF)
- Private Use Area (E000–F8FF)
- CJK Compatibility Ideographs (F900–FAFF)
- Alphabetic Presentation Forms (FB00–FB4F)
- Arabic Presentation Forms-A (FB50–FDFF)
- Variation Selectors (FE00–FE0F)
- Vertical Forms (FE10–FE1F)
- Combining Half Marks (FE20–FE2F)
- CJK Compatibility Forms (FE30–FE4F)
- Small Form Variants (FE50–FE6F)
- Arabic Presentation Forms-B (FE70–FEFF)
- Halfwidth and Fullwidth Forms (FF00–FFEF)
- Specials (FFF0–FFFF)
Based on "Mapping of Unicode characters" at Wikipedia.
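The ranges above can be used to find the block a given character belongs to. The following is only a minimal Python sketch: the table holds just a handful of the ranges from the list, and the names `BLOCKS`, `STARTS` and `block_of` are made up for this example.

```python
from bisect import bisect_right

# A few (start, end, name) ranges copied from the list above; not the full table.
BLOCKS = [
    (0x0000, 0x007F, "Basic Latin"),
    (0x0080, 0x00FF, "Latin-1 Supplement"),
    (0x0370, 0x03FF, "Greek and Coptic"),
    (0x0400, 0x04FF, "Cyrillic"),
    (0x0530, 0x058F, "Armenian"),
    (0x10A0, 0x10FF, "Georgian"),
    (0x3040, 0x309F, "Hiragana"),
    (0x4E00, 0x9FFF, "CJK Unified Ideographs"),
]
# Block start values, sorted, so that a binary search can be used.
STARTS = [start for start, _, _ in BLOCKS]


def block_of(char):
    """Return the block name for a single character, or None if it is not in BLOCKS."""
    cp = ord(char)
    # Index of the last block whose start is <= cp (or -1 if there is none).
    i = bisect_right(STARTS, cp) - 1
    if i >= 0:
        start, end, name = BLOCKS[i]
        if cp <= end:
            return name
    return None


print(block_of("Ω"))  # Greek and Coptic
print(block_of("я"))  # Cyrillic
print(block_of("ก"))  # None (Thai is not in this partial table)
```

Because the table is partial, a result of `None` only means "not in this sketch", not "not in Unicode".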
Browse by script name
Scripts are listed in alphabetical order. See below for a list by language name.
Armenian
Coptic
- Coptic alphabet:
- Coptic letters derived from Demotic: Ϣ–ϯ
Cyrillic
Georgian
- Georgian alphabet:
- ?: ჱ–ჼ
- ?: ⴀ–ⴥ
Greek
- Greek alphabet:
- ?: ἀ–ῼ
Latin
- ................
Japanese
- Kana:
- Kanji:
- By radical: ......
- Romaji: Romaji uses the Latin script. For easier browsing, please see Category:Romaji.
Browse by language name
- Armenian: uses Armenian alphabet
- Coptic: uses Coptic alphabet
- Georgian: uses Georgian alphabet
- Greek: uses Greek alphabet
- Japanese: uses Japanese kana and kanji, and sometimes the Latin alphabet (romaji)
- Russian: uses a modified Cyrillic alphabet
Scripts at Unicode.org
Based on The Unicode Character Code Charts By Script. Note that the Unicode character ranges are for entire blocks, not just the alphabets.
- European alphabets
- Ա–Ֆ ա–ֆ ﬓ–ﬗ (Armenian: U+0530–U+058F; ligatures: U+FB13–U+FB17)
- Ⲁ–ⲱ Ⲳ–ⳛ Ϣ–ϯ (Coptic: U+2C80–U+2CFF; Coptic derived from Demotic in Greek and Coptic: U+03E2–U+03EF)
- А–Я а–я Ѐ–Џ ѐ–ӹ Ԁ–ԏ (Cyrillic: U+0400–U+04FF; supplement: U+0500–U+052F)
- ა–ჰ Ⴀ–Ⴥ ჱ–ჼ ⴀ–ⴥ (Georgian: U+10A0–U+10FF; supplement: U+2D00–U+2D2F)
- Α–Ω α–ω ἀ–ῼ (Greek [and Coptic]: U+0370–U+03FF; extended: U+1F00–U+1FFF; see also Ancient Greek)
- Latin
- !- A-Z a-z (Basic Latin: U+0000–U+007F)
- Latin-1 (U+0080–) € &#x;
- Latin Extended A (U+0100–) Ā &#x;
- Latin Extended B (U+0180–) ƀ &#x;
- Latin Extended C (5.0)
- Latin Extended D (5.0)
- Latin Extended Additional (U+1E00–) Ḁ &#x;
- Latin Ligatures (see U+FB00–U+FB06, ff)
- Fullwidth Latin Letters (in UFF00.pdf: U+–U+) !
- Small Forms (U+FE50–) ﹐ &#x;
- See also: Phonetic Symbols
- See also: Combining Diacritical Marks
- African Scripts
- Middle Eastern Scripts
- Arabic
- Hebrew
- Hebrew (U+0590–) &#x;
- Hebrew Presentation Forms (see U+FB1D–U+FB4F, Allpages/) יִ
- Other Middle Eastern Scripts
- American scripts
- Other Scripts
- Indic Scripts
- Bengali (U+0980–) ঀ &#x;
- Devanagari (U+0900–) ऀ &#x;
- Gujarati (U+0A80–) &#x;
- Gurmukhi (U+0A00–) &#x;
- Kannada (U+0C80–) ಀ &#x;
- Limbu (U+1900–) ᤀ &#x;
- Malayalam (U+0D00–) ഀ &#x;
- Oriya (U+0B00–) &#x;
- Sinhala (U+0D80–) &#x;
- Syloti Nagri (U+A800–) ꠀ &#x;
- Tamil (U+0B80–) &#x;
- Telugu (U+0C00–) ఀ &#x;
- Philippine Scripts
- South East Asian
- East Asian Scripts
- Han Ideographs
- Radicals and Strokes
- CJK Radicals Supplement (U+2E80–U+2EF3) ⺀–⻳
- Kangxi Radicals (U+2F00–U+2FD5) ⼀–⿕
- CJK Strokes (U+31C0–) ㇀
- Ideographic Description (U+2FF0–) ⿰
- Chinese-specific
- Japanese-specific
- Korean-specific
- Yi
- Central Asian Scripts
- Ancient Scripts
- Ancient Greek
- Cuneiform
- Linear B
- Other Ancient Scripts
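As noted at the top of this section, the ranges given are for entire blocks, and a block usually contains some unassigned code points. A rough way to see this is sketched below in Python, using the standard `unicodedata` module. The helper name `assigned_in_block` is made up for this example, and the counts depend on the Unicode version bundled with the Python interpreter, so no exact figure is quoted here.

```python
import unicodedata


def assigned_in_block(start, end):
    """Return the code points in start..end (inclusive) that are assigned
    in the Unicode version bundled with this Python.
    General category 'Cn' means "unassigned"."""
    return [cp for cp in range(start, end + 1)
            if unicodedata.category(chr(cp)) != "Cn"]


# Armenian block, U+0530–U+058F (range taken from the list by Unicode order).
armenian = assigned_in_block(0x0530, 0x058F)
print(len(armenian), "of", 0x058F - 0x0530 + 1, "code points assigned")
print("U+0530 assigned?", 0x0530 in armenian)
```

The first code point of the Armenian block, U+0530, is not assigned; the alphabet itself starts at U+0531 (Ա).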
Punctuation and other symbols at Unicode.org
Based on Code Charts for Symbols and Punctuation.
- Punctuation
- General Punctuation
- U0000.pdf ASCII Punctuation
- U0080.pdf Latin-1 Punctuation
- U2000.pdf General Punctuation
- U2E00.pdf Supplemental Punctuation
- CJK Punctuation
- U3000.pdf CJK Punctuation
- UFF00.pdf Fullwidth ASCII Punctuation
- UFE10.pdf Vertical Forms
- Enclosed and Square
- U2460.pdf Enclosed Alphanumerics
- U3200.pdf .... CJK Letters and Months
- U3300.pdf CJK Compatibility
- See also: Letterlike Symbols
- Combining Diacritical Marks
- U0300.pdf Combining Diacritical Marks
- U20D0.pdf .... for Symbols
- U1DC0.pdf .... Supplement
- UFE20.pdf Combining Half Marks
- Phonetic Symbols
- U0250.pdf IPA Extensions
- U1D00.pdf Phonetic Extensions
- U1D80.pdf Phonetic Extensions Supplement
- UA700.pdf Modifier Tone Letters
- U02B0.pdf Spacing Modifier Letters
- See also: Super and Subscript
- Mathematical Symbols
- Numbers and Digits
- U0000.pdf ASCII Digits
- UFF00.pdf Fullwidth ASCII Digits
- U2150.pdf Number Forms
- U2070.pdf Super and Subscripts
- See also: specific scripts
- Letterlike Symbols
- U2100.pdf Letterlike Symbols
- U1D400.pdf Math Alphanumeric Symbols
- Arrows and Operators
- U2190.pdf Arrows
- U2200.pdf Mathematical Operators
- U2A00.pdf Suppl. Math Operators
- U27C0.pdf Misc. Math Symbols A
- U2980.pdf Misc. Math Symbols B
- U27F0.pdf Supplemental Arrows A
- U2900.pdf Supplemental Arrows B
- U2B00.pdf Misc. Symbols and Arrows
- Geometrical Symbols
- U25A0.pdf Geometric Shapes
- U2500.pdf Box Drawing
- U2580.pdf Block Elements
- Technical Symbols
- U2400.pdf Control Pictures
- U2300.pdf Miscellaneous Technical
- U2440.pdf OCR
- Symbols
- Miscellaneous Symbols
- U2700.pdf Dingbats
- U2600.pdf Miscellaneous Symbols
- U1D300.pdf Tai Xuan Jing Symbols
- U4DC0.pdf Yijing Hexagrams
- U2800.pdf Braille Patterns
- Musical Notation
- U1D200.pdf Ancient Greek Musical...
- U1D000.pdf Byzantine Musical Symbols
- U1D100.pdf Western Musical Symbols
- Currency Symbols
- U0000.pdf Dollar Sign
- U0080.pdf Yen, Pound and Cent
- U20A0.pdf Currency Symbols
- UFF00.pdf Fullwidth Currency Symbols
- U2100.pdf Mark and...
- U20A0.pdf Pfennig (historic)
- UFB50.pdf Rial Sign
- See also: specific scripts
- Specials
- Controls:
- U0000.pdf C0
- U0080.pdf C1
- U2000.pdf Layout Controls
- U2000.pdf Invisible Operators
- UFFF0.pdf Specials
- UE0000.pdf Tags
- UFE00.pdf Variation Selectors
- UE0100.pdf Variation Selectors Supplement
- Private Use
- UE000.pdf Private Use Area
- UF0000.pdf Suppl. Private Use Area A
- U100000.pdf Suppl. Private Use Area B
- Surrogates
- UD800.pdf High Surrogates
- High Private Use Surrogates
- UDC00.pdf Low Surrogates
- Noncharacters in Charts
- UFB50.pdf Reserved range
- UFFF0.pdf At End of BMP
- U1FF80.pdf At End of Plane 1
- U2FF80.pdf At End of Plane 2
- U3FF80.pdf At End of Plane 3
- U4FF80.pdf At End of Plane 4
- U5FF80.pdf At End of Plane 5
- U6FF80.pdf At End of Plane 6
- U7FF80.pdf At End of Plane 7
- U8FF80.pdf At End of Plane 8
- U9FF80.pdf At End of Plane 9
- UAFF80.pdf At End of Plane 10
- UBFF80.pdf At End of Plane 11
- UCFF80.pdf At End of Plane 12
- UDFF80.pdf At End of Plane 13
- UEFF80.pdf At End of Plane 14
- UFFF80.pdf At End of Plane 15
- U10FF80.pdf At End of Plane 16
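The "Noncharacters in Charts" entries above follow a regular pattern: the reserved range U+FDD0–U+FDEF (shown in UFB50.pdf), plus the last two code points of each of the 17 planes. A small Python sketch of that pattern follows; the function names `is_noncharacter` and `end_of_plane_chart` are made up for this example, and the chart file names are simply rebuilt from the list above rather than taken from any official naming rule.

```python
def is_noncharacter(cp):
    """True for U+FDD0..U+FDEF and for the last two code points
    (xxFFFE and xxFFFF) of every plane."""
    return 0xFDD0 <= cp <= 0xFDEF or (cp & 0xFFFE) == 0xFFFE


def end_of_plane_chart(plane):
    """Chart file name as it appears in the list above.
    Plane 0 (the BMP) is a special case: its chart is UFFF0.pdf."""
    if plane == 0:
        return "UFFF0.pdf"
    return "U{:X}FF80.pdf".format(plane)


# Scan the whole code space, U+0000..U+10FFFF.
noncharacters = [cp for cp in range(0x110000) if is_noncharacter(cp)]
print(len(noncharacters))  # 66 (32 in the reserved range + 2 per plane)

for plane in (0, 1, 16):
    print(plane, end_of_plane_chart(plane))
```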
Most common writing systems
Rough estimate of the most common scripts, based on data at List of languages by number of native speakers, cross-referenced with scripts listed at List of languages by writing system:
- Latin, Chinese, Devanagari, Arabic, Bengali, Cyrillic, Japanese, Korean
Others that are common on the internet, according to Languages on the Internet (again, based on speakers, not scripts):
- Thai, Hebrew, Greek
Arabic, Armenian, Bengali, Bopomofo (Zhuyin), Braille, Buhid, Canadian Aboriginal, Cherokee, Common, Cypriot, Cyrillic, Deseret, Devanagari, Ethiopic, Georgian, Gothic, Greek, Gujarati, Gurmukhi, Han, Hangul, Hanunoo, Hebrew, Hiragana, Inherited, Kannada, Katakana, Khmer, Lao, Latin, Limbu, Linear B, Malayalam, Mongolian, Myanmar (Burmese), Ogham, Old Italic, Oriya, Osmanya, Runic, Shavian, Sinhala, Syriac, Tagalog, Tagbanwa, Tai Le, Tamil, Telugu, Thaana, Thai, Tibetan, Ugaritic, Yi