Khmer
Khmer is a Unicode block containing characters for writing the Khmer, or Cambodian, language.
The Khmer alphabet or Khmer script (IPA: ) is an abugida, which means that it's a consonant-driven script. It's used to write the Khmer language (the official language of Cambodia). Apart from that, the script is applied for Pali in the Buddhist liturgy of Cambodia and Thailand.
The origins of Khmer go back to the Pallava script, which it was adopted from. Pallava is a variant of the Grantha alphabet descended from the Brahmi script, which was used in southern India and South East Asia during the 5th and 6th centuries AD. I know, this chain seems complicated, but doesn't all linguistics? Anyway, the oldest Khmer inscription was found at Angkor Borei District in Takéo Province south of Phnom Penh and it dates back to 611.
As for the modern Khmer script, it differs a lot from its precedent forms on the inscriptions of the Angkor ruins. The Thai 0E00–0E7F and Lao 0E80–0EFF scripts have descended from an older form of the Khmer script.
Khmer is written from left to right. Words within one sentence or phrase usually come together with no spaces between them. Consonant clusters within a word are “stacked”, with the second (and occasionally third) consonant being written in reduced form under the main consonant. Originally there were 35 consonant characters, but modern Khmer uses only 33. Each character in fact represents a consonant sound together with an inherent vowel – either â or ô.
You might remember that Khmer is an abugida. That's why vowel sounds are more commonly represented as dependent vowels – additional marks accompanying a consonant character, and indicating what vowel sound is to be pronounced after that consonant (or consonant cluster). Most dependent vowels have two different pronunciations, depending in most cases on the inherent vowel of the consonant to which they are added. In some positions, a consonant written with no dependent vowel is taken to be followed by the sound of its inherent vowel.
Needless to say, there are also a number of diacritics used to indicate further modifications in pronunciation. The script also includes its own numerals and punctuation marks.
Properties
| Range | 1780–17FF |
| Characters | 128 |
Consonants
-
ក1780Khmer Letter Ka
-
ខ1781Khmer Letter Kha
-
គ1782Khmer Letter Ko
-
ឃ1783Khmer Letter Kho
-
ង1784Khmer Letter Ngo
-
ច1785Khmer Letter Ca
-
ឆ1786Khmer Letter Cha
-
ជ1787Khmer Letter Co
-
ឈ1788Khmer Letter Cho
-
ញ1789Khmer Letter Nyo
-
ដ178AKhmer Letter Da
-
ឋ178BKhmer Letter Ttha
-
ឌ178CKhmer Letter Do
-
ឍ178DKhmer Letter Ttho
-
ណ178EKhmer Letter Nno
-
ត178FKhmer Letter Ta
-
ថ1790Khmer Letter Tha
-
ទ1791Khmer Letter To
-
ធ1792Khmer Letter Tho
-
ន1793Khmer Letter No
-
ប1794Khmer Letter Ba
-
ផ1795Khmer Letter Pha
-
ព1796Khmer Letter Po
-
ភ1797Khmer Letter Pho
-
ម1798Khmer Letter Mo
-
យ1799Khmer Letter Yo
-
រ179AKhmer Letter Ro
-
ល179BKhmer Letter Lo
-
វ179CKhmer Letter Vo
-
ឝ179DKhmer Letter Sha
-
ឞ179EKhmer Letter Sso
-
ស179FKhmer Letter Sa
-
ហ17A0Khmer Letter Ha
-
ឡ17A1Khmer Letter La
-
អ17A2Khmer Letter Qa
Deprecated independent vowels for transliteration
Independent vowels
-
ឥ17A5Khmer Independent Vowel Qi
-
ឦ17A6Khmer Independent Vowel Qii
-
ឧ17A7Khmer Independent Vowel Qu
-
ឨ17A8Khmer Independent Vowel Quk
-
ឩ17A9Khmer Independent Vowel Quu
-
ឪ17AAKhmer Independent Vowel Quuv
-
ឫ17ABKhmer Independent Vowel Ry
-
ឬ17ACKhmer Independent Vowel Ryy
-
ឭ17ADKhmer Independent Vowel Ly
-
ឮ17AEKhmer Independent Vowel Lyy
-
ឯ17AFKhmer Independent Vowel Qe
-
ឰ17B0Khmer Independent Vowel Qai
-
ឱ17B1Khmer Independent Vowel Qoo Type One
-
ឲ17B2Khmer Independent Vowel Qoo Type Two
-
ឳ17B3Khmer Independent Vowel Qau
Inherent vowels
Dependent vowel signs
-
ា17B6Khmer Vowel Sign Aa
-
ិ17B7Khmer Vowel Sign I
-
ី17B8Khmer Vowel Sign Ii
-
ឹ17B9Khmer Vowel Sign Y
-
ឺ17BAKhmer Vowel Sign Yy
-
ុ17BBKhmer Vowel Sign U
-
ូ17BCKhmer Vowel Sign Uu
-
ួ17BDKhmer Vowel Sign Ua
Two-part dependent vowel signs
Dependent vowel signs
Two-part dependent vowel signs
Various signs
Consonant shifters
Various signs
-
់17CBKhmer Sign Bantoc
-
៌17CCKhmer Sign Robat
-
៍17CDKhmer Sign Toandakhiat
-
៎17CEKhmer Sign Kakabat
-
៏17CFKhmer Sign Ahsda
-
័17D0Khmer Sign Samyok Sannya
-
៑17D1Khmer Sign Viriam
-
្17D2Khmer Sign Coeng
Lunar date sign
Various signs
-
។17D4Khmer Sign Khan
-
៕17D5Khmer Sign Bariyoosan
-
៖17D6Khmer Sign Camnuc Pii Kuuh
-
ៗ17D7Khmer Sign Lek Too
-
៘17D8Khmer Sign Beyyal
-
៙17D9Khmer Sign Phnaek Muan
-
៚17DAKhmer Sign Koomuut
Currency symbol
Various signs
Digits
-
០17E0Khmer Digit Zero
-
១17E1Khmer Digit One
-
២17E2Khmer Digit Two
-
៣17E3Khmer Digit Three
-
៤17E4Khmer Digit Four
-
៥17E5Khmer Digit Five
-
៦17E6Khmer Digit Six
-
៧17E7Khmer Digit Seven
-
៨17E8Khmer Digit Eight
-
៩17E9Khmer Digit Nine
Numeric symbols for divination lore
-
៰17F0Khmer Symbol Lek Attak Son
-
៱17F1Khmer Symbol Lek Attak Muoy
-
៲17F2Khmer Symbol Lek Attak Pii
-
៳17F3Khmer Symbol Lek Attak Bei
-
៴17F4Khmer Symbol Lek Attak Buon
-
៵17F5Khmer Symbol Lek Attak Pram
-
៶17F6Khmer Symbol Lek Attak Pram-Muoy
-
៷17F7Khmer Symbol Lek Attak Pram-Pii
-
៸17F8Khmer Symbol Lek Attak Pram-Bei
-
៹17F9Khmer Symbol Lek Attak Pram-Buon
-
0: Basic Multilingual Plane
-
Basic Latin0000–007F
-
Latin-1 Supplement0080–00FF
-
Latin Extended-A0100–017F
-
Latin Extended-B0180–024F
-
IPA Extensions0250–02AF
-
Spacing Modifier Letters02B0–02FF
-
Combining Diacritical Marks0300–036F
-
Greek and Coptic0370–03FF
-
Cyrillic0400–04FF
-
Cyrillic Supplement0500–052F
-
Armenian0530–058F
-
Hebrew0590–05FF
-
Arabic0600–06FF
-
Syriac0700–074F
-
Arabic Supplement0750–077F
-
Thaana0780–07BF
-
NKo07C0–07FF
-
Samaritan0800–083F
-
Mandaic0840–085F
-
Syriac Supplement0860–086F
-
Arabic Extended-B0870–089F
-
Arabic Extended-A08A0–08FF
-
Devanagari0900–097F
-
Bengali0980–09FF
-
Gurmukhi0A00–0A7F
-
Gujarati0A80–0AFF
-
Oriya0B00–0B7F
-
Tamil0B80–0BFF
-
Telugu0C00–0C7F
-
Kannada0C80–0CFF
-
Malayalam0D00–0D7F
-
Sinhala0D80–0DFF
-
Thai0E00–0E7F
-
Lao0E80–0EFF
-
Tibetan0F00–0FFF
-
Myanmar1000–109F
-
Georgian10A0–10FF
-
Hangul Jamo1100–11FF
-
Ethiopic1200–137F
-
Ethiopic Supplement1380–139F
-
Cherokee13A0–13FF
-
Unified Canadian Aboriginal Syllabics1400–167F
-
Ogham1680–169F
-
Runic16A0–16FF
-
Tagalog1700–171F
-
Hanunoo1720–173F
-
Buhid1740–175F
-
Tagbanwa1760–177F
-
Khmer1780–17FF
-
Mongolian1800–18AF
-
Unified Canadian Aboriginal Syllabics Extended18B0–18FF
-
Limbu1900–194F
-
Tai Le1950–197F
-
New Tai Lue1980–19DF
-
Khmer Symbols19E0–19FF
-
Buginese1A00–1A1F
-
Tai Tham1A20–1AAF
-
Combining Diacritical Marks Extended1AB0–1AFF
-
Balinese1B00–1B7F
-
Sundanese1B80–1BBF
-
Batak1BC0–1BFF
-
Lepcha1C00–1C4F
-
Ol Chiki1C50–1C7F
-
Cyrillic Extended-C1C80–1C8F
-
Georgian Extended1C90–1CBF
-
Sundanese Supplement1CC0–1CCF
-
Vedic Extensions1CD0–1CFF
-
Phonetic Extensions1D00–1D7F
-
Phonetic Extensions Supplement1D80–1DBF
-
Combining Diacritical Marks Supplement1DC0–1DFF
-
Latin Extended Additional1E00–1EFF
-
Greek Extended1F00–1FFF
-
General Punctuation2000–206F
-
Superscripts and Subscripts2070–209F
-
Currency Symbols20A0–20CF
-
Combining Diacritical Marks for Symbols20D0–20FF
-
Letterlike Symbols2100–214F
-
Number Forms2150–218F
-
Arrows2190–21FF
-
Mathematical Operators2200–22FF
-
Miscellaneous Technical2300–23FF
-
Control Pictures2400–243F
-
Optical Character Recognition2440–245F
-
Enclosed Alphanumerics2460–24FF
-
Box Drawing2500–257F
-
Block Elements2580–259F
-
Geometric Shapes25A0–25FF
-
Miscellaneous Symbols2600–26FF
-
Dingbats2700–27BF
-
Miscellaneous Mathematical Symbols-A27C0–27EF
-
Supplemental Arrows-A27F0–27FF
-
Braille Patterns2800–28FF
-
Supplemental Arrows-B2900–297F
-
Miscellaneous Mathematical Symbols-B2980–29FF
-
Supplemental Mathematical Operators2A00–2AFF
-
Miscellaneous Symbols and Arrows2B00–2BFF
-
Glagolitic2C00–2C5F
-
Latin Extended-C2C60–2C7F
-
Coptic2C80–2CFF
-
Georgian Supplement2D00–2D2F
-
Tifinagh2D30–2D7F
-
Ethiopic Extended2D80–2DDF
-
Cyrillic Extended-A2DE0–2DFF
-
Supplemental Punctuation2E00–2E7F
-
CJK Radicals Supplement2E80–2EFF
-
Kangxi Radicals2F00–2FDF
-
Ideographic Description Characters2FF0–2FFF
-
CJK Symbols and Punctuation3000–303F
-
Hiragana3040–309F
-
Katakana30A0–30FF
-
Bopomofo3100–312F
-
Hangul Compatibility Jamo3130–318F
-
Kanbun3190–319F
-
Bopomofo Extended31A0–31BF
-
CJK Strokes31C0–31EF
-
Katakana Phonetic Extensions31F0–31FF
-
Enclosed CJK Letters and Months3200–32FF
-
CJK Compatibility3300–33FF
-
CJK Unified Ideographs Extension A3400–4DBF
-
Yijing Hexagram Symbols4DC0–4DFF
-
CJK Unified Ideographs4E00–9FFF
-
Yi SyllablesA000–A48F
-
Yi RadicalsA490–A4CF
-
LisuA4D0–A4FF
-
VaiA500–A63F
-
Cyrillic Extended-BA640–A69F
-
BamumA6A0–A6FF
-
Modifier Tone LettersA700–A71F
-
Latin Extended-DA720–A7FF
-
Syloti NagriA800–A82F
-
Common Indic Number FormsA830–A83F
-
Phags-paA840–A87F
-
SaurashtraA880–A8DF
-
Devanagari ExtendedA8E0–A8FF
-
Kayah LiA900–A92F
-
RejangA930–A95F
-
Hangul Jamo Extended-AA960–A97F
-
JavaneseA980–A9DF
-
Myanmar Extended-BA9E0–A9FF
-
ChamAA00–AA5F
-
Myanmar Extended-AAA60–AA7F
-
Tai VietAA80–AADF
-
Meetei Mayek ExtensionsAAE0–AAFF
-
Ethiopic Extended-AAB00–AB2F
-
Latin Extended-EAB30–AB6F
-
Cherokee SupplementAB70–ABBF
-
Meetei MayekABC0–ABFF
-
Hangul SyllablesAC00–D7AF
-
Hangul Jamo Extended-BD7B0–D7FF
-
High SurrogatesD800–DB7F
-
High Private Use SurrogatesDB80–DBFF
-
Low SurrogatesDC00–DFFF
-
Private Use AreaE000–F8FF
-
CJK Compatibility IdeographsF900–FAFF
-
Alphabetic Presentation FormsFB00–FB4F
-
Arabic Presentation Forms-AFB50–FDFF
-
Variation SelectorsFE00–FE0F
-
Vertical FormsFE10–FE1F
-
Combining Half MarksFE20–FE2F
-
CJK Compatibility FormsFE30–FE4F
-
Small Form VariantsFE50–FE6F
-
Arabic Presentation Forms-BFE70–FEFF
-
Halfwidth and Fullwidth FormsFF00–FFEF
-
SpecialsFFF0–FFFF
-
-
1: Supplementary Multilingual Plane
-
Linear B Syllabary10000–1007F
-
Linear B Ideograms10080–100FF
-
Aegean Numbers10100–1013F
-
Ancient Greek Numbers10140–1018F
-
Ancient Symbols10190–101CF
-
Phaistos Disc101D0–101FF
-
Lycian10280–1029F
-
Carian102A0–102DF
-
Coptic Epact Numbers102E0–102FF
-
Old Italic10300–1032F
-
Gothic10330–1034F
-
Old Permic10350–1037F
-
Ugaritic10380–1039F
-
Old Persian103A0–103DF
-
Deseret10400–1044F
-
Shavian10450–1047F
-
Osmanya10480–104AF
-
Osage104B0–104FF
-
Elbasan10500–1052F
-
Caucasian Albanian10530–1056F
-
Vithkuqi10570–105BF
-
Todhri105C0–105FF
-
Linear A10600–1077F
-
Latin Extended-F10780–107BF
-
Cypriot Syllabary10800–1083F
-
Imperial Aramaic10840–1085F
-
Palmyrene10860–1087F
-
Nabataean10880–108AF
-
Hatran108E0–108FF
-
Phoenician10900–1091F
-
Lydian10920–1093F
-
Sidetic10940–1095F
-
Meroitic Hieroglyphs10980–1099F
-
Meroitic Cursive109A0–109FF
-
Kharoshthi10A00–10A5F
-
Old South Arabian10A60–10A7F
-
Old North Arabian10A80–10A9F
-
Manichaean10AC0–10AFF
-
Avestan10B00–10B3F
-
Inscriptional Parthian10B40–10B5F
-
Inscriptional Pahlavi10B60–10B7F
-
Psalter Pahlavi10B80–10BAF
-
Old Turkic10C00–10C4F
-
Old Hungarian10C80–10CFF
-
Hanifi Rohingya10D00–10D3F
-
Garay10D40–10D8F
-
Rumi Numeral Symbols10E60–10E7F
-
Yezidi10E80–10EBF
-
Arabic Extended-C10EC0–10EFF
-
Old Sogdian10F00–10F2F
-
Sogdian10F30–10F6F
-
Old Uyghur10F70–10FAF
-
Chorasmian10FB0–10FDF
-
Elymaic10FE0–10FFF
-
Brahmi11000–1107F
-
Kaithi11080–110CF
-
Sora Sompeng110D0–110FF
-
Chakma11100–1114F
-
Mahajani11150–1117F
-
Sharada11180–111DF
-
Sinhala Archaic Numbers111E0–111FF
-
Khojki11200–1124F
-
Multani11280–112AF
-
Khudawadi112B0–112FF
-
Grantha11300–1137F
-
Tulu-Tigalari11380–113FF
-
Newa11400–1147F
-
Tirhuta11480–114DF
-
Siddham11580–115FF
-
Modi11600–1165F
-
Mongolian Supplement11660–1167F
-
Takri11680–116CF
-
Myanmar Extended-C116D0–116FF
-
Ahom11700–1174F
-
Dogra11800–1184F
-
Warang Citi118A0–118FF
-
Dives Akuru11900–1195F
-
Nandinagari119A0–119FF
-
Zanabazar Square11A00–11A4F
-
Soyombo11A50–11AAF
-
Unified Canadian Aboriginal Syllabics Extended-A11AB0–11ABF
-
Pau Cin Hau11AC0–11AFF
-
Devanagari Extended-A11B00–11B5F
-
Sharada Supplement11B60–11B7F
-
Sunuwar11BC0–11BFF
-
Bhaiksuki11C00–11C6F
-
Marchen11C70–11CBF
-
Masaram Gondi11D00–11D5F
-
Gunjala Gondi11D60–11DAF
-
Tolong Siki11DB0–11DEF
-
Makasar11EE0–11EFF
-
Kawi11F00–11F5F
-
Lisu Supplement11FB0–11FBF
-
Tamil Supplement11FC0–11FFF
-
Cuneiform12000–123FF
-
Cuneiform Numbers and Punctuation12400–1247F
-
Early Dynastic Cuneiform12480–1254F
-
Cypro-Minoan12F90–12FFF
-
Egyptian Hieroglyphs13000–1342F
-
Egyptian Hieroglyph Format Controls13430–1345F
-
Egyptian Hieroglyphs Extended-A13460–143FF
-
Anatolian Hieroglyphs14400–1467F
-
Gurung Khema16100–1613F
-
Bamum Supplement16800–16A3F
-
Mro16A40–16A6F
-
Tangsa16A70–16ACF
-
Bassa Vah16AD0–16AFF
-
Pahawh Hmong16B00–16B8F
-
Kirat Rai16D40–16D7F
-
Medefaidrin16E40–16E9F
-
Beria Erfe16EA0–16EDF
-
Miao16F00–16F9F
-
Ideographic Symbols and Punctuation16FE0–16FFF
-
Tangut17000–187FF
-
Tangut Components18800–18AFF
-
Khitan Small Script18B00–18CFF
-
Tangut Supplement18D00–18D7F
-
Tangut Components Supplement18D80–18DFF
-
Kana Extended-B1AFF0–1AFFF
-
Kana Supplement1B000–1B0FF
-
Kana Extended-A1B100–1B12F
-
Small Kana Extension1B130–1B16F
-
Nushu1B170–1B2FF
-
Duployan1BC00–1BC9F
-
Shorthand Format Controls1BCA0–1BCAF
-
Symbols for Legacy Computing Supplement1CC00–1CEBF
-
Miscellaneous Symbols Supplement1CEC0–1CEFF
-
Znamenny Musical Notation1CF00–1CFCF
-
Byzantine Musical Symbols1D000–1D0FF
-
Musical Symbols1D100–1D1FF
-
Ancient Greek Musical Notation1D200–1D24F
-
Kaktovik Numerals1D2C0–1D2DF
-
Mayan Numerals1D2E0–1D2FF
-
Tai Xuan Jing Symbols1D300–1D35F
-
Counting Rod Numerals1D360–1D37F
-
Mathematical Alphanumeric Symbols1D400–1D7FF
-
Sutton SignWriting1D800–1DAAF
-
Sutton SignWriting1D800–1DAAF
-
Latin Extended-G1DF00–1DFFF
-
Glagolitic Supplement1E000–1E02F
-
Cyrillic Extended-D1E030–1E08F
-
Nyiakeng Puachue Hmong1E100–1E14F
-
Toto1E290–1E2BF
-
Wancho1E2C0–1E2FF
-
Nag Mundari1E4D0–1E4FF
-
Ol Onal1E5D0–1E5FF
-
Tai Yo1E6C0–1E6FF
-
Ethiopic Extended-B1E7E0–1E7FF
-
Mende Kikakui1E800–1E8DF
-
Adlam1E900–1E95F
-
Indic Siyaq Numbers1EC70–1ECBF
-
Ottoman Siyaq Numbers1ED00–1ED4F
-
Arabic Mathematical Alphabetic Symbols1EE00–1EEFF
-
Mahjong Tiles1F000–1F02F
-
Domino Tiles1F030–1F09F
-
Playing Cards1F0A0–1F0FF
-
Enclosed Alphanumeric Supplement1F100–1F1FF
-
Enclosed Ideographic Supplement1F200–1F2FF
-
Miscellaneous Symbols and Pictographs1F300–1F5FF
-
Emoticons1F600–1F64F
-
Ornamental Dingbats1F650–1F67F
-
Transport and Map Symbols1F680–1F6FF
-
Alchemical Symbols1F700–1F77F
-
Geometric Shapes Extended1F780–1F7FF
-
Supplemental Arrows-C1F800–1F8FF
-
Supplemental Symbols and Pictographs1F900–1F9FF
-
Chess Symbols1FA00–1FA6F
-
Symbols and Pictographs Extended-A1FA70–1FAFF
-
Symbols for Legacy Computing1FB00–1FBFF
-
-
2: Supplementary Ideographic Plane
-
CJK Unified Ideographs Extension B20000–2A6DF
-
CJK Unified Ideographs Extension C2A700–2B73F
-
CJK Unified Ideographs Extension D2B740–2B81F
-
CJK Unified Ideographs Extension E2B820–2CEAF
-
CJK Unified Ideographs Extension F2CEB0–2EBEF
-
CJK Unified Ideographs Extension I2EBF0–2EE5F
-
CJK Compatibility Ideographs Supplement2F800–2FA1F
-
-
3: Tertiary Ideographic Plane
-
-
Planes 4–13: Not used
-
-
-
-
-
-
-
-
-
14: Supplementary Special-purpose Plane
-
15: Supplementary Private Use Area Plane – A
-
16: Supplementary Private Use Area Plane – B
-
Nothing found
┐( ˘_˘ )┌