Combining Diacritical Marks
Combining Diacritical Marks is a Unicode block containing the most common combining characters. It also contains the Combining Grapheme Joiner, which prevents canonical reordering of combining characters, and despite the name, actually separates characters that would otherwise be considered a single grapheme in a given context.
A diacritic /daɪ.əˈkrɪtɨk/ – also diacritical mark, diacritical point, or diacritical sign – is a glyph added to a letter, or basic glyph. The term derives from the Greek διακριτικός (diakritikós, “distinguishing”, from ancient Greek διά (diá, through) and κρίνω (krínein, to separate)). Diacritic is primarily an adjective, though sometimes used as a noun, whereas diacritical is only ever an adjective. Some diacritical marks, such as the acute (´) and grave (`), are often called accents. Diacritical marks may appear above or below a letter, or in some other position such as within the letter or between two letters.
The main use of diacritical marks in the Latin script is to change the sound-values of the letters to which they are added. Examples from English are the diaereses in naïve and Noël, which show that the vowel with the diaeresis mark is pronounced separately from the preceding vowel; the acute and grave accents, which can indicate that a final vowel is to be pronounced, as in saké and poetic breathèd; and the cedilla under the “c” in the borrowed French word façade, which shows it is pronounced /s/ rather than /k/. In other Latin alphabets, they may distinguish between homonyms, such as the French là (“there”) versus la (“the”), which are both pronounced . In Gaelic type, a dot over a consonant indicates lenition of the consonant in question.
In other alphabetic systems, diacritical marks may perform other functions. Vowel pointing systems, namely the Arabic harakat ( ـَ, ـُ, ـُ, etc.) and the Hebrew niqqud ( ַ, ֶ, ִ, ֹ , ֻ, etc.) systems, indicate sounds (vowels and tones) that are not conveyed by the basic alphabet. The Indic virama ( ् etc.) and the Arabic sukūn ( ـْـ ) mark the absence of a vowel. Cantillation marks indicate prosody. Other uses include the Early Cyrillic titlo ( ◌҃ ) and the Hebrew gershayim ( ״ ), which, respectively, mark abbreviations or acronyms, and Greek diacritical marks, which showed that letters of the alphabet were being used as numerals. In the Hanyu Pinyin official romanization system for Chinese, diacritics are used to mark the tones of the syllables in which the marked vowels occur.
In orthography and collation, a letter modified by a diacritic may be treated either as a new, distinct letter or as a letter–diacritic combination. This varies from language to language, and may vary from case to case within a language.
In some cases, letters are used as “in-line diacritics” in place of ancillary glyphs, because they modify the sound of the letter preceding them, as in the case of the “h” in English “sh” and “th”.
Properties
| Range | 0300–036F |
| Characters | 112 |
Ordinary diacritics
-
̀0300Combining Grave Accent
-
́0301Combining Acute Accent
-
̂0302Combining Circumflex Accent
-
̃0303Combining Tilde
-
̄0304Combining Macron
-
̅0305Combining Overline
-
̆0306Combining Breve
-
̇0307Combining Dot Above
-
̈0308Combining Diaeresis
-
̉0309Combining Hook Above
-
̊030ACombining Ring Above
-
̋030BCombining Double Acute Accent
-
̌030CCombining Caron
-
̍030DCombining Vertical Line Above
-
̎030ECombining Double Vertical Line Above
-
̏030FCombining Double Grave Accent
-
̐0310Combining Candrabindu
-
̑0311Combining Inverted Breve
-
̒0312Combining Turned Comma Above
-
̓0313Combining Comma Above
-
̔0314Combining Reversed Comma Above
-
̕0315Combining Comma Above Right
-
̖0316Combining Grave Accent Below
-
̗0317Combining Acute Accent Below
-
̘0318Combining Left Tack Below
-
̙0319Combining Right Tack Below
-
̚031ACombining Left Angle Above
-
̛031BCombining Horn
-
̜031CCombining Left Half Ring Below
-
̝031DCombining Up Tack Below
-
̞031ECombining Down Tack Below
-
̟031FCombining Plus Sign Below
-
̠0320Combining Minus Sign Below
-
̡0321Combining Palatalized Hook Below
-
̢0322Combining Retroflex Hook Below
-
̣0323Combining Dot Below
-
̤0324Combining Diaeresis Below
-
̥0325Combining Ring Below
-
̦0326Combining Comma Below
-
̧0327Combining Cedilla
-
̨0328Combining Ogonek
-
̩0329Combining Vertical Line Below
-
̪032ACombining Bridge Below
-
̫032BCombining Inverted Double Arch Below
-
̬032CCombining Caron Below
-
̭032DCombining Circumflex Accent Below
-
̮032ECombining Breve Below
-
̯032FCombining Inverted Breve Below
-
̰0330Combining Tilde Below
-
̱0331Combining Macron Below
-
̲0332Combining Low Line
-
̳0333Combining Double Low Line
Overstruck diacritics
-
̴0334Combining Tilde Overlay
-
̵0335Combining Short Stroke Overlay
-
̶0336Combining Long Stroke Overlay
-
̷0337Combining Short Solidus Overlay
-
̸0338Combining Long Solidus Overlay
Miscellaneous additions
-
̹0339Combining Right Half Ring Below
-
̺033ACombining Inverted Bridge Below
-
̻033BCombining Square Below
-
̼033CCombining Seagull Below
-
̽033DCombining X Above
-
̾033ECombining Vertical Tilde
-
̿033FCombining Double Overline
Vietnamese tone marks
Additions for Greek
-
͂0342Combining Greek Perispomeni
-
̓0343Combining Greek Koronis
-
̈́0344Combining Greek Dialytika Tonos
-
ͅ0345Combining Greek Ypogegrammeni
Additions for IPA
-
͆0346Combining Bridge Above
-
͇0347Combining Equals Sign Below
-
͈0348Combining Double Vertical Line Below
-
͉0349Combining Left Angle Below
-
͊034ACombining Not Tilde Above
IPA diacritics for disordered speech
-
͋034BCombining Homothetic Above
-
͌034CCombining Almost Equal To Above
-
͍034DCombining Left Right Arrow Below
-
͎034ECombining Upwards Arrow Below
Miscellaneous addition
Additions for the Uralic Phonetic Alphabet
-
͐0350Combining Right Arrowhead Above
-
͑0351Combining Left Half Ring Above
-
͒0352Combining Fermata
-
͓0353Combining X Below
-
͔0354Combining Left Arrowhead Below
-
͕0355Combining Right Arrowhead Below
-
͖0356Combining Right Arrowhead and Up Arrowhead Below
-
͗0357Combining Right Half Ring Above
Miscellaneous additions
-
͘0358Combining Dot Above Right
-
͙0359Combining Asterisk Below
-
͚035ACombining Double Ring Below
-
͛035BCombining Zigzag Above
Double diacritics
-
͜035CCombining Double Breve Below
-
͝035DCombining Double Breve
-
͞035ECombining Double Macron
-
͟035FCombining Double Macron Below
-
͠0360Combining Double Tilde
-
͡0361Combining Double Inverted Breve
-
͢0362Combining Double Rightwards Arrow Below
Medieval superscript letter diacritics
-
ͣ0363Combining Latin Small Letter A
-
ͤ0364Combining Latin Small Letter E
-
ͥ0365Combining Latin Small Letter I
-
ͦ0366Combining Latin Small Letter O
-
ͧ0367Combining Latin Small Letter U
-
ͨ0368Combining Latin Small Letter C
-
ͩ0369Combining Latin Small Letter D
-
ͪ036ACombining Latin Small Letter H
-
ͫ036BCombining Latin Small Letter M
-
ͬ036CCombining Latin Small Letter R
-
ͭ036DCombining Latin Small Letter T
-
ͮ036ECombining Latin Small Letter V
-
ͯ036FCombining Latin Small Letter X
-
0: Basic Multilingual Plane
-
Basic Latin0000–007F
-
Latin-1 Supplement0080–00FF
-
Latin Extended-A0100–017F
-
Latin Extended-B0180–024F
-
IPA Extensions0250–02AF
-
Spacing Modifier Letters02B0–02FF
-
Combining Diacritical Marks0300–036F
-
Greek and Coptic0370–03FF
-
Cyrillic0400–04FF
-
Cyrillic Supplement0500–052F
-
Armenian0530–058F
-
Hebrew0590–05FF
-
Arabic0600–06FF
-
Syriac0700–074F
-
Arabic Supplement0750–077F
-
Thaana0780–07BF
-
NKo07C0–07FF
-
Samaritan0800–083F
-
Mandaic0840–085F
-
Syriac Supplement0860–086F
-
Arabic Extended-B0870–089F
-
Arabic Extended-A08A0–08FF
-
Devanagari0900–097F
-
Bengali0980–09FF
-
Gurmukhi0A00–0A7F
-
Gujarati0A80–0AFF
-
Oriya0B00–0B7F
-
Tamil0B80–0BFF
-
Telugu0C00–0C7F
-
Kannada0C80–0CFF
-
Malayalam0D00–0D7F
-
Sinhala0D80–0DFF
-
Thai0E00–0E7F
-
Lao0E80–0EFF
-
Tibetan0F00–0FFF
-
Myanmar1000–109F
-
Georgian10A0–10FF
-
Hangul Jamo1100–11FF
-
Ethiopic1200–137F
-
Ethiopic Supplement1380–139F
-
Cherokee13A0–13FF
-
Unified Canadian Aboriginal Syllabics1400–167F
-
Ogham1680–169F
-
Runic16A0–16FF
-
Tagalog1700–171F
-
Hanunoo1720–173F
-
Buhid1740–175F
-
Tagbanwa1760–177F
-
Khmer1780–17FF
-
Mongolian1800–18AF
-
Unified Canadian Aboriginal Syllabics Extended18B0–18FF
-
Limbu1900–194F
-
Tai Le1950–197F
-
New Tai Lue1980–19DF
-
Khmer Symbols19E0–19FF
-
Buginese1A00–1A1F
-
Tai Tham1A20–1AAF
-
Combining Diacritical Marks Extended1AB0–1AFF
-
Balinese1B00–1B7F
-
Sundanese1B80–1BBF
-
Batak1BC0–1BFF
-
Lepcha1C00–1C4F
-
Ol Chiki1C50–1C7F
-
Cyrillic Extended-C1C80–1C8F
-
Georgian Extended1C90–1CBF
-
Sundanese Supplement1CC0–1CCF
-
Vedic Extensions1CD0–1CFF
-
Phonetic Extensions1D00–1D7F
-
Phonetic Extensions Supplement1D80–1DBF
-
Combining Diacritical Marks Supplement1DC0–1DFF
-
Latin Extended Additional1E00–1EFF
-
Greek Extended1F00–1FFF
-
General Punctuation2000–206F
-
Superscripts and Subscripts2070–209F
-
Currency Symbols20A0–20CF
-
Combining Diacritical Marks for Symbols20D0–20FF
-
Letterlike Symbols2100–214F
-
Number Forms2150–218F
-
Arrows2190–21FF
-
Mathematical Operators2200–22FF
-
Miscellaneous Technical2300–23FF
-
Control Pictures2400–243F
-
Optical Character Recognition2440–245F
-
Enclosed Alphanumerics2460–24FF
-
Box Drawing2500–257F
-
Block Elements2580–259F
-
Geometric Shapes25A0–25FF
-
Miscellaneous Symbols2600–26FF
-
Dingbats2700–27BF
-
Miscellaneous Mathematical Symbols-A27C0–27EF
-
Supplemental Arrows-A27F0–27FF
-
Braille Patterns2800–28FF
-
Supplemental Arrows-B2900–297F
-
Miscellaneous Mathematical Symbols-B2980–29FF
-
Supplemental Mathematical Operators2A00–2AFF
-
Miscellaneous Symbols and Arrows2B00–2BFF
-
Glagolitic2C00–2C5F
-
Latin Extended-C2C60–2C7F
-
Coptic2C80–2CFF
-
Georgian Supplement2D00–2D2F
-
Tifinagh2D30–2D7F
-
Ethiopic Extended2D80–2DDF
-
Cyrillic Extended-A2DE0–2DFF
-
Supplemental Punctuation2E00–2E7F
-
CJK Radicals Supplement2E80–2EFF
-
Kangxi Radicals2F00–2FDF
-
Ideographic Description Characters2FF0–2FFF
-
CJK Symbols and Punctuation3000–303F
-
Hiragana3040–309F
-
Katakana30A0–30FF
-
Bopomofo3100–312F
-
Hangul Compatibility Jamo3130–318F
-
Kanbun3190–319F
-
Bopomofo Extended31A0–31BF
-
CJK Strokes31C0–31EF
-
Katakana Phonetic Extensions31F0–31FF
-
Enclosed CJK Letters and Months3200–32FF
-
CJK Compatibility3300–33FF
-
CJK Unified Ideographs Extension A3400–4DBF
-
Yijing Hexagram Symbols4DC0–4DFF
-
CJK Unified Ideographs4E00–9FFF
-
Yi SyllablesA000–A48F
-
Yi RadicalsA490–A4CF
-
LisuA4D0–A4FF
-
VaiA500–A63F
-
Cyrillic Extended-BA640–A69F
-
BamumA6A0–A6FF
-
Modifier Tone LettersA700–A71F
-
Latin Extended-DA720–A7FF
-
Syloti NagriA800–A82F
-
Common Indic Number FormsA830–A83F
-
Phags-paA840–A87F
-
SaurashtraA880–A8DF
-
Devanagari ExtendedA8E0–A8FF
-
Kayah LiA900–A92F
-
RejangA930–A95F
-
Hangul Jamo Extended-AA960–A97F
-
JavaneseA980–A9DF
-
Myanmar Extended-BA9E0–A9FF
-
ChamAA00–AA5F
-
Myanmar Extended-AAA60–AA7F
-
Tai VietAA80–AADF
-
Meetei Mayek ExtensionsAAE0–AAFF
-
Ethiopic Extended-AAB00–AB2F
-
Latin Extended-EAB30–AB6F
-
Cherokee SupplementAB70–ABBF
-
Meetei MayekABC0–ABFF
-
Hangul SyllablesAC00–D7AF
-
Hangul Jamo Extended-BD7B0–D7FF
-
High SurrogatesD800–DB7F
-
High Private Use SurrogatesDB80–DBFF
-
Low SurrogatesDC00–DFFF
-
Private Use AreaE000–F8FF
-
CJK Compatibility IdeographsF900–FAFF
-
Alphabetic Presentation FormsFB00–FB4F
-
Arabic Presentation Forms-AFB50–FDFF
-
Variation SelectorsFE00–FE0F
-
Vertical FormsFE10–FE1F
-
Combining Half MarksFE20–FE2F
-
CJK Compatibility FormsFE30–FE4F
-
Small Form VariantsFE50–FE6F
-
Arabic Presentation Forms-BFE70–FEFF
-
Halfwidth and Fullwidth FormsFF00–FFEF
-
SpecialsFFF0–FFFF
-
-
1: Supplementary Multilingual Plane
-
Linear B Syllabary10000–1007F
-
Linear B Ideograms10080–100FF
-
Aegean Numbers10100–1013F
-
Ancient Greek Numbers10140–1018F
-
Ancient Symbols10190–101CF
-
Phaistos Disc101D0–101FF
-
Lycian10280–1029F
-
Carian102A0–102DF
-
Coptic Epact Numbers102E0–102FF
-
Old Italic10300–1032F
-
Gothic10330–1034F
-
Old Permic10350–1037F
-
Ugaritic10380–1039F
-
Old Persian103A0–103DF
-
Deseret10400–1044F
-
Shavian10450–1047F
-
Osmanya10480–104AF
-
Osage104B0–104FF
-
Elbasan10500–1052F
-
Caucasian Albanian10530–1056F
-
Vithkuqi10570–105BF
-
Todhri105C0–105FF
-
Linear A10600–1077F
-
Latin Extended-F10780–107BF
-
Cypriot Syllabary10800–1083F
-
Imperial Aramaic10840–1085F
-
Palmyrene10860–1087F
-
Nabataean10880–108AF
-
Hatran108E0–108FF
-
Phoenician10900–1091F
-
Lydian10920–1093F
-
Sidetic10940–1095F
-
Meroitic Hieroglyphs10980–1099F
-
Meroitic Cursive109A0–109FF
-
Kharoshthi10A00–10A5F
-
Old South Arabian10A60–10A7F
-
Old North Arabian10A80–10A9F
-
Manichaean10AC0–10AFF
-
Avestan10B00–10B3F
-
Inscriptional Parthian10B40–10B5F
-
Inscriptional Pahlavi10B60–10B7F
-
Psalter Pahlavi10B80–10BAF
-
Old Turkic10C00–10C4F
-
Old Hungarian10C80–10CFF
-
Hanifi Rohingya10D00–10D3F
-
Garay10D40–10D8F
-
Rumi Numeral Symbols10E60–10E7F
-
Yezidi10E80–10EBF
-
Arabic Extended-C10EC0–10EFF
-
Old Sogdian10F00–10F2F
-
Sogdian10F30–10F6F
-
Old Uyghur10F70–10FAF
-
Chorasmian10FB0–10FDF
-
Elymaic10FE0–10FFF
-
Brahmi11000–1107F
-
Kaithi11080–110CF
-
Sora Sompeng110D0–110FF
-
Chakma11100–1114F
-
Mahajani11150–1117F
-
Sharada11180–111DF
-
Sinhala Archaic Numbers111E0–111FF
-
Khojki11200–1124F
-
Multani11280–112AF
-
Khudawadi112B0–112FF
-
Grantha11300–1137F
-
Tulu-Tigalari11380–113FF
-
Newa11400–1147F
-
Tirhuta11480–114DF
-
Siddham11580–115FF
-
Modi11600–1165F
-
Mongolian Supplement11660–1167F
-
Takri11680–116CF
-
Myanmar Extended-C116D0–116FF
-
Ahom11700–1174F
-
Dogra11800–1184F
-
Warang Citi118A0–118FF
-
Dives Akuru11900–1195F
-
Nandinagari119A0–119FF
-
Zanabazar Square11A00–11A4F
-
Soyombo11A50–11AAF
-
Unified Canadian Aboriginal Syllabics Extended-A11AB0–11ABF
-
Pau Cin Hau11AC0–11AFF
-
Devanagari Extended-A11B00–11B5F
-
Sharada Supplement11B60–11B7F
-
Sunuwar11BC0–11BFF
-
Bhaiksuki11C00–11C6F
-
Marchen11C70–11CBF
-
Masaram Gondi11D00–11D5F
-
Gunjala Gondi11D60–11DAF
-
Tolong Siki11DB0–11DEF
-
Makasar11EE0–11EFF
-
Kawi11F00–11F5F
-
Lisu Supplement11FB0–11FBF
-
Tamil Supplement11FC0–11FFF
-
Cuneiform12000–123FF
-
Cuneiform Numbers and Punctuation12400–1247F
-
Early Dynastic Cuneiform12480–1254F
-
Cypro-Minoan12F90–12FFF
-
Egyptian Hieroglyphs13000–1342F
-
Egyptian Hieroglyph Format Controls13430–1345F
-
Egyptian Hieroglyphs Extended-A13460–143FF
-
Anatolian Hieroglyphs14400–1467F
-
Gurung Khema16100–1613F
-
Bamum Supplement16800–16A3F
-
Mro16A40–16A6F
-
Tangsa16A70–16ACF
-
Bassa Vah16AD0–16AFF
-
Pahawh Hmong16B00–16B8F
-
Kirat Rai16D40–16D7F
-
Medefaidrin16E40–16E9F
-
Beria Erfe16EA0–16EDF
-
Miao16F00–16F9F
-
Ideographic Symbols and Punctuation16FE0–16FFF
-
Tangut17000–187FF
-
Tangut Components18800–18AFF
-
Khitan Small Script18B00–18CFF
-
Tangut Supplement18D00–18D7F
-
Tangut Components Supplement18D80–18DFF
-
Kana Extended-B1AFF0–1AFFF
-
Kana Supplement1B000–1B0FF
-
Kana Extended-A1B100–1B12F
-
Small Kana Extension1B130–1B16F
-
Nushu1B170–1B2FF
-
Duployan1BC00–1BC9F
-
Shorthand Format Controls1BCA0–1BCAF
-
Symbols for Legacy Computing Supplement1CC00–1CEBF
-
Miscellaneous Symbols Supplement1CEC0–1CEFF
-
Znamenny Musical Notation1CF00–1CFCF
-
Byzantine Musical Symbols1D000–1D0FF
-
Musical Symbols1D100–1D1FF
-
Ancient Greek Musical Notation1D200–1D24F
-
Kaktovik Numerals1D2C0–1D2DF
-
Mayan Numerals1D2E0–1D2FF
-
Tai Xuan Jing Symbols1D300–1D35F
-
Counting Rod Numerals1D360–1D37F
-
Mathematical Alphanumeric Symbols1D400–1D7FF
-
Sutton SignWriting1D800–1DAAF
-
Sutton SignWriting1D800–1DAAF
-
Latin Extended-G1DF00–1DFFF
-
Glagolitic Supplement1E000–1E02F
-
Cyrillic Extended-D1E030–1E08F
-
Nyiakeng Puachue Hmong1E100–1E14F
-
Toto1E290–1E2BF
-
Wancho1E2C0–1E2FF
-
Nag Mundari1E4D0–1E4FF
-
Ol Onal1E5D0–1E5FF
-
Tai Yo1E6C0–1E6FF
-
Ethiopic Extended-B1E7E0–1E7FF
-
Mende Kikakui1E800–1E8DF
-
Adlam1E900–1E95F
-
Indic Siyaq Numbers1EC70–1ECBF
-
Ottoman Siyaq Numbers1ED00–1ED4F
-
Arabic Mathematical Alphabetic Symbols1EE00–1EEFF
-
Mahjong Tiles1F000–1F02F
-
Domino Tiles1F030–1F09F
-
Playing Cards1F0A0–1F0FF
-
Enclosed Alphanumeric Supplement1F100–1F1FF
-
Enclosed Ideographic Supplement1F200–1F2FF
-
Miscellaneous Symbols and Pictographs1F300–1F5FF
-
Emoticons1F600–1F64F
-
Ornamental Dingbats1F650–1F67F
-
Transport and Map Symbols1F680–1F6FF
-
Alchemical Symbols1F700–1F77F
-
Geometric Shapes Extended1F780–1F7FF
-
Supplemental Arrows-C1F800–1F8FF
-
Supplemental Symbols and Pictographs1F900–1F9FF
-
Chess Symbols1FA00–1FA6F
-
Symbols and Pictographs Extended-A1FA70–1FAFF
-
Symbols for Legacy Computing1FB00–1FBFF
-
-
2: Supplementary Ideographic Plane
-
CJK Unified Ideographs Extension B20000–2A6DF
-
CJK Unified Ideographs Extension C2A700–2B73F
-
CJK Unified Ideographs Extension D2B740–2B81F
-
CJK Unified Ideographs Extension E2B820–2CEAF
-
CJK Unified Ideographs Extension F2CEB0–2EBEF
-
CJK Unified Ideographs Extension I2EBF0–2EE5F
-
CJK Compatibility Ideographs Supplement2F800–2FA1F
-
-
3: Tertiary Ideographic Plane
-
-
Planes 4–13: Not used
-
-
-
-
-
-
-
-
-
14: Supplementary Special-purpose Plane
-
15: Supplementary Private Use Area Plane – A
-
16: Supplementary Private Use Area Plane – B
-
Nothing found
┐( ˘_˘ )┌