Basic Latin
The Basic Latin (or C0 Controls and Basic Latin) Unicode block is the first block of the Unicode standard, and the only block which is encoded in one byte in UTF-8. The block contains all the letters and control codes of the ASCII encoding.
The Basic Latin block was included in its present from version 1.0.0 of the Unicode Standard, without addition or alteration of the character repertoire.
The classical Latin alphabet, also known as the Roman alphabet, is a writing system that evolved from the visually similar Cumaean Greek version of the . The Greek alphabet, including the Cumaean version, descended from the Phoenician abjad. The Etruscans who ruled early Rome adopted and modified the Cumaean Greek alphabet. The Etruscan alphabet was in turn adopted and further modified by the ancient Romans to write the Latin language.
During the Middle Ages scribes adapted the Latin alphabet for writing Romance languages, direct descendants of Latin, as well as Celtic, Germanic, Baltic, and some Slavic languages. With the age of colonialism and Christian evangelism, the Latin script spread beyond Europe, coming into use for writing indigenous American, Australian, Austronesian, Austroasiatic, and African languages. More recently, linguists have also tended to prefer the Latin script or the International Phonetic Alphabet (itself largely based on Latin script) when transcribing or creating written standards for non-European languages, such as the African reference alphabet.
The term Latin alphabet may refer to either the alphabet used to write Latin (as described in this article), or other alphabets based on the Latin script, which is the basic set of letters common to the various alphabets descended from the classical Latin one, such as the English alphabet. These Latin alphabets may discard letters, like the Rotokas alphabet, or add new letters, like the Danish and Norwegian alphabets. Letter shapes have evolved over the centuries, including the creation for Medieval Latin of lower-case forms which did not exist in the Classical period.
Properties
| Range | 0000–007F |
| Characters | 128 |
C0 controls
-
0000Null
-
0001Start of Heading
-
0002Start of Text
-
0003End of Text
-
0004End of Transmission
-
0005Enquiry
-
0006Acknowledge
-
0007Bell
-
0008Backspace
-
0009Horizontal Tabulation
-
000ANew Line
-
000BVertical Tabulation
-
000CForm Feed
-
000DCarriage Return
-
000EShift Out
-
000FShift In
-
0010Data Link Escape
-
0011Device Control One
-
0012Device Control Two
-
0013Device Control Three
-
0014Device Control Four
-
0015Negative Acknowledge
-
0016Synchronous Idle
-
0017End of Transmission Block
-
0018Cancel
-
0019End of Medium
-
001ASubstitute
-
001BEscape
-
001CFile Separator
-
001DGroup Separator
-
001ERecord Separator
-
001FUnit Separator
ASCII punctuation and symbols
-
0020Space
-
!0021Exclamation Mark
-
"0022Quotation Mark
-
#0023Number Sign
-
$0024Dollar Sign
-
%0025Percent Sign
-
&0026Ampersand
-
'0027Apostrophe
-
(0028Left Parenthesis
-
)0029Right Parenthesis
-
*002AAsterisk
ASCII math operator
ASCII punctuation
ASCII digits
-
00030Digit Zero
-
10031Digit One
-
20032Digit Two
-
30033Digit Three
-
40034Digit Four
-
50035Digit Five
-
60036Digit Six
-
70037Digit Seven
-
80038Digit Eight
-
90039Digit Nine
ASCII punctuation
ASCII mathematical operators
ASCII punctuation
Uppercase Latin alphabet
-
A0041Latin Capital Letter A
-
B0042Latin Capital Letter B
-
C0043Latin Capital Letter C
-
D0044Latin Capital Letter D
-
E0045Latin Capital Letter E
-
F0046Latin Capital Letter F
-
G0047Latin Capital Letter G
-
H0048Latin Capital Letter H
-
I0049Latin Capital Letter I
-
J004ALatin Capital Letter J
-
K004BLatin Capital Letter K
-
L004CLatin Capital Letter L
-
M004DLatin Capital Letter M
-
N004ELatin Capital Letter N
-
O004FLatin Capital Letter O
-
P0050Latin Capital Letter P
-
Q0051Latin Capital Letter Q
-
R0052Latin Capital Letter R
-
S0053Latin Capital Letter S
-
T0054Latin Capital Letter T
-
U0055Latin Capital Letter U
-
V0056Latin Capital Letter V
-
W0057Latin Capital Letter W
-
X0058Latin Capital Letter X
-
Y0059Latin Capital Letter Y
-
Z005ALatin Capital Letter Z
ASCII punctuation and symbols
-
[005BLeft Square Bracket
-
\005CReverse Solidus
-
]005DRight Square Bracket
-
^005ECircumflex Accent
-
_005FLow Line
-
`0060Grave Accent
Lowercase Latin alphabet
-
a0061Latin Small Letter A
-
b0062Latin Small Letter B
-
c0063Latin Small Letter C
-
d0064Latin Small Letter D
-
e0065Latin Small Letter E
-
f0066Latin Small Letter F
-
g0067Latin Small Letter G
-
h0068Latin Small Letter H
-
i0069Latin Small Letter I
-
j006ALatin Small Letter J
-
k006BLatin Small Letter K
-
l006CLatin Small Letter L
-
m006DLatin Small Letter M
-
n006ELatin Small Letter N
-
o006FLatin Small Letter O
-
p0070Latin Small Letter P
-
q0071Latin Small Letter Q
-
r0072Latin Small Letter R
-
s0073Latin Small Letter S
-
t0074Latin Small Letter T
-
u0075Latin Small Letter U
-
v0076Latin Small Letter V
-
w0077Latin Small Letter W
-
x0078Latin Small Letter X
-
y0079Latin Small Letter Y
-
z007ALatin Small Letter Z
ASCII punctuation and symbols
Control character
-
0: Basic Multilingual Plane
-
Basic Latin0000–007F
-
Latin-1 Supplement0080–00FF
-
Latin Extended-A0100–017F
-
Latin Extended-B0180–024F
-
IPA Extensions0250–02AF
-
Spacing Modifier Letters02B0–02FF
-
Combining Diacritical Marks0300–036F
-
Greek and Coptic0370–03FF
-
Cyrillic0400–04FF
-
Cyrillic Supplement0500–052F
-
Armenian0530–058F
-
Hebrew0590–05FF
-
Arabic0600–06FF
-
Syriac0700–074F
-
Arabic Supplement0750–077F
-
Thaana0780–07BF
-
NKo07C0–07FF
-
Samaritan0800–083F
-
Mandaic0840–085F
-
Syriac Supplement0860–086F
-
Arabic Extended-B0870–089F
-
Arabic Extended-A08A0–08FF
-
Devanagari0900–097F
-
Bengali0980–09FF
-
Gurmukhi0A00–0A7F
-
Gujarati0A80–0AFF
-
Oriya0B00–0B7F
-
Tamil0B80–0BFF
-
Telugu0C00–0C7F
-
Kannada0C80–0CFF
-
Malayalam0D00–0D7F
-
Sinhala0D80–0DFF
-
Thai0E00–0E7F
-
Lao0E80–0EFF
-
Tibetan0F00–0FFF
-
Myanmar1000–109F
-
Georgian10A0–10FF
-
Hangul Jamo1100–11FF
-
Ethiopic1200–137F
-
Ethiopic Supplement1380–139F
-
Cherokee13A0–13FF
-
Unified Canadian Aboriginal Syllabics1400–167F
-
Ogham1680–169F
-
Runic16A0–16FF
-
Tagalog1700–171F
-
Hanunoo1720–173F
-
Buhid1740–175F
-
Tagbanwa1760–177F
-
Khmer1780–17FF
-
Mongolian1800–18AF
-
Unified Canadian Aboriginal Syllabics Extended18B0–18FF
-
Limbu1900–194F
-
Tai Le1950–197F
-
New Tai Lue1980–19DF
-
Khmer Symbols19E0–19FF
-
Buginese1A00–1A1F
-
Tai Tham1A20–1AAF
-
Combining Diacritical Marks Extended1AB0–1AFF
-
Balinese1B00–1B7F
-
Sundanese1B80–1BBF
-
Batak1BC0–1BFF
-
Lepcha1C00–1C4F
-
Ol Chiki1C50–1C7F
-
Cyrillic Extended-C1C80–1C8F
-
Georgian Extended1C90–1CBF
-
Sundanese Supplement1CC0–1CCF
-
Vedic Extensions1CD0–1CFF
-
Phonetic Extensions1D00–1D7F
-
Phonetic Extensions Supplement1D80–1DBF
-
Combining Diacritical Marks Supplement1DC0–1DFF
-
Latin Extended Additional1E00–1EFF
-
Greek Extended1F00–1FFF
-
General Punctuation2000–206F
-
Superscripts and Subscripts2070–209F
-
Currency Symbols20A0–20CF
-
Combining Diacritical Marks for Symbols20D0–20FF
-
Letterlike Symbols2100–214F
-
Number Forms2150–218F
-
Arrows2190–21FF
-
Mathematical Operators2200–22FF
-
Miscellaneous Technical2300–23FF
-
Control Pictures2400–243F
-
Optical Character Recognition2440–245F
-
Enclosed Alphanumerics2460–24FF
-
Box Drawing2500–257F
-
Block Elements2580–259F
-
Geometric Shapes25A0–25FF
-
Miscellaneous Symbols2600–26FF
-
Dingbats2700–27BF
-
Miscellaneous Mathematical Symbols-A27C0–27EF
-
Supplemental Arrows-A27F0–27FF
-
Braille Patterns2800–28FF
-
Supplemental Arrows-B2900–297F
-
Miscellaneous Mathematical Symbols-B2980–29FF
-
Supplemental Mathematical Operators2A00–2AFF
-
Miscellaneous Symbols and Arrows2B00–2BFF
-
Glagolitic2C00–2C5F
-
Latin Extended-C2C60–2C7F
-
Coptic2C80–2CFF
-
Georgian Supplement2D00–2D2F
-
Tifinagh2D30–2D7F
-
Ethiopic Extended2D80–2DDF
-
Cyrillic Extended-A2DE0–2DFF
-
Supplemental Punctuation2E00–2E7F
-
CJK Radicals Supplement2E80–2EFF
-
Kangxi Radicals2F00–2FDF
-
Ideographic Description Characters2FF0–2FFF
-
CJK Symbols and Punctuation3000–303F
-
Hiragana3040–309F
-
Katakana30A0–30FF
-
Bopomofo3100–312F
-
Hangul Compatibility Jamo3130–318F
-
Kanbun3190–319F
-
Bopomofo Extended31A0–31BF
-
CJK Strokes31C0–31EF
-
Katakana Phonetic Extensions31F0–31FF
-
Enclosed CJK Letters and Months3200–32FF
-
CJK Compatibility3300–33FF
-
CJK Unified Ideographs Extension A3400–4DBF
-
Yijing Hexagram Symbols4DC0–4DFF
-
CJK Unified Ideographs4E00–9FFF
-
Yi SyllablesA000–A48F
-
Yi RadicalsA490–A4CF
-
LisuA4D0–A4FF
-
VaiA500–A63F
-
Cyrillic Extended-BA640–A69F
-
BamumA6A0–A6FF
-
Modifier Tone LettersA700–A71F
-
Latin Extended-DA720–A7FF
-
Syloti NagriA800–A82F
-
Common Indic Number FormsA830–A83F
-
Phags-paA840–A87F
-
SaurashtraA880–A8DF
-
Devanagari ExtendedA8E0–A8FF
-
Kayah LiA900–A92F
-
RejangA930–A95F
-
Hangul Jamo Extended-AA960–A97F
-
JavaneseA980–A9DF
-
Myanmar Extended-BA9E0–A9FF
-
ChamAA00–AA5F
-
Myanmar Extended-AAA60–AA7F
-
Tai VietAA80–AADF
-
Meetei Mayek ExtensionsAAE0–AAFF
-
Ethiopic Extended-AAB00–AB2F
-
Latin Extended-EAB30–AB6F
-
Cherokee SupplementAB70–ABBF
-
Meetei MayekABC0–ABFF
-
Hangul SyllablesAC00–D7AF
-
Hangul Jamo Extended-BD7B0–D7FF
-
High SurrogatesD800–DB7F
-
High Private Use SurrogatesDB80–DBFF
-
Low SurrogatesDC00–DFFF
-
Private Use AreaE000–F8FF
-
CJK Compatibility IdeographsF900–FAFF
-
Alphabetic Presentation FormsFB00–FB4F
-
Arabic Presentation Forms-AFB50–FDFF
-
Variation SelectorsFE00–FE0F
-
Vertical FormsFE10–FE1F
-
Combining Half MarksFE20–FE2F
-
CJK Compatibility FormsFE30–FE4F
-
Small Form VariantsFE50–FE6F
-
Arabic Presentation Forms-BFE70–FEFF
-
Halfwidth and Fullwidth FormsFF00–FFEF
-
SpecialsFFF0–FFFF
-
-
1: Supplementary Multilingual Plane
-
Linear B Syllabary10000–1007F
-
Linear B Ideograms10080–100FF
-
Aegean Numbers10100–1013F
-
Ancient Greek Numbers10140–1018F
-
Ancient Symbols10190–101CF
-
Phaistos Disc101D0–101FF
-
Lycian10280–1029F
-
Carian102A0–102DF
-
Coptic Epact Numbers102E0–102FF
-
Old Italic10300–1032F
-
Gothic10330–1034F
-
Old Permic10350–1037F
-
Ugaritic10380–1039F
-
Old Persian103A0–103DF
-
Deseret10400–1044F
-
Shavian10450–1047F
-
Osmanya10480–104AF
-
Osage104B0–104FF
-
Elbasan10500–1052F
-
Caucasian Albanian10530–1056F
-
Vithkuqi10570–105BF
-
Todhri105C0–105FF
-
Linear A10600–1077F
-
Latin Extended-F10780–107BF
-
Cypriot Syllabary10800–1083F
-
Imperial Aramaic10840–1085F
-
Palmyrene10860–1087F
-
Nabataean10880–108AF
-
Hatran108E0–108FF
-
Phoenician10900–1091F
-
Lydian10920–1093F
-
Sidetic10940–1095F
-
Meroitic Hieroglyphs10980–1099F
-
Meroitic Cursive109A0–109FF
-
Kharoshthi10A00–10A5F
-
Old South Arabian10A60–10A7F
-
Old North Arabian10A80–10A9F
-
Manichaean10AC0–10AFF
-
Avestan10B00–10B3F
-
Inscriptional Parthian10B40–10B5F
-
Inscriptional Pahlavi10B60–10B7F
-
Psalter Pahlavi10B80–10BAF
-
Old Turkic10C00–10C4F
-
Old Hungarian10C80–10CFF
-
Hanifi Rohingya10D00–10D3F
-
Garay10D40–10D8F
-
Rumi Numeral Symbols10E60–10E7F
-
Yezidi10E80–10EBF
-
Arabic Extended-C10EC0–10EFF
-
Old Sogdian10F00–10F2F
-
Sogdian10F30–10F6F
-
Old Uyghur10F70–10FAF
-
Chorasmian10FB0–10FDF
-
Elymaic10FE0–10FFF
-
Brahmi11000–1107F
-
Kaithi11080–110CF
-
Sora Sompeng110D0–110FF
-
Chakma11100–1114F
-
Mahajani11150–1117F
-
Sharada11180–111DF
-
Sinhala Archaic Numbers111E0–111FF
-
Khojki11200–1124F
-
Multani11280–112AF
-
Khudawadi112B0–112FF
-
Grantha11300–1137F
-
Tulu-Tigalari11380–113FF
-
Newa11400–1147F
-
Tirhuta11480–114DF
-
Siddham11580–115FF
-
Modi11600–1165F
-
Mongolian Supplement11660–1167F
-
Takri11680–116CF
-
Myanmar Extended-C116D0–116FF
-
Ahom11700–1174F
-
Dogra11800–1184F
-
Warang Citi118A0–118FF
-
Dives Akuru11900–1195F
-
Nandinagari119A0–119FF
-
Zanabazar Square11A00–11A4F
-
Soyombo11A50–11AAF
-
Unified Canadian Aboriginal Syllabics Extended-A11AB0–11ABF
-
Pau Cin Hau11AC0–11AFF
-
Devanagari Extended-A11B00–11B5F
-
Sharada Supplement11B60–11B7F
-
Sunuwar11BC0–11BFF
-
Bhaiksuki11C00–11C6F
-
Marchen11C70–11CBF
-
Masaram Gondi11D00–11D5F
-
Gunjala Gondi11D60–11DAF
-
Tolong Siki11DB0–11DEF
-
Makasar11EE0–11EFF
-
Kawi11F00–11F5F
-
Lisu Supplement11FB0–11FBF
-
Tamil Supplement11FC0–11FFF
-
Cuneiform12000–123FF
-
Cuneiform Numbers and Punctuation12400–1247F
-
Early Dynastic Cuneiform12480–1254F
-
Cypro-Minoan12F90–12FFF
-
Egyptian Hieroglyphs13000–1342F
-
Egyptian Hieroglyph Format Controls13430–1345F
-
Egyptian Hieroglyphs Extended-A13460–143FF
-
Anatolian Hieroglyphs14400–1467F
-
Gurung Khema16100–1613F
-
Bamum Supplement16800–16A3F
-
Mro16A40–16A6F
-
Tangsa16A70–16ACF
-
Bassa Vah16AD0–16AFF
-
Pahawh Hmong16B00–16B8F
-
Kirat Rai16D40–16D7F
-
Medefaidrin16E40–16E9F
-
Beria Erfe16EA0–16EDF
-
Miao16F00–16F9F
-
Ideographic Symbols and Punctuation16FE0–16FFF
-
Tangut17000–187FF
-
Tangut Components18800–18AFF
-
Khitan Small Script18B00–18CFF
-
Tangut Supplement18D00–18D7F
-
Tangut Components Supplement18D80–18DFF
-
Kana Extended-B1AFF0–1AFFF
-
Kana Supplement1B000–1B0FF
-
Kana Extended-A1B100–1B12F
-
Small Kana Extension1B130–1B16F
-
Nushu1B170–1B2FF
-
Duployan1BC00–1BC9F
-
Shorthand Format Controls1BCA0–1BCAF
-
Symbols for Legacy Computing Supplement1CC00–1CEBF
-
Miscellaneous Symbols Supplement1CEC0–1CEFF
-
Znamenny Musical Notation1CF00–1CFCF
-
Byzantine Musical Symbols1D000–1D0FF
-
Musical Symbols1D100–1D1FF
-
Ancient Greek Musical Notation1D200–1D24F
-
Kaktovik Numerals1D2C0–1D2DF
-
Mayan Numerals1D2E0–1D2FF
-
Tai Xuan Jing Symbols1D300–1D35F
-
Counting Rod Numerals1D360–1D37F
-
Mathematical Alphanumeric Symbols1D400–1D7FF
-
Sutton SignWriting1D800–1DAAF
-
Sutton SignWriting1D800–1DAAF
-
Latin Extended-G1DF00–1DFFF
-
Glagolitic Supplement1E000–1E02F
-
Cyrillic Extended-D1E030–1E08F
-
Nyiakeng Puachue Hmong1E100–1E14F
-
Toto1E290–1E2BF
-
Wancho1E2C0–1E2FF
-
Nag Mundari1E4D0–1E4FF
-
Ol Onal1E5D0–1E5FF
-
Tai Yo1E6C0–1E6FF
-
Ethiopic Extended-B1E7E0–1E7FF
-
Mende Kikakui1E800–1E8DF
-
Adlam1E900–1E95F
-
Indic Siyaq Numbers1EC70–1ECBF
-
Ottoman Siyaq Numbers1ED00–1ED4F
-
Arabic Mathematical Alphabetic Symbols1EE00–1EEFF
-
Mahjong Tiles1F000–1F02F
-
Domino Tiles1F030–1F09F
-
Playing Cards1F0A0–1F0FF
-
Enclosed Alphanumeric Supplement1F100–1F1FF
-
Enclosed Ideographic Supplement1F200–1F2FF
-
Miscellaneous Symbols and Pictographs1F300–1F5FF
-
Emoticons1F600–1F64F
-
Ornamental Dingbats1F650–1F67F
-
Transport and Map Symbols1F680–1F6FF
-
Alchemical Symbols1F700–1F77F
-
Geometric Shapes Extended1F780–1F7FF
-
Supplemental Arrows-C1F800–1F8FF
-
Supplemental Symbols and Pictographs1F900–1F9FF
-
Chess Symbols1FA00–1FA6F
-
Symbols and Pictographs Extended-A1FA70–1FAFF
-
Symbols for Legacy Computing1FB00–1FBFF
-
-
2: Supplementary Ideographic Plane
-
CJK Unified Ideographs Extension B20000–2A6DF
-
CJK Unified Ideographs Extension C2A700–2B73F
-
CJK Unified Ideographs Extension D2B740–2B81F
-
CJK Unified Ideographs Extension E2B820–2CEAF
-
CJK Unified Ideographs Extension F2CEB0–2EBEF
-
CJK Unified Ideographs Extension I2EBF0–2EE5F
-
CJK Compatibility Ideographs Supplement2F800–2FA1F
-
-
3: Tertiary Ideographic Plane
-
-
Planes 4–13: Not used
-
-
-
-
-
-
-
-
-
14: Supplementary Special-purpose Plane
-
15: Supplementary Private Use Area Plane – A
-
16: Supplementary Private Use Area Plane – B
-
Nothing found
┐( ˘_˘ )┌