Reconhecimento óptico de caracteres
Optical character recognition (OCR) is the mechanical or electronic conversion of images of typewritten or printed text into machine-encoded text. It is widely used as a form of data entry from printed paper data records, whether passport documents, invoices, bank statements, computerized receipts, business cards, mail, printouts of static-data, or any suitable documentation. It is a common method of digitizing printed texts so that it can be electronically edited, searched, stored more compactly, displayed on-line, and used in machine processes such as machine translation, text-to-speech, key data and text mining. OCR is a field of research in pattern recognition, artificial intelligence and computer vision.
Early versions needed to be trained with images of each character, and worked on one font at a time. Advanced systems that have a high degree of recognition accuracy for most fonts are now common. Some systems are capable of reproducing formatted output that closely approximates the original page including images, columns, and other non-textual components.
Propriedades
Intervalo | 2440–245F |
Personagens | 32 |
Lista de Caracteres
-
OCR-A
-
2440⑀
-
2441⑁
-
2442⑂
-
2443⑃
-
2444⑄
-
2445⑅
-
MICR
-
2446⑆
-
2447⑇
-
2448⑈
-
2449⑉
-
OCR
-
244A⑊
-
244B
-
244C
-
244D
-
244E
-
244F
-
2450
-
2451
-
2452
-
2453
-
2454
-
2455
-
2456
-
2457
-
2458
-
2459
-
245A
-
245B
-
245C
-
245D
-
245E
-
245F
Tabela de Caracteres
-
0: Plano Multilingue Básico
-
Latino básico0000–007F
-
Suplemento Latino-10080–00FF
-
Latino estendido-A0100–017F
-
Latino estendido-B0180–024F
-
Extensões IPA0250–02AF
-
Letras de modificação de espaço02B0–02FF
-
Combinação de marcas diacríticas0300–036F
-
Grego e Copta0370–03FF
-
Cirílico0400–04FF
-
Suplemento Cirílico0500–052F
-
Armênio0530–058F
-
hebraico0590–05FF
-
árabe0600–06FF
-
Siríaco0700–074F
-
Suplemento árabe0750–077F
-
Thaana0780–07BF
-
NKo07C0–07FF
-
samaritano0800–083F
-
Mandaic0840–085F
-
Syriac Supplement0860–086F
-
Arabic Extended-B0870–089F
-
Árabe estendido-A08A0–08FF
-
Devanagari0900–097F
-
bengali0980–09FF
-
Gurmukhi0A00–0A7F
-
Gujarati0A80–0AFF
-
Oriya0B00–0B7F
-
tâmil0B80–0BFF
-
Telugu0C00–0C7F
-
Kannada0C80–0CFF
-
Malayalam0D00–0D7F
-
Sinhala0D80–0DFF
-
tailandês0E00–0E7F
-
Lao0E80–0EFF
-
Tibetano0F00–0FFF
-
Mianmar1000–109F
-
Georgiano10A0–10FF
-
Hangul Jamo1100–11FF
-
Etíope1200–137F
-
Suplemento Etíope1380–139F
-
Cherokee13A0–13FF
-
Syllabics Unificado Canadense Aborígenes1400–167F
-
Ogham1680–169F
-
Rúnico16A0–16FF
-
Tagalog1700–171F
-
Hanunoo1720–173F
-
Buhid1740–175F
-
Tagbanwa1760–177F
-
Khmer1780–17FF
-
mongol1800–18AF
-
Sílabas aborígenes canadenses unificadas ampliadas18B0–18FF
-
Limbu1900–194F
-
Tai Le1950–197F
-
Novo Tai Lue1980–19DF
-
Símbolos Khmer19E0–19FF
-
Buginese1A00–1A1F
-
Tai Tham1A20–1AAF
-
Combinação de Marcas Diacríticas Ampliadas1AB0–1AFF
-
Balinês1B00–1B7F
-
Sundanese1B80–1BBF
-
Batak1BC0–1BFF
-
Lepcha1C00–1C4F
-
Ol Chiki1C50–1C7F
-
Cyrillic Extended C1C80–1C8F
-
Georgian Extended1C90–1CBF
-
Suplemento Sundanese1CC0–1CCF
-
Extensões Védicas1CD0–1CFF
-
Extensões Fonéticas1D00–1D7F
-
Suplemento de Extensões Fonéticas1D80–1DBF
-
Suplemento Combinando Marcas Diacríticas1DC0–1DFF
-
Latino estendido adicional1E00–1EFF
-
Grego estendido1F00–1FFF
-
Pontuação geral2000–206F
-
Superescritos e subescritos2070–209F
-
Símbolos de moeda20A0–20CF
-
Combinação de marcas diacríticas para símbolos20D0–20FF
-
Símbolos letterlike2100–214F
-
Formas de números2150–218F
-
Setas2190–21FF
-
Operadores matemáticos2200–22FF
-
Símbolos técnicos miscelâneos2300–23FF
-
Pictures controle2400–243F
-
Reconhecimento óptico de caracteres2440–245F
-
Alfanuméricos fechados2460–24FF
-
Desenho de caixas2500–257F
-
Elementos de bloco2580–259F
-
Formas geométricas25A0–25FF
-
Símbolos miscelâneos2600–26FF
-
Dingbats2700–27BF
-
Símbolos Matemáticos Diversos-A27C0–27EF
-
Setas suplementares-A27F0–27FF
-
Padrões em Braille2800–28FF
-
Setas suplementares-B2900–297F
-
Símbolos Matemáticos Diversos-B2980–29FF
-
Operadores Matemáticos Suplementares2A00–2AFF
-
Símbolos e flechas variados2B00–2BFF
-
Glagolítica2C00–2C5F
-
Latino estendido-C2C60–2C7F
-
cóptico2C80–2CFF
-
Suplemento georgiano2D00–2D2F
-
Tifinagh2D30–2D7F
-
Extensão etíope2D80–2DDF
-
Cyrillic Extended-A2DE0–2DFF
-
Pontuação suplementar2E00–2E7F
-
Suplemento de CJK Radicals2E80–2EFF
-
Radicais de Kangxi2F00–2FDF
-
Caracteres Ideográficos2FF0–2FFF
-
CJK Símbolos e Pontuação3000–303F
-
Hiragana3040–309F
-
Katakana30A0–30FF
-
Bopomofo3100–312F
-
Compatibilidade Hangul Jamo3130–318F
-
Kanbun3190–319F
-
Bopomofo Estendido31A0–31BF
-
CJK Strokes31C0–31EF
-
Extensões fonéticas Katakana31F0–31FF
-
Letras e Meses CJK Incluídos3200–32FF
-
Compatibilidade CJK3300–33FF
-
CJK Unified Ideographs Extensão A3400–4DBF
-
Símbolos do Hexagram de Yijing4DC0–4DFF
-
CJK Unified Ideographs4E00–9FFF
-
Sílabas de YiA000–A48F
-
Yi RadicalsA490–A4CF
-
LisuA4D0–A4FF
-
VaiA500–A63F
-
Cyrillic estendido-BA640–A69F
-
BamumA6A0–A6FF
-
Letras de tons modificadoresA700–A71F
-
Latino estendido-DA720–A7FF
-
Syloti NagriA800–A82F
-
Formulários Numéricos ComunsA830–A83F
-
Phags-paA840–A87F
-
SaurashtraA880–A8DF
-
Devanagari estendidoA8E0–A8FF
-
Kayah LiA900–A92F
-
RejangA930–A95F
-
Hangul Jamo Extended-AA960–A97F
-
JavanêsA980–A9DF
-
Mianmar estendido-BA9E0–A9FF
-
ChamAA00–AA5F
-
Mianmar Estendida-AAA60–AA7F
-
Tai VietAA80–AADF
-
Extensões Meetei MayekAAE0–AAFF
-
Ethiopic Extended-AAB00–AB2F
-
Latino estendido-EAB30–AB6F
-
Suplemento CherokeeAB70–ABBF
-
Meetei MayekABC0–ABFF
-
Sílabas do HangulAC00–D7AF
-
Hangul Jamo estendido-BD7B0–D7FF
-
Altos SubstitutosD800–DB7F
-
Substitutos elevados de uso privadoDB80–DBFF
-
Substitutos baixosDC00–DFFF
-
Áreas de uso privadoE000–F8FF
-
Ideogramas de Compatibilidade CJKF900–FAFF
-
Formas de apresentação alfabéticaFB00–FB4F
-
Árabe Formulários de Apresentação-AFB50–FDFF
-
Seletores de variaçãoFE00–FE0F
-
Formas verticaisFE10–FE1F
-
Combinação de Meias MarcasFE20–FE2F
-
Formulários de Compatibilidade CJKFE30–FE4F
-
Variantes de Formulário PequenoFE50–FE6F
-
Formas de apresentação em árabe-BFE70–FEFF
-
Formas halfwidth e fullwidthFF00–FFEF
-
Área especialFFF0–FFFF
-
-
1: Plano Multilingue opcional
-
Silabário Linear B10000–1007F
-
Ideogramas Linear B10080–100FF
-
Números do Egeu10100–1013F
-
Números do grego antigo10140–1018F
-
Símbolos antigos10190–101CF
-
Disco de Phaistos101D0–101FF
-
Lycian10280–1029F
-
Carian102A0–102DF
-
Números copta do Epact102E0–102FF
-
Velho Itálico10300–1032F
-
gótico10330–1034F
-
Velho Permic10350–1037F
-
Ugaritic10380–1039F
-
Velho Persa103A0–103DF
-
Deseret10400–1044F
-
Shavian10450–1047F
-
Osmanya10480–104AF
-
Osage104B0–104FF
-
Elbasan10500–1052F
-
Caucasiano albanês10530–1056F
-
Vithkuqi10570–105BF
-
Todhri105C0–105FF
-
Linear A10600–1077F
-
Latin Extended-F10780–107BF
-
Syllabary cipriota10800–1083F
-
Aramaico imperial10840–1085F
-
Palmireno10860–1087F
-
Nabateu10880–108AF
-
Hatran108E0–108FF
-
Fenício10900–1091F
-
Lydian10920–1093F
-
Hieróglifos Meroíticos10980–1099F
-
Meroitic Cursive109A0–109FF
-
Kharoshthi10A00–10A5F
-
velho árabe do Sul10A60–10A7F
-
velho árabe do Norte10A80–10A9F
-
Maniqueísta10AC0–10AFF
-
Avestan10B00–10B3F
-
Parthian Inscriptional10B40–10B5F
-
Inscrições Pahlavi10B60–10B7F
-
Psalter Pahlavi10B80–10BAF
-
Turkic velho10C00–10C4F
-
Velho Húngaro10C80–10CFF
-
Hanifi Rohingya10D00–10D3F
-
Garay10D40–10D8F
-
Rumi Numeral Símbolos10E60–10E7F
-
Yezidi10E80–10EBF
-
Arabic Extended-C10EC0–10EFF
-
Old Sogdian10F00–10F2F
-
Sogdian10F30–10F6F
-
Old Uyghur10F70–10FAF
-
Chorasmian10FB0–10FDF
-
Elymaic10FE0–10FFF
-
Brahmi11000–1107F
-
Kaithi11080–110CF
-
Sora Sompeng110D0–110FF
-
Chakma11100–1114F
-
Mahajani11150–1117F
-
Sharada11180–111DF
-
Sinhala Archaic Números111E0–111FF
-
Khojki11200–1124F
-
Multani11280–112AF
-
Khudawadi112B0–112FF
-
Grantha11300–1137F
-
Tulu-Tigalari11380–113FF
-
Newa11400–1147F
-
Tirhuta11480–114DF
-
Siddham11580–115FF
-
Modi11600–1165F
-
Suplemento Mongol11660–1167F
-
Takri11680–116CF
-
Myanmar Estendido-C116D0–116FF
-
Ahom11700–1174F
-
Dogra11800–1184F
-
Warang Citi118A0–118FF
-
Dives Akuru11900–1195F
-
Nandinagari119A0–119FF
-
Zanabazar Square11A00–11A4F
-
Soyombo11A50–11AAF
-
Unified Canadian Aboriginal Syllabics Extended-A11AB0–11ABF
-
Pau Cin Hau11AC0–11AFF
-
Devanagari Extended-A11B00–11B5F
-
Sunuwar11BC0–11BFF
-
Bhaiksuki11C00–11C6F
-
Marchen11C70–11CBF
-
Masaram Gondi11D00–11D5F
-
Gunjala Gondi11D60–11DAF
-
Makasar11EE0–11EFF
-
Kawi11F00–11F5F
-
Lisu Supplement11FB0–11FBF
-
Tamil Supplement11FC0–11FFF
-
cuneiforme12000–123FF
-
Números cuneiformes e pontuação12400–1247F
-
Cuneiforme cínico primitivo12480–1254F
-
Cypro-Minoan12F90–12FFF
-
Hieróglifos egípcios13000–1342F
-
Egyptian Hieroglyph Format Controls13430–1345F
-
Hieróglifos egípcios estendidos-A13460–143FF
-
Hieróglifos da Anatólia14400–1467F
-
Gurung Khema16100–1613F
-
Suplemento Bamum16800–16A3F
-
Mro16A40–16A6F
-
Tangsa16A70–16ACF
-
Bassa Vah16AD0–16AFF
-
Pahawh Hmong16B00–16B8F
-
Kirat Rai16D40–16D7F
-
Medefaidrin16E40–16E9F
-
Miao16F00–16F9F
-
Símbolos ideográficos e pontuação16FE0–16FFF
-
Tangut17000–187FF
-
Componentes Tangut18800–18AFF
-
Khitan Small Script18B00–18CFF
-
Tangut Supplement18D00–18D7F
-
Kana Extended-B1AFF0–1AFFF
-
Suplemento Kana1B000–1B0FF
-
Kana Extended-A1B100–1B12F
-
Small Kana Extension1B130–1B16F
-
Nushu1B170–1B2FF
-
Duployan1BC00–1BC9F
-
Controles de formato abreviado1BCA0–1BCAF
-
Símbolos para Suplemento de Computação Legado1CC00–1CEBF
-
Znamenny Musical Notation1CF00–1CFCF
-
Símbolos musicais bizantinos1D000–1D0FF
-
Símbolos musicais1D100–1D1FF
-
Notação musical grega antiga1D200–1D24F
-
Kaktovik Numerals1D2C0–1D2DF
-
Mayan Numerals1D2E0–1D2FF
-
Símbolos de Tai Xuan Jing1D300–1D35F
-
Numeração de Rodas de contagem1D360–1D37F
-
Símbolos Alfanuméricos Matemáticos1D400–1D7FF
-
Sutton SignWriting1D800–1DAAF
-
Latin Extended-G1DF00–1DFFF
-
Suplemento glagolítico1E000–1E02F
-
Cyrillic Extended-D1E030–1E08F
-
Nyiakeng Puachue Hmong1E100–1E14F
-
Toto1E290–1E2BF
-
Wancho1E2C0–1E2FF
-
Nag Mundari1E4D0–1E4FF
-
Ol Onal1E5D0–1E5FF
-
Ethiopic Extended-B1E7E0–1E7FF
-
Mende Kikakui1E800–1E8DF
-
Adlam1E900–1E95F
-
Indic Siyaq Numbers1EC70–1ECBF
-
Ottoman Siyaq Numbers1ED00–1ED4F
-
Árabe Matemática Alfabética Símbolos1EE00–1EEFF
-
Azulejos Mahjong1F000–1F02F
-
Domino Tiles1F030–1F09F
-
Jogar às cartas1F0A0–1F0FF
-
Suplemento alfanumérico fechado1F100–1F1FF
-
Suplemento ideográfico fechado1F200–1F2FF
-
Símbolos e pictogramas diversos1F300–1F5FF
-
Emoticons (emoji)1F600–1F64F
-
Dingbats ornamentais1F650–1F67F
-
Transporte e símbolos de mapa1F680–1F6FF
-
Símbolos Alquímicos1F700–1F77F
-
Formas geométricas alargadas1F780–1F7FF
-
Setas suplementares-C1F800–1F8FF
-
Símbolos e pictogramas adicionais1F900–1F9FF
-
Chess Symbols1FA00–1FA6F
-
Symbols and Pictographs Extended-A1FA70–1FAFF
-
Symbols for Legacy Computing1FB00–1FBFF
-
-
2: Plano ideográfico adicional
-
CJK Unified Ideographs Extension B20000–2A6DF
-
CJK Unified Ideographs Extension C2A700–2B73F
-
CJK Unified Ideographs Extension D2B740–2B81F
-
CJK Unified Ideographs Extension E2B820–2CEAF
-
CJK Unified Ideographs Extension F2CEB0–2EBEF
-
CJK Unified Ideographs Extension I2EBF0–2EE5F
-
CJK Compatibility Ideographs Supplement2F800–2FA1F
-
-
3: Plano ideográfico terciário
-
4-13: Não utilizado
-
-
-
-
-
-
-
-
-
-
14: Plano adicional especializado
-
15: Área adicional para uso privado – A
-
16: Área adicional para uso privado – B
-
Nada encontrado
┐( ˘_˘ )┌