Search references for UNICODE. Phrases containing UNICODE
See searches and references containing UNICODE!UNICODE
Character encoding standard
uncommon Unicode characters. Without proper rendering support, you may see question marks, boxes, or other symbols. Unicode (also known as The Unicode Standard
Unicode
Symbols for emotional cues in text
This article contains Unicode emoticons or emoji. Without proper rendering support, you may see question marks, boxes, or other symbols instead of the
Emoji
Purposely unassigned Unicode code points
In Unicode, a Private Use Area (PUA) is a range of code points that, by definition, will not be assigned characters by the standard. Three Private Use
Private_Use_Areas
scripts in Unicode include: Ahom (Unicode block) Balinese (Unicode block) Batak (Unicode block) Bhaiksuki (Unicode block) Buhid (Unicode block) Buginese
List_of_Unicode_characters
Unicode block containing some special codepoints and two non-characters
Specials is a short Unicode block of characters allocated at the very end of the Basic Multilingual Plane, at U+FFF0–FFFF, containing these code points:
Specials_(Unicode_block)
ASCII-compatible variable-width encoding of Unicode
used for electronic communication. Defined by the Unicode Standard, the name is derived from Unicode Transformation Format – 8-bit. As of 2026, almost
UTF-8
Script used to write the Greek language
character list in Unicode Unicode collation charts – including Greek and Coptic letters, sorted by shape Examples of Greek handwriting Greek Unicode Issues (Nick
Greek_alphabet
Unicode denominator & numerator glyphs
rendering support, you may see question marks, boxes, or other symbols. Unicode has subscripted and superscripted versions of a number of characters including
Unicode subscripts and superscripts
Unicode_subscripts_and_superscripts
Input characters using their Unicode code points
Unicode input is a method to encode specific characters that are not directly available on a physical keyboard. Characters can be entered either by selecting
Unicode_input
Ninth letter of the Latin alphabet
LETTER I The positions 0x49 and 0x69 were used by ASCII and inherited by Unicode. EBCDIC used 0xC9 and 0x89 for I and i. Brown & Kiddle (1870) The institutes
I
Unicode block
symbols in Unicode Unicode input "Unicode character database". The Unicode Standard. Retrieved 2023-07-26. "Enumerated Versions of The Unicode Standard"
Arrows_(Unicode_block)
Emoji and others representing or depicting heart shapes
found its way into many character sets and encodings, including those of Unicode. Some characters depict the shape directly, others reference it in a more
Hearts_in_Unicode
Nonprofit organization that coordinates the development of the Unicode Standard
The Unicode Consortium (legally Unicode, Inc.) is a 501(c)(3) non-profit organization incorporated and based in Mountain View, California, U.S. Its primary
Unicode_Consortium
Block of Unicode symbols
may see question marks, boxes, or other symbols. Geometric Shapes is a Unicode block of 96 symbols at code point range U+25A0–25FF. The BLACK CIRCLE is
Geometric Shapes (Unicode block)
Geometric_Shapes_(Unicode_block)
marks, boxes, or other symbols. The Unicode Standard encodes almost all standard characters used in mathematics. Unicode Technical Report #25 provides comprehensive
Mathematical operators and symbols in Unicode
Mathematical_operators_and_symbols_in_Unicode
Symbols used in pre-19th-century chemistry
This article contains Unicode alchemical symbols. Without proper rendering support, you may see question marks, boxes, or other symbols instead of alchemical
Alchemical_symbol
Unicode character
The byte-order mark (BOM) is a particular usage of the special Unicode character code, U+FEFF ZERO WIDTH NO-BREAK SPACE, whose appearance as a magic number
Byte_order_mark
Continuous group of 65536 Unicode code points
In the Unicode standard, a plane is a contiguous group of 65,536 (216) code points. There are 17 planes, identified by the numbers 0 to 16, which corresponds
Plane_(Unicode)
Punctuation mark used to join words
entity. In character encoding for use with computers, it is represented in Unicode by any of several characters. These include the dual-use hyphen-minus,
Hyphen
Named range of Unicode code points
A Unicode block is one of several contiguous ranges of numeric character codes (code points) of the Unicode character set that are defined by the Unicode
Unicode_block
Computer text file character representing blank space
three-character-cells-wide SPACE symbol "SPC" (analogous to Unicode's single-cell-wide U+2420). The Braille Patterns Unicode block contains U+2800 ⠀ BRAILLE PATTERN BLANK
Whitespace_character
Using numbers to represent text characters
representing more characters were created, such as ASCII, ISO/IEC 8859, and Unicode encodings such as UTF-8 and UTF-16. The most popular character encoding
Character_encoding
Punctuation mark
"Small Form Variants" (PDF). The Unicode Standard. Unicode Consortium. "Ogham Code Chart" (PDF). The Unicode Standard. Unicode Consortium. Archived (PDF) from
Bracket
Characters for drawing frames and boxes
screen and portraying drop shadows. Unicode includes 128 such characters in the Box Drawing block. In many Unicode fonts, only the subset that is also
Box-drawing_characters
Set of alphabetic symbols that allow for special handling
The regional indicator symbols are a set of 26 alphabetic Unicode characters (A–Z) intended to be used to encode ISO 3166-1 alpha-2 two-letter country
Regional_indicator_symbol
Computer font that maps glyphs to code points defined in the Unicode Standard
Unicode font is a computer font that maps glyphs to code points defined in the Unicode Standard. The term has become archaic because the vast majority
Unicode_font
Text characters representing chess pieces
rendering support, you may see question marks, boxes, or other symbols. Unicode has text representations of chess pieces. These allow to produce the symbols
Chess_symbols_in_Unicode
Graphemes for various number systems
A numeral (often called number in Unicode) is a character that denotes a number. The decimal number digits 0–9 are used widely in various writing systems
Numerals_in_Unicode
Letter of the Latin alphabet; used in the German language
"ISO/IEC 8859-1:1998 to Unicode". Unicode Consortium. Whistler, Ken (2015-12-02) [1999-07-27]. "ISO/IEC 8859-2:1999 to Unicode". Unicode Consortium. Whistler
ß
Typographic symbol class
This article contains Unicode emoticons or emoji. Without proper rendering support, you may see question marks, boxes, or other symbols instead of the
Dingbat
Unicode character block
CJK Unified Ideographs is a Unicode block containing the most common CJK ideographs used in modern Chinese, Japanese, Korean and Vietnamese characters
CJK Unified Ideographs (Unicode block)
CJK_Unified_Ideographs_(Unicode_block)
definition of a Cyrillic letter for this list is a character encoded in the Unicode standard that a has script property of 'Cyrillic' and the general category
List_of_Cyrillic_letters
Punctuation and accent mark (~, ◌̃)
vs Unicode mapping table", JIS X 0213:2004, X 0213, archived from the original on 24 May 2009, retrieved 28 April 2009. Shift-JIS to Unicode, Unicode, archived
Tilde
You may need rendering support to display the Unicode emoticons or emoji in this article correctly. Unicode 17.0 specifies a total of 3,953 emoji using
List_of_emojis
Alternative forms for the Cyrillic letter O
Cyrillic letter O. They were proposed for inclusion into Unicode in 2007 and incorporated as in Unicode 5.1. Monocular O (Ꙩ ꙩ) is one of the rare glyph variants
Cyrillic_O_variants
Originally, these icons consisted of ASCII art, and later, Shift JIS art and Unicode art. In recent times, graphical icons, both static and animated, have joined
List_of_emoticons
Unicode character block
rendering support, you may see question marks, boxes, or other symbols. In Unicode, the Sumero-Akkadian Cuneiform script is covered in three blocks in the
Cuneiform_(Unicode_block)
Graphical symbol or pictogram used to point or indicate direction
Modifier Letters Unicode blocks. Dingbat Box-drawing character Box Drawing (Unicode Block) Block Elements (Unicode Block) Geometric Shapes (Unicode block) HTML
Arrow_(symbol)
Something that represents an idea, process, or physical entity
"Unicode Technical Report #28: Unicode 3.2". Unicode Consortium. 27 March 2002. Retrieved 23 June 2022. Jenkins, John H. (26 August 2021). "Unicode Standard
Symbol
Mathematical symbol representing infinity
Components for Unicode. Unicode Consortium. Retrieved 2022-02-19 – via GitHub. "IBM-970". International Components for Unicode. Unicode Consortium. May
Infinity_symbol
Glyph combining two or more letterforms
handle Unicode, and have the correct Unicode fonts installed, some or all of these will display correctly. See also the provided graphic. Unicode maintains
Ligature_(writing)
Aspect of the Unicode standard
Unicode equivalence is the specification by the Unicode character encoding standard that some sequences of code points represent essentially the same
Unicode_equivalence
Subset of characters in Unicode
v t e In Unicode, a script is a collection of letters and other written signs used to represent textual information in one or more writing systems. Some
Script_(Unicode)
Slanting line punctuation mark (/)
5 also slash mark: DIAGONAL : 4 "Unicode 1.1 Composite Name List, including default properties". Unicode.org. Unicode Consortium. 5 July 1995. Archived
Slash_(punctuation)
assignments, Unicode resolved this issue. Fonts which support a wide range of Unicode scripts and Unicode symbols are sometimes referred to as "pan-Unicode fonts"
List_of_typefaces
Punctuation mark with various forms
that involve these keys. Also, techniques using their Unicode code points are available; see Unicode input. Macintosh character sets have always had curved
Quotation_mark
Unicode character block
symbols in Unicode "Unicode 1.0.1 Addendum" (PDF). The Unicode Standard. 1992-11-03. Retrieved 2016-07-09. "Unicode character database". The Unicode Standard
Combining_Diacritical_Marks
Unicode code point property names and their uses
The Unicode Standard assigns various properties to each Unicode character and code point. The properties can be used to handle characters (code points)
Unicode_character_property
String collation algorithm
The Unicode collation algorithm (UCA) is an algorithm defined in Unicode Technical Report #10, which is a customizable method to produce binary keys from
Unicode_collation_algorithm
This article contains uncommon Unicode characters. Without proper rendering support, you may see question marks, boxes, or other symbols instead of the
List_of_numeral_systems
Alternative width characters in East Asian typography
character, hence the name. Halfwidth and Fullwidth Forms is also the name of a Unicode block U+FF00–FFEF, provided so that older encodings containing both halfwidth
Halfwidth_and_fullwidth_forms
Emoji featuring laughing crying face
part of the Emoticons block of Unicode and was added to the Unicode Standard in 2010 in Unicode 6.0, the first Unicode release intended to release emoji
Face_with_Tears_of_Joy_emoji
definition of a Latin-script letter for this list is a character encoded in the Unicode Standard that has a script property of 'Latin' and the general category
List_of_Latin-script_letters
Brahmic script
represented by combining multiple Unicode code points, as can be seen in the Unicode Tamil Syllabary below. In Unicode 5.1, named sequences were added for
Tamil_script
Characters from the Latin script encoded in the Unicode Standard
Over a thousand characters from the Latin script are encoded in the Unicode Standard, grouped in several basic and extended Latin blocks. The extended
Latin_script_in_Unicode
Neo-grotesque sans-serif typeface
Arial Unicode MS is a TrueType font and the extended version of the font Arial. Compared to Arial, it includes higher line height, omits kerning pairs
Arial_Unicode_MS
Japanese syllabary
added to the Unicode Standard in October, 1991 with the release of version 1.0. The Unicode block for Hiragana is U+3040–U+309F: The Unicode hiragana block
Hiragana
Unicode block
This article contains Unicode emoticons or emoji. Without proper rendering support, you may see question marks, boxes, or other symbols instead of the
Miscellaneous_Symbols
Complete list of the characters available on most computers
rendering support, you may see question marks, boxes, or other symbols. The Unicode Consortium and the ISO/IEC JTC 1/SC 2/WG 2 jointly collaborate on the list
Universal Character Set characters
Universal_Character_Set_characters
Unicode character block
Block Elements is a Unicode block containing square block symbols of various fill and shading. Used along with block elements are box-drawing characters
Block_Elements
Unicode character block
This article contains Unicode Braille characters. Without proper rendering support, you may see question marks, boxes, or other symbols instead of Braille
Braille_Patterns
Unicode script encoding
As of Unicode version 17.0, Cyrillic script is encoded across several blocks: Cyrillic: U+0400–U+04FF, 256 characters Cyrillic Supplement: U+0500–U+052F
Cyrillic_script_in_Unicode
Family of abugida writing systems
13: South and Central Asia-II" (PDF). The Unicode Standard, Version 11.0. Mountain View, California: Unicode, Inc. June 2018. ISBN 978-1-936213-19-1. Aditya
Brahmic_scripts
Unicode character block
is a Unicode block containing runic characters. It was introduced in Unicode 3.0 (1999), with eight additional characters introduced in Unicode 7.0 (2014)
Runic_(Unicode_block)
Twentieth letter of the Latin alphabet
This article contains uncommon Unicode characters. Without proper rendering support, you may see question marks, boxes, or other symbols instead of the
T
Unicode character block
symbols. Look up Appendix:Unicode/Egyptian Hieroglyphs in Wiktionary, the free dictionary. Egyptian Hieroglyphs is a Unicode block containing the Gardiner's
Egyptian Hieroglyphs (Unicode block)
Egyptian_Hieroglyphs_(Unicode_block)
Unicode character block
This article contains uncommon Unicode characters. Without proper rendering support, you may see question marks, boxes, or other symbols instead of the
Musical Symbols (Unicode block)
Musical_Symbols_(Unicode_block)
Japanese syllabary
to the Unicode standard in October 2010 with the release of version 6.0. The Unicode block for Kana Supplement is U+1B000–U+1B0FF: The Unicode block for
Katakana
Many scripts in Unicode, such as Arabic, have special orthographic rules that require certain combinations of letterforms to be combined into special
Arabic_script_in_Unicode
script. For a far more comprehensive list of symbols and signs, see List of Unicode characters. For other languages and symbol sets (especially in mathematics
List of typographical symbols and punctuation marks
List_of_typographical_symbols_and_punctuation_marks
Unicode character block
The Unicode Standard. Retrieved 2025-10-13. "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2025-10-13. The Unicode Consortium
Arabic_(Unicode_block)
Twenty-first letter in the Greek alphabet
descends from phi. Like other Greek letters, lowercase phi (encoded as the Unicode character U+03C6 φ GREEK SMALL LETTER PHI) is used as a mathematical or
Phi
Writing system
shown to conform to the Unicode definition of a character: this aspect is the responsibility of the typeface designer. The Unicode 5.1 standard, released
Cyrillic_script
Punctuation mark (,)
their names in the Unicode Standard are 'letter with a cedilla'. They were introduced to the Unicode standard before 1992 and, per Unicode Consortium policy
Comma
Unicode character block
(Unicode block) "Unicode character database". The Unicode Standard. Retrieved 2023-07-26. "Enumerated Versions of The Unicode Standard". The Unicode Standard
Javanese_(Unicode_block)
Currency symbol for the Indian Rupee (INR)
officially adopted by the end of 2010. The sign was given a codepoint (20B9) in Unicode with the release of version 6.0 on 12 October 2010, making it usable worldwide
Indian_rupee_sign
Unicode character block
Symbols is a Unicode block containing arrows, dots, enclosures, and overlays for modifying symbol characters. Its block name in Unicode 1.0 was simply
Combining Diacritical Marks for Symbols
Combining_Diacritical_Marks_for_Symbols
Native alphabet of the Korean language
not fully supported in Unicode until the late 2000s. Private Use Areas were often used for that purpose until official Unicode implementation was added
Hangul
Typographical symbol
The symbol -, known in Unicode as hyphen-minus, is the form of hyphen most commonly used in digital documents. On most keyboards, it is the only character
Hyphen-minus
Characters representing cultural, political and religious symbols
rendering support, you may see question marks, boxes, or other symbols. Unicode contains a number of characters that represent various cultural, political
Religious and political symbols in Unicode
Religious_and_political_symbols_in_Unicode
Unicode character block
This article contains Unicode alchemical symbols. Without proper rendering support, you may see question marks, boxes, or other symbols instead of alchemical
Alchemical Symbols (Unicode block)
Alchemical_Symbols_(Unicode_block)
Sequence of characters that forms a search pattern
for Unicode. In most respects it makes no difference what the character set is, but some issues do arise when extending regexes to support Unicode. Supported
Regular_expression
Effort to map CJK characters in Unicode
boxes, or other symbols. Han unification is an effort by the authors of Unicode and the Universal Character Set to map multiple character sets of the Han
Han_unification
Unicode text character not part of a natural language script
In computing, a Unicode symbol is a Unicode character which is not part of a script used to write a natural language, but is nonetheless available for
Unicode_symbol
and other symbols are supported by the Unicode character encoding standard. As of version 17.0 of the Unicode Standard, 518 characters in the following
Greek_script_in_Unicode
Unicode for depicting playing cards' fonts and symbols
Unicode is a computing industry standard for the handling of fonts and symbols. Within it is a set of code points representing playing cards, and another
Playing_cards_in_Unicode
Unicode character block
Tags is a Unicode block containing formatting tag characters. The block is designed to mirror ASCII. It was originally intended for language tags, but
Tags_(Unicode_block)
Control characters in bidirectional text
bidirectional text. Unicode defines three such characters: the left-to-right mark, the right-to-left mark and the Arabic letter mark. In Unicode, the three marks
Implicit_directional_marks
Unicode character block
In Unicode, the Sumero-Akkadian Cuneiform script is covered in three blocks in the Supplementary Multilingual Plane (SMP): U+12000–U+123FF Cuneiform U+12400–U+1247F
Cuneiform Numbers and Punctuation
Cuneiform_Numbers_and_Punctuation
Japanese syllabic writing systems
2026. "Kana Supplement" (PDF). Unicode 15.1. Unicode. Retrieved 11 March 2024. "Kana Extended-A" (PDF). Unicode 15.1. Unicode. Retrieved 11 March 2024. 關根江山
Kana
Unicode block
in Unicode Unicode symbols Mathematical operators and symbols in Unicode Mathematical Alphanumeric Symbols (Unicode block) Currency Symbols (Unicode block)
Letterlike_Symbols
Unicode block
marks, boxes, or other symbols. Mathematical Alphanumeric Symbols is a Unicode block comprising styled forms of Latin and Greek letters and decimal digits
Mathematical Alphanumeric Symbols
Mathematical_Alphanumeric_Symbols
System of Chinese character radicals
that order characters by radical and stroke count. They are encoded in Unicode alongside other CJK characters, under the block "Kangxi radicals", while
Kangxi_radicals
SI-derived unit of area
The square metre may be used with all SI prefixes used with the metre. Unicode has several characters used to represent metric area units, but these are
Square_metre
Semisyllabary used to transcribe Chinese
system by the International Organization for Standardization (ISO) and Unicode. Analogous to how the word alphabet is derived from the names of the first
Bopomofo
Variant of the Latin alphabet
române, 2005, p. LII (in Romanian) Unicode 3.0 standard, p.162 "Unicode.org". "Unicode.org". "Unicode.org". "Unicode 5.2 Chapter 7, European Alphabetic
Romanian_alphabet
Symbol often denoting 'yes' or 'correct'
Hand gesture indicating approval or disapproval Unicode input – Input characters using their Unicode code points X mark – Symbol with multiple meanings
Check_mark
Unicode character block
The Basic Latin Unicode block, sometimes informally called C0 Controls and Basic Latin, is the first block of the Unicode standard, and the only block
Basic_Latin_(Unicode_block)
Unicode character block
in Unicode Latin script in Unicode "Unicode character database". The Unicode Standard. Retrieved 2023-07-26. "Enumerated Versions of The Unicode Standard"
Superscripts_and_Subscripts
Markup language and file format
across the Internet. It is a textual data format with strong support via Unicode for different human languages. Although the design of XML focuses on documents
XML
UNICODE
UNICODE
UNICODE
UNICODE
Boy/Male
Australian, Irish
A Poet; Philosopher
Surname or Lastname
English
English : patronymic from a pet form of Rudge.The founder of this influential American family was Thomas Ruggles (1584–1644) of Sudbury, Suffolk, England, who settled in Roxbury, MA, in 1637.
Boy/Male
Hindu, Indian, Traditional
Sun
Boy/Male
Tamil
Rishi Rochan | ரிஷீ ரோசநÂ
Sage, Ray of light
Girl/Female
Indian, Punjabi, Sikh
One who Conquer Lord of Mind
Girl/Female
French
Female
Yiddish
(סִיסל) Yiddish name SISEL means "sweet."
Boy/Male
Hindu, Indian
Lord Ayyappa
Male
Esperanto
Variant spelling of Esperanto Michaelo, MIHHAELO means "who is like God?"
Girl/Female
Assamese, Gujarati, Hindu, Indian, Kannada, Malayalam, Marathi, Telugu
Cloud from Heaven
UNICODE
UNICODE
UNICODE
UNICODE
UNICODE