This online utility converts ASCII characters to Unicode text. Anything that you paste or enter in the input area automatically gets converted to Unicode and is printed in the output area. It supports all Unicode symbols and it works with emoji characters. Created by encoding gurus from team Browserling.
This online utility converts ASCII characters to Unicode text. Anything that you paste or enter in the input area automatically gets converted to Unicode and is printed in the output area. It supports all Unicode symbols and it works with emoji characters. Created by encoding gurus from team Browserling.
This browser-based utility reinterprets sequences of ASCII and ANSI characters as Unicode data. The ASCII encoding uses only 7 bits and supports only 95 printable characters. The most popular Unicode encoding that most browsers implement uses a variable-width encoding from 7 to 21 bits and supports 1,100,000 printable characters. As the simplest characters (a-z, A-Z, numbers, and punctuation marks) use the same amount of bits (7 bits) in both ASCII and Unicode, they are left unchanged in the encoding process. All other Unicode characters are displayed as two, three, or four broken ANSI characters because the ASCII encoding doesn't support multi-byte values. To determine how many bytes each output Unicode symbol uses, we convert the broken input text to binary base and look at the first few beginning bits. This bit pattern tells us how many bytes to consume to create a Unicode glyph. Single-byte Unicode glyphs have the high bit set to zero. What that means is they start with the 0-bit and look like "0bbbbbbb", where "b" is any bit value. So a single byte Unicode glyph matches the same ASCII glyph. Double-byte glyphs start with the pattern "110bbbbb", followed by "10bbbbbb". The total number of free bits in double-byte glyphs is eleven (five in the first byte and six in the second byte). Triple-byte glyphs start with the pattern "1110bbbb", followed by two "10bbbbbb" patterns. In this case, the number of usable bits is sixteen (four in the first byte and six in bytes two and three). Quadruple-byte glyphs start with the pattern "11110bbb" and are followed by three patterns of "01bbbbbb". The total number of bits, in this case, is twenty-one (three in the first byte and six in each of the other three bytes). When we combine these one-, two-, three-, or four-byte sequences together and let the browser decode them to its default encoding, we get valid Unicode characters.
This browser-based utility reinterprets sequences of ASCII and ANSI characters as Unicode data. The ASCII encoding uses only 7 bits and supports only 95 printable characters. The most popular Unicode encoding that most browsers implement uses a variable-width encoding from 7 to 21 bits and supports 1,100,000 printable characters. As the simplest characters (a-z, A-Z, numbers, and punctuation marks) use the same amount of bits (7 bits) in both ASCII and Unicode, they are left unchanged in the encoding process. All other Unicode characters are displayed as two, three, or four broken ANSI characters because the ASCII encoding doesn't support multi-byte values. To determine how many bytes each output Unicode symbol uses, we convert the broken input text to binary base and look at the first few beginning bits. This bit pattern tells us how many bytes to consume to create a Unicode glyph. Single-byte Unicode glyphs have the high bit set to zero. What that means is they start with the 0-bit and look like "0bbbbbbb", where "b" is any bit value. So a single byte Unicode glyph matches the same ASCII glyph. Double-byte glyphs start with the pattern "110bbbbb", followed by "10bbbbbb". The total number of free bits in double-byte glyphs is eleven (five in the first byte and six in the second byte). Triple-byte glyphs start with the pattern "1110bbbb", followed by two "10bbbbbb" patterns. In this case, the number of usable bits is sixteen (four in the first byte and six in bytes two and three). Quadruple-byte glyphs start with the pattern "11110bbb" and are followed by three patterns of "01bbbbbb". The total number of bits, in this case, is twenty-one (three in the first byte and six in each of the other three bytes). When we combine these one-, two-, three-, or four-byte sequences together and let the browser decode them to its default encoding, we get valid Unicode characters.
This example fixes distorted ASCII characters and prints a readable quote by Ralph Waldo Emerson in the output. The quote is written using Unicode's bold-italic letter font that uses 4-byte sequences for each letter. In the broken ASCII text, each letter is represented by four extended ASCII symbols so to restore them, the browser reads four bytes at a time and reencodes these bytes as Unicode. The quote marks and horizontal bar symbol use 3-byte sequences (three ASCII symbols). The author’s name uses simple English letters, which are equally readable in ASCII and Unicode encodings, so the browser doesn't need to reencode them.
This example restores broken Unicode emoticons in a short phrase. The destroyed emoticons are printed as three ANSI characters. To restore them, the utility decodes them to binary, extracts the Unicode bits from them, calculates the code point values, and converts them to valid Unicode emojis. The broken sequence "â" is "☔" ("rain"), and "ð" is "🌷" ("flower"), so the phrase is "No rain. No flowers."
This example looks like Zalgo destroyed it but no, it's not Zalgo, it's just the wrong Unicode encoding. The input text is interpreted as raw ASCII characters but in reality, it's Unicode characters. Because of this encoding mismatch, the text is absolutely unreadable. To understand the input data, it's converted into bytes and encoded to Unicode characters. As a result, we get a list of scientific emojis together with their descriptions in the small-caps Unicode font.
You can pass input to this tool via ?input query argument and it will automatically compute output. Here's how to type it in your browser's address bar. Click to try!
View and edit Unicode in a browser-based editor.
Spell out the names of Unicode characters in the input text.
URL-unescape Unicode text.
Convert base-2 data to Unicode encoding.
Convert base-8 data to Unicode encoding.
Convert base-10 data to Unicode encoding.
Convert base-16 data to Unicode encoding.
Convert Unicode text to any radix.
Convert any radix data to Unicode.
Convert Unicode text to ISO-8859-1 encoding.
Convert ISO-859-1 encoded data to Unicode.
Convert Unicode text to ISO-8859-2 encoding.
Convert ISO-8859-2 encoded data to Unicode.
Convert Unicode text to Ecoji encoding.
Convert Ecoji encoded data to Unicode.
Convert raw bytes to Unicode.
Check the Unicode version of the given Unicode characters.
Check if the given Unicode has valid encoding.
Encode Unicode text to Punycode encoding.
Decode Punycode encoding to Unicode.
Convert base64 data to Unicode text.
Convert Unicode to a valid data URL.
Convert a valid data URL to Unicode text.
Decode HTML entities to Unicode data.
Decode UTF8 encoding to Unicode.
Decode UTF16 encoding to Unicode.
Decode UTF32 encoding to Unicode.
Convert all Unicode characters to uppercase.
Convert all Unicode characters to lowercase.
Generate a list of all country flag icons.
Generate a list of all Unicode arrows.
Generate a list of all Unicode animals.
Generate a list of all Unicode flowers and plants.
Generate a list of all Unicode block elements.
Generate a list of all Egyptian hieroglyphs.
Generate a list of all currency symbols.
Use Unicode colors to generate a rainbow.
Create a smiley face from Unicode symbols.
Generate a list of random emojis.
Randomize case of all Unicode characters.
Convert all Unicode characters to lowercase.
Encode Unicode to JSON.
Decode JSON to Unicode.
Randomly rearrange the order of input graphemes.
Generate Alt codes for Unicode characters.
Generate Unicode glyphs from Alt codes.
Print statistics about Unicode data and code points.
Extract a part from Unicode data.
Generate waves with Unicode symbols.
Generate graphs using Unicode symbols.
Wrap a message in a Unicode box.
Subscribe to our updates. We'll let you know when we release new tools, features, and organize online workshops.
Enter your email here
We're Browserling — a friendly and fun cross-browser testing company powered by alien technology. At Browserling we love to make people's lives easier, so we created this collection of online Unicode tools. Our tools are focused on gettings things done and they have the simplest possible user interface. As soon as you load your Unicode data in the input of any of our tools, you'll instantly get the result in the output. Behind the scenes, our tools are actually powered by our web developer tools that we created over the last couple of years. Check them out!