This online utility converts Unicode data to ASCII characters. Anything that you paste or enter in the input area automatically gets converted to ASCII and is printed in the output area. It supports all Unicode symbols and it works with emoji characters. Created by encoding gurus from team Browserling.
This online utility converts Unicode data to ASCII characters. Anything that you paste or enter in the input area automatically gets converted to ASCII and is printed in the output area. It supports all Unicode symbols and it works with emoji characters. Created by encoding gurus from team Browserling.
This browser-based utility converts your Unicode data to the ASCII encoding. To do this, it first splits the Unicode data into graphemes and finds the code point values of each grapheme. A grapheme is usually a single glyph (such as a letter, number, ideogram, logogram, or an emoticon) but it can also be a combination of glyphs (such as text with combining characters). The browser's default encoding stores glyphs as sequences of one, two, three, or four bytes. Code points in the range from 0 to 127 use one byte (actually less than that – only 7 bits). Code points in the range from 128 to 2047 use two bytes. Code points in the range from 2048 to 65,535 use three bytes. Code points in the range from 65,536 to 1,114,111 use four bytes. If a symbol is encoded using just one byte, then the Unicode symbol will be exactly the same as the ASCII symbol and won't change its value when being converted to the ASCII encoding. Characters that use more than one byte are represented as two, three, or four extended ASCII characters, one for each byte. A two-byte Unicode symbol has the binary format "110xxxxx 10xxxxxx", where "x" is a usable bit, so it has 5+6=11 usable bits. A three-byte Unicode symbol has the binary format "1110xxxx 10xxxxxx 10xxxxxx" with 4+6+6=16 usable bits. A four-byte Unicode symbol has the binary format "11110xxx 10xxxxxx 10xxxxxx 10xxxxxx" with 3+6+6+6=21 usable bits. Let's analyze which extended ASCII characters are used in each multi-byte mode. If it's a two-byte encoding, then the first byte "110xxxxx" has 5 free bits and can have 25 = 32 values, and the second byte "10xxxxxx" has 6 free bits and can have 26 = 64 values. If you look at the extended ASCII table, then the possible values for the first byte "110xxxxx" are "ÀÁÂÃÄÅÆÇÈÉÊËÌÍÎÏÐÑÒÓÔÕÖ×ØÙÚÛÜÝÞß" and for the second byte "10xxxxxx" are " ¡¢£¤¥¦§¨©ª«¬®¯°±²³´µ¶·¸¹º»¼½¾¿". If it's a three-byte encoding, then the first byte "1110xxxx" has 4 free bits and can have 24 = 16 values. The second and third bytes are the same as in the two-byte case. The 16 possible values for the first byte "1110xxxx" are "àáâãäåæçèéêëìíîï". If it's a four-byte encoding, then the first byte "11110xxx" has only 3 free bits and can have 23 = 8 values. The bytes two, three, and four are the same as in previous cases. The possible values for the first byte "11110xxx" are "ðñòóôõö÷".
This browser-based utility converts your Unicode data to the ASCII encoding. To do this, it first splits the Unicode data into graphemes and finds the code point values of each grapheme. A grapheme is usually a single glyph (such as a letter, number, ideogram, logogram, or an emoticon) but it can also be a combination of glyphs (such as text with combining characters). The browser's default encoding stores glyphs as sequences of one, two, three, or four bytes. Code points in the range from 0 to 127 use one byte (actually less than that – only 7 bits). Code points in the range from 128 to 2047 use two bytes. Code points in the range from 2048 to 65,535 use three bytes. Code points in the range from 65,536 to 1,114,111 use four bytes. If a symbol is encoded using just one byte, then the Unicode symbol will be exactly the same as the ASCII symbol and won't change its value when being converted to the ASCII encoding. Characters that use more than one byte are represented as two, three, or four extended ASCII characters, one for each byte. A two-byte Unicode symbol has the binary format "110xxxxx 10xxxxxx", where "x" is a usable bit, so it has 5+6=11 usable bits. A three-byte Unicode symbol has the binary format "1110xxxx 10xxxxxx 10xxxxxx" with 4+6+6=16 usable bits. A four-byte Unicode symbol has the binary format "11110xxx 10xxxxxx 10xxxxxx 10xxxxxx" with 3+6+6+6=21 usable bits. Let's analyze which extended ASCII characters are used in each multi-byte mode. If it's a two-byte encoding, then the first byte "110xxxxx" has 5 free bits and can have 25 = 32 values, and the second byte "10xxxxxx" has 6 free bits and can have 26 = 64 values. If you look at the extended ASCII table, then the possible values for the first byte "110xxxxx" are "ÀÁÂÃÄÅÆÇÈÉÊËÌÍÎÏÐÑÒÓÔÕÖ×ØÙÚÛÜÝÞß" and for the second byte "10xxxxxx" are " ¡¢£¤¥¦§¨©ª«¬®¯°±²³´µ¶·¸¹º»¼½¾¿". If it's a three-byte encoding, then the first byte "1110xxxx" has 4 free bits and can have 24 = 16 values. The second and third bytes are the same as in the two-byte case. The 16 possible values for the first byte "1110xxxx" are "àáâãäåæçèéêëìíîï". If it's a four-byte encoding, then the first byte "11110xxx" has only 3 free bits and can have 23 = 8 values. The bytes two, three, and four are the same as in previous cases. The possible values for the first byte "11110xxx" are "ðñòóôõö÷".
In this example, we convert a quote by Thomas Carlyle that's written in Unicode bold-script font to extended ASCII bytes. The characters in the bold-script font use code points from the range U+1D4D0 to U+1D503. The value 1D4D0 (in hex) is 120,016 (in decimal) and the value 1D503 is 120,067. As these values are in the code point range 65,536 – 1,114,111, each character is represented by four bytes. The first byte is always in the form "11110xxx" and the other three bytes are in the form "10xxxxxx". The first byte has only three bits available, which means it can only be printed in ASCII as one of these ASCII symbols "ðñòóôõö÷". The remaining three bytes have 64 possible values. In the output, every Unicode character is printed as four ASCII characters. The spaces, the dot, and the comma remain unchanged because they are equal in both encodings.
This example uses two emoticons in place of two words. All other words in the input sentence use ordinary Latin alphabet letters. The uppercase letters belong to the code positions from 65 to 90 and the lowercase letters belong to the range from 97 to 122. These letters are defined in the 7-bit ASCII table, so they don't change during the conversion. The dot (code point value 46) and apostrophe (code point value 39) symbols are also in the ASCII table and don't change. However, the emojis are not in the ASCII table so they turn into sequences of 4 bytes, and each byte is printed as an ANSI character, so you get four ANSI characters per emoji.
This example replaces a list of ball sports emoticons with extended ASCII symbols. The output includes 1-byte, 2-byte, 3-byte, and 4-byte characters. Latin letters are regular ASCII characters and are encoded in 1 byte. The "«" and "»" characters are represented as two bytes in Unicode and are converted to two symbols from the Windows-1252 (and ISO 8859-1) character encoding. The first two balls and the en dash symbol use three bytes, so they are converted to three characters. And the last four balls are represented as four bytes, so they are converted to four characters.
You can pass input to this tool via ?input query argument and it will automatically compute output. Here's how to type it in your browser's address bar. Click to try!
View and edit Unicode in a browser-based editor.
Spell out the names of Unicode characters in the input text.
URL-unescape Unicode text.
Convert base-2 data to Unicode encoding.
Convert base-8 data to Unicode encoding.
Convert base-10 data to Unicode encoding.
Convert base-16 data to Unicode encoding.
Convert Unicode text to any radix.
Convert any radix data to Unicode.
Convert Unicode text to ISO-8859-1 encoding.
Convert ISO-859-1 encoded data to Unicode.
Convert Unicode text to ISO-8859-2 encoding.
Convert ISO-8859-2 encoded data to Unicode.
Convert Unicode text to Ecoji encoding.
Convert Ecoji encoded data to Unicode.
Convert raw bytes to Unicode.
Check the Unicode version of the given Unicode characters.
Check if the given Unicode has valid encoding.
Encode Unicode text to Punycode encoding.
Decode Punycode encoding to Unicode.
Convert base64 data to Unicode text.
Convert Unicode to a valid data URL.
Convert a valid data URL to Unicode text.
Decode HTML entities to Unicode data.
Decode UTF8 encoding to Unicode.
Decode UTF16 encoding to Unicode.
Decode UTF32 encoding to Unicode.
Convert all Unicode characters to uppercase.
Convert all Unicode characters to lowercase.
Generate a list of all country flag icons.
Generate a list of all Unicode arrows.
Generate a list of all Unicode animals.
Generate a list of all Unicode flowers and plants.
Generate a list of all Unicode block elements.
Generate a list of all Egyptian hieroglyphs.
Generate a list of all currency symbols.
Use Unicode colors to generate a rainbow.
Create a smiley face from Unicode symbols.
Generate a list of random emojis.
Randomize case of all Unicode characters.
Convert all Unicode characters to lowercase.
Encode Unicode to JSON.
Decode JSON to Unicode.
Randomly rearrange the order of input graphemes.
Generate Alt codes for Unicode characters.
Generate Unicode glyphs from Alt codes.
Print statistics about Unicode data and code points.
Extract a part from Unicode data.
Generate waves with Unicode symbols.
Generate graphs using Unicode symbols.
Wrap a message in a Unicode box.
Subscribe to our updates. We'll let you know when we release new tools, features, and organize online workshops.
Enter your email here
We're Browserling — a friendly and fun cross-browser testing company powered by alien technology. At Browserling we love to make people's lives easier, so we created this collection of online Unicode tools. Our tools are focused on gettings things done and they have the simplest possible user interface. As soon as you load your Unicode data in the input of any of our tools, you'll instantly get the result in the output. Behind the scenes, our tools are actually powered by our web developer tools that we created over the last couple of years. Check them out!