This browser-based utility escapes Unicode data. Anything that you paste or enter in the text area on the left automatically gets escaped on the right. It supports the most popular Unicode encodings (such as UTF-8, UTF-16, UCS-2, UTF-32, and UCS-4) and it works with emoji characters. You can escape Unicode symbols to sequences of bytes or code points and adjust the escape format. Created by encoding gurus from team Browserling.
This browser-based utility escapes Unicode data. Anything that you paste or enter in the text area on the left automatically gets escaped on the right. It supports the most popular Unicode encodings (such as UTF-8, UTF-16, UCS-2, UTF-32, and UCS-4) and it works with emoji characters. You can escape Unicode symbols to sequences of bytes or code points and adjust the escape format. Created by encoding gurus from team Browserling.
This utility converts Unicode characters to Unicode escape sequences. All input symbols and emojis are first converted to bytes or code points and then these values are formatted according to the selected escape format. The byte escape format represents each Unicode character as one or more bytes. The most popular byte escape formats are backslash-escape (for example, \x61), percent-escape (for example, %61), dollar-escape (for example, $61), and curly and angle bracket escape (for example, \x{61} and <61>). Bytes can be printed in binary, octal, decimal, or hexadecimal base and they can be prefixed with "0x" (hex), "o" (octal), or "0b" (binary) base prefixes. The code point escape format represents each Unicode character as a unique code position. The most common code position escape sequences are backslash-u escape "\u" (for example, \u0061), uni-prefix escape "uni" (for example, uni0061), percent-sign escape (for example, %61), and HTML-escape in hex and decimal notations (for example, a or a). Code points can also use bases 2, 8, 10, and 16. We made a list of the most popular escape formats for bytes and code points and you can select them from the predefined list for quick access. If you can't find an escape format that you need, you can enter it yourself in the "Custom Byte Format" and "Custom Code Point Format" fields by first selecting the "Custom Format" from the predefined list above the fields. To define bytes and code points in the format string, use the percent symbol % followed by one of the letters: "B" (which means insert a byte or code point in the binary base), "O" (octal base), "D" (decimal base), or "H" (hex base). When working with byte sequences, you can select the encoding of input Unicode data in the second column of options. By default, the UTF8 encoding is used but you can also choose UTF16, UTF32, UCS2, and UCS4, and select Big Endian or Little Endian byte order formats, and add a BOM marker. The third column of options allows you to change the text case of the escaped units in the output and adjust the separator between escaped bytes or escaped code points.
This utility converts Unicode characters to Unicode escape sequences. All input symbols and emojis are first converted to bytes or code points and then these values are formatted according to the selected escape format. The byte escape format represents each Unicode character as one or more bytes. The most popular byte escape formats are backslash-escape (for example, \x61), percent-escape (for example, %61), dollar-escape (for example, $61), and curly and angle bracket escape (for example, \x{61} and <61>). Bytes can be printed in binary, octal, decimal, or hexadecimal base and they can be prefixed with "0x" (hex), "o" (octal), or "0b" (binary) base prefixes. The code point escape format represents each Unicode character as a unique code position. The most common code position escape sequences are backslash-u escape "\u" (for example, \u0061), uni-prefix escape "uni" (for example, uni0061), percent-sign escape (for example, %61), and HTML-escape in hex and decimal notations (for example, a or a). Code points can also use bases 2, 8, 10, and 16. We made a list of the most popular escape formats for bytes and code points and you can select them from the predefined list for quick access. If you can't find an escape format that you need, you can enter it yourself in the "Custom Byte Format" and "Custom Code Point Format" fields by first selecting the "Custom Format" from the predefined list above the fields. To define bytes and code points in the format string, use the percent symbol % followed by one of the letters: "B" (which means insert a byte or code point in the binary base), "O" (octal base), "D" (decimal base), or "H" (hex base). When working with byte sequences, you can select the encoding of input Unicode data in the second column of options. By default, the UTF8 encoding is used but you can also choose UTF16, UTF32, UCS2, and UCS4, and select Big Endian or Little Endian byte order formats, and add a BOM marker. The third column of options allows you to change the text case of the escaped units in the output and adjust the separator between escaped bytes or escaped code points.
In this example, we convert Aristotle's quote written in bold-italic Unicode font to a code point escape sequence that works in JavaScript. In JavaScript, you can define Unicode strings as sequences of backslashes, followed by the "u" character and the code point wrapped in curly brackets. Code points must be 5 digits long and be in hexadecimal base. We output the entire escape sequence surrounded by double quotes and use lower case letters for hex digits. If you paste the output text in JavaScript console, you'll see the original text.
This example loads a beautifully formatted Unicode text and escapes it to valid Ruby Unicode sequences. In the Ruby programming language, Unicode glyphs are represented in the backslash-u format. This format has the form \uUUUU, where UUUU is either a code point of the glyph or a surrogate pair. To demonstrate more possibilities, we comma-separate individual code positions and print them in lower case.
This example uses the 0xHH byte format to escape data with Unicode family emojis. The 0xHH format is simply hexadecimal bytes of the input data. As we have selected the UTF8 encoding, it converts emojis and other characters to a sequence of one, two, three, or four bytes per Unicode symbol. Then it adds the hexadecimal prefix "0x" to each byte and displays bytes separated by spaces.
This example converts a quote by Lao Tzu to a Python escape format. In Python code, Unicode characters can be encoded using a special escape sequence, which is a backslash, an uppercase or lowercase letter "u", followed by eight hex digits. This format is not in the quick format list, so we set it manually as a custom format. To do it, we select the "Custom" entry in the code point list and enter the "\U%HHHHHHHH" in the field below it. Eight "H" letters mean a zero-padded hexadecimal number with length 8 and the symbol "%" means the start of the hexadecimal format.
In this example, we encode a funny Unicode face as a binary bit sequence. Each byte in the sequence has a length of 8 bits and we also add the binary prefix "0b" before each byte. As we're using the UTF-16 Unicode encoding with the Little Endian byte order format, each Unicode character has two or four bytes. The output is formatted so that each escape unit is printed on a new line.
In this example, we use the "Escaped Octal Bytes" format to print dragon and dinosaur emojis as octal numbers. We treat the emoticons as UCS-4 Big Endian data (which is the same as UTF-32 Big Endian) and add the Byte Order Mark (BOM) in front of the data. In the output, we get a series of backslash-escaped semicolon-separated octets.
You can pass input to this tool via ?input query argument and it will automatically compute output. Here's how to type it in your browser's address bar. Click to try!
View and edit Unicode in a browser-based editor.
Spell out the names of Unicode characters in the input text.
URL-unescape Unicode text.
Convert base-2 data to Unicode encoding.
Convert base-8 data to Unicode encoding.
Convert base-10 data to Unicode encoding.
Convert base-16 data to Unicode encoding.
Convert Unicode text to any radix.
Convert any radix data to Unicode.
Convert Unicode text to ISO-8859-1 encoding.
Convert ISO-859-1 encoded data to Unicode.
Convert Unicode text to ISO-8859-2 encoding.
Convert ISO-8859-2 encoded data to Unicode.
Convert Unicode text to Ecoji encoding.
Convert Ecoji encoded data to Unicode.
Convert raw bytes to Unicode.
Check the Unicode version of the given Unicode characters.
Check if the given Unicode has valid encoding.
Encode Unicode text to Punycode encoding.
Decode Punycode encoding to Unicode.
Convert base64 data to Unicode text.
Convert Unicode to a valid data URL.
Convert a valid data URL to Unicode text.
Decode HTML entities to Unicode data.
Decode UTF8 encoding to Unicode.
Decode UTF16 encoding to Unicode.
Decode UTF32 encoding to Unicode.
Convert all Unicode characters to uppercase.
Convert all Unicode characters to lowercase.
Generate a list of all country flag icons.
Generate a list of all Unicode arrows.
Generate a list of all Unicode animals.
Generate a list of all Unicode flowers and plants.
Generate a list of all Unicode block elements.
Generate a list of all Egyptian hieroglyphs.
Generate a list of all currency symbols.
Use Unicode colors to generate a rainbow.
Create a smiley face from Unicode symbols.
Generate a list of random emojis.
Randomize case of all Unicode characters.
Convert all Unicode characters to lowercase.
Encode Unicode to JSON.
Decode JSON to Unicode.
Randomly rearrange the order of input graphemes.
Generate Alt codes for Unicode characters.
Generate Unicode glyphs from Alt codes.
Print statistics about Unicode data and code points.
Extract a part from Unicode data.
Generate waves with Unicode symbols.
Generate graphs using Unicode symbols.
Wrap a message in a Unicode box.
Subscribe to our updates. We'll let you know when we release new tools, features, and organize online workshops.
Enter your email here
We're Browserling — a friendly and fun cross-browser testing company powered by alien technology. At Browserling we love to make people's lives easier, so we created this collection of online Unicode tools. Our tools are focused on gettings things done and they have the simplest possible user interface. As soon as you load your Unicode data in the input of any of our tools, you'll instantly get the result in the output. Behind the scenes, our tools are actually powered by our web developer tools that we created over the last couple of years. Check them out!