This browser-based utility converts Unicode data to bytes. Anything that you paste or enter in the text area on the left automatically gets converted to bytes on the right. It supports the most popular Unicode encodings, such as UTF-8, UTF-16, UCS-2, UTF-32, and UCS-4, and it works with emoji characters. For the multi-byte encodings, you can select the byte order format and add a BOM marker. For the output bytes, you can choose between binary, octal, decimal, hexadecimal, or any other radix from 2 to 36. You can also add a prefix and zero-padding to bytes, set the output separator, and skip whitespace characters. Created by encoding gurus from team Browserling.
This browser-based utility converts Unicode data to bytes. Anything that you paste or enter in the text area on the left automatically gets converted to bytes on the right. It supports the most popular Unicode encodings, such as UTF-8, UTF-16, UCS-2, UTF-32, and UCS-4, and it works with emoji characters. For the multi-byte encodings, you can select the byte order format and add a BOM marker. For the output bytes, you can choose between binary, octal, decimal, hexadecimal, or any other radix from 2 to 36. You can also add a prefix and zero-padding to bytes, set the output separator, and skip whitespace characters. Created by encoding gurus from team Browserling.
This utility converts Unicode characters to bytes in the given encoding and base. You can use any of the five most popular Unicode encodings (UTF8/UTF16/UCS2/UTF32/UCS4) and use binary to hexatridecimal bases for the bytes. The difference between the encodings is how many bytes are required to represent any of 1,114,112 Unicode glyphs in memory. In the UTF8 encoding, 1 to 4 bytes (8, 16, 24, or 32 bits) are required to store a character. In the UTF16 and UCS2 encodings, one symbol is represented by a pair of bytes or two pairs of bytes (16 or 32 bits). In the UTF32 and UCS4 encodings, the representation is fixed-length and uses 4 bytes (exactly 32 bits). A sequence of two bytes is called a word and a sequence of four bytes is called a double-word. There are two ways to store bytes in words and double words – Big Endian and Little Endian. In the Big Endian format, the most significant bytes are stored first, and in the Little Endian format, the least significant bytes are stored first. To make it easier to determine the byte order, you can add a special byte mark in front of the bytes. This mark is called BOM – Byte Order Mark. By default, the bytes are converted to radix-16 (hex) before they are printed but you can quickly switch between binary, octal, decimal, and hex output formats. For the binary output format, you can add the 0b-prefix and enable padding to 8 bits. For the octal output format, you can add the o-prefix and enable padding to 3 digits. For the hex output format, you can add the 0x-prefix and enable padding to two digits. You can also change the case of base prefix symbols and output digits. For example, with the uppercase prefix and uppercase digits, the hex value "a0" will be printed as "0XA0". To improve the output and make the bytes more readable, you can customize the separator character that goes between the bytes and also use three quick options to skip converting spaces, tabs, and newlines.
This utility converts Unicode characters to bytes in the given encoding and base. You can use any of the five most popular Unicode encodings (UTF8/UTF16/UCS2/UTF32/UCS4) and use binary to hexatridecimal bases for the bytes. The difference between the encodings is how many bytes are required to represent any of 1,114,112 Unicode glyphs in memory. In the UTF8 encoding, 1 to 4 bytes (8, 16, 24, or 32 bits) are required to store a character. In the UTF16 and UCS2 encodings, one symbol is represented by a pair of bytes or two pairs of bytes (16 or 32 bits). In the UTF32 and UCS4 encodings, the representation is fixed-length and uses 4 bytes (exactly 32 bits). A sequence of two bytes is called a word and a sequence of four bytes is called a double-word. There are two ways to store bytes in words and double words – Big Endian and Little Endian. In the Big Endian format, the most significant bytes are stored first, and in the Little Endian format, the least significant bytes are stored first. To make it easier to determine the byte order, you can add a special byte mark in front of the bytes. This mark is called BOM – Byte Order Mark. By default, the bytes are converted to radix-16 (hex) before they are printed but you can quickly switch between binary, octal, decimal, and hex output formats. For the binary output format, you can add the 0b-prefix and enable padding to 8 bits. For the octal output format, you can add the o-prefix and enable padding to 3 digits. For the hex output format, you can add the 0x-prefix and enable padding to two digits. You can also change the case of base prefix symbols and output digits. For example, with the uppercase prefix and uppercase digits, the hex value "a0" will be printed as "0XA0". To improve the output and make the bytes more readable, you can customize the separator character that goes between the bytes and also use three quick options to skip converting spaces, tabs, and newlines.
In this example, we convert a Rumi quote written in a full-width Unicode font to octal bytes in UTF-8 encoding. The UTF-8 encoding uses 1, 2, 3, or 4 bytes for each character. The octal bytes are padded and use three digits for each byte. We have also added the octal o-prefix and have separated the output values with the space symbol.
This example turns a beautiful Unicode phrase into hexadecimal UTF-32 encoding with the Little Endian byte order. The UTF-32 encoding uses 4 bytes for each glyph and to quickly determine the encoding and byte order, it adds the BOM marker (0xFF, 0xFE, 0x00, 0x00) in front of the output. The output bytes are comma-separated, padded to two digits, and the hexadecimal prefix "0X" is printed in uppercase.
In this example, we convert a list of Unicode vegetable emojis to bytes with a custom radix-20. The input list is first encoded to UTF-8 encoding, then it's converted to individual bytes, then bytes are converted to radix-20, and then they are printed. We have also activated options to skip newlines, tabs, and spaces during the conversion so that the input structure is preserved.
You can pass input to this tool via ?input query argument and it will automatically compute output. Here's how to type it in your browser's address bar. Click to try!
View and edit Unicode in a browser-based editor.
Spell out the names of Unicode characters in the input text.
URL-unescape Unicode text.
Convert base-2 data to Unicode encoding.
Convert base-8 data to Unicode encoding.
Convert base-10 data to Unicode encoding.
Convert base-16 data to Unicode encoding.
Convert Unicode text to any radix.
Convert any radix data to Unicode.
Convert Unicode text to ISO-8859-1 encoding.
Convert ISO-859-1 encoded data to Unicode.
Convert Unicode text to ISO-8859-2 encoding.
Convert ISO-8859-2 encoded data to Unicode.
Convert Unicode text to Ecoji encoding.
Convert Ecoji encoded data to Unicode.
Convert raw bytes to Unicode.
Check the Unicode version of the given Unicode characters.
Check if the given Unicode has valid encoding.
Encode Unicode text to Punycode encoding.
Decode Punycode encoding to Unicode.
Convert base64 data to Unicode text.
Convert Unicode to a valid data URL.
Convert a valid data URL to Unicode text.
Decode HTML entities to Unicode data.
Decode UTF8 encoding to Unicode.
Decode UTF16 encoding to Unicode.
Decode UTF32 encoding to Unicode.
Convert all Unicode characters to uppercase.
Convert all Unicode characters to lowercase.
Generate a list of all country flag icons.
Generate a list of all Unicode arrows.
Generate a list of all Unicode animals.
Generate a list of all Unicode flowers and plants.
Generate a list of all Unicode block elements.
Generate a list of all Egyptian hieroglyphs.
Generate a list of all currency symbols.
Use Unicode colors to generate a rainbow.
Create a smiley face from Unicode symbols.
Generate a list of random emojis.
Randomize case of all Unicode characters.
Convert all Unicode characters to lowercase.
Encode Unicode to JSON.
Decode JSON to Unicode.
Randomly rearrange the order of input graphemes.
Generate Alt codes for Unicode characters.
Generate Unicode glyphs from Alt codes.
Print statistics about Unicode data and code points.
Extract a part from Unicode data.
Generate waves with Unicode symbols.
Generate graphs using Unicode symbols.
Wrap a message in a Unicode box.
Subscribe to our updates. We'll let you know when we release new tools, features, and organize online workshops.
Enter your email here
We're Browserling — a friendly and fun cross-browser testing company powered by alien technology. At Browserling we love to make people's lives easier, so we created this collection of online Unicode tools. Our tools are focused on gettings things done and they have the simplest possible user interface. As soon as you load your Unicode data in the input of any of our tools, you'll instantly get the result in the output. Behind the scenes, our tools are actually powered by our web developer tools that we created over the last couple of years. Check them out!