This browser-based utility counts individual characters (graphemes) and the total number of bytes in Unicode text. Anything that you paste or enter in the text area on the left automatically gets split into symbols or bytes and gets counted on the right. It supports the most popular Unicode encodings (such as UTF-8, UTF-16, and UTF-32) and it works with emoji characters. You can also select input Unicode text encoding and adjust whitespace counting. Created by encoding gurus from team Browserling.
This utility finds the length of Unicode data. It can calculate the number of graphemes in Unicode text or the number of bytes these graphemes take when encoded in various Unicode encodings. A grapheme is a single symbol of a written language. Some examples of graphemes are regular letters (e.g. "e" or "ǣ"), symbols (e.g. "☯" or "☎"), numbers (e.g. "5" or "Π"), emojis (e.g. "🌙" or "💎") and many others. Each grapheme has a length of one but when encoded in computer memory, it can consist of many bytes. The number of bytes required depends on the selected Unicode encoding. For example, the star character "⭐" uses 3 bytes in UTF8 encoding, 2 bytes in UTF16 encoding, and 4 bytes in UTF32 encoding. When calculating the length of Unicode text, you can switch between grapheme mode and byte mode in the options. We also have three extra options for managing whitespace counting. Namely, you can exclude counting newlines, tabs, and spaces. If the input text has several lines (separated by newlines), then you can find the length of each line separately. This is done by activating the "Count Lines Separately" option. When this option is active, the program will count the number of graphemes or bytes individually for each line and display the length next to each line.
In this example, we draw a maze using Unicode box drawing symbols. The objective of this puzzle is for the mouse to get to the cheese. The mouse, cheese, and all box characters are graphemes and we count how many of them are in this puzzle. We also include newlines, tabs, and spaces in the result. We find that this maze is made out of 258 graphemes.
In this example, we load a six-step recipe for a delicious sandwich as the input, with each step written on a separate line. To count the number of characters on each line (each step), we select the "Count Graphemes" mode and also activate the "Count Lines Separately" option. At the output, we get a vertical list of six numbers that correspond to the length of each line of text.
In this example, we create the logo for the American animation studio "Pixar" using different Unicode characters. It is easy to see that the name consists of only five graphemes, so we decided to calculate the number of bytes here. It turns out that these five symbols take 26 bytes in the UTF8 encoding, 24 bytes in UTF16 encoding, and 28 in UTF32 encoding.
This example counts the number of bytes of a chess set in the UTF16 encoding. It ignores newline characters, and as a result, the output value is 500 bytes. For UTF32 encoding there are twice as many bytes, namely 1000 because one character in UTF16 usually takes 2 bytes but in UTF32 always takes 4 bytes. For UTF8 encoding it is much less – 298 bytes because it's a variable-width encoding with one to four bytes per symbol.
You can pass input to this tool via ?input query argument and it will automatically compute output. Here's how to type it in your browser's address bar. Click to try!
Quickly find code positions of all Unicode values.
Quickly decode code positions to Unicode values.
Quickly encode Unicode values to UTF-8 encoding.
Quickly encode Unicode values to UTF-16 encoding.
Quickly encode Unicode values to UTF-32 encoding.
Quickly create a picture from Unicode symbols.
Quickly generate random Unicode text in a given range.
Quickly generate all Unicode values from the given code point interval.
Quickly filter Unicode symbols that are within the given code point interval.
Quickly split Unicode data into graphemes.
Quickly sort Unicode glyphs in increasing or decreasing order.
Quickly find the length of Unicode text.
Quickly increase Unicode code point values.
Quickly decrease Unicode code point values.
Quickly reverse the order of symbols in Unicode text.
Quickly rotate Unicode characters to the left and right.
Quickly create multiple copies of Unicode text.
Quickly extract all characters from Unicode text.
Quickly split Unicode data into pieces.
Quickly split Unicode text into chunks of constant length.
Quickly merge Unicode snippets together.
Quickly shorten Unicode text to the given length.
Quickly left-pad Unicode text with any character.
Quickly right-pad Unicode text with any character.
Quickly align Unicode data to the center.
Quickly align Unicode data to the right.
Quickly convert ordinary numbers to fancy Unicode numbers.
Quickly convert Unicode numbers back to regular numbers.
Quickly convert ordinary letters to Unicode letters in various fonts.
Quickly convert Unicode letters back to regular Latin letters.
Quickly convert ordinary text to fancy Unicode text.
Quickly convert fancy Unicode text back to regular text.
Quickly combine input Unicode with diacritical marks.
Quickly delete diacritical marks from the Unicode text.
Quickly spoof regular text using Unicode homoglyphs.
Quickly unspoof homoglyphs in text with regular letters.
Quickly check if the given Unicode text is spoofed or not.
Quickly circularly rearrange Unicode symbols.
Quickly convert Unicode data to escape sequences.
Quickly encode Unicode data to HTML entities.
Quickly URL-escape Unicode symbols.
Quickly encode Unicode values to base64.
Quickly encode Unicode values to a data URI.
Quickly convert Unicode characters to raw bytes.
Quickly convert Unicode data to base-2 (binary).
Quickly convert Unicode data to base-8 (octal).
Quickly convert Unicode data to base-10 (decimal).
Quickly convert Unicode data to base-16 (hexadecimal).
Quickly convert Unicode symbols to raw ASCII bytes.
Quickly convert ASCII bytes to Unicode symbols.
Quickly convert Unicode text to a string literal.
Quickly create a picture from Unicode emojis.
Quickly release Zalgo on your Unicode text.
Quickly make Zalgo text readable again.
View and edit Unicode in a browser-based editor.
Spell out the names of Unicode characters in the input text.
URL-unescape Unicode text.
Convert base-2 data to Unicode encoding.
Convert base-8 data to Unicode encoding.
Convert base-10 data to Unicode encoding.
Convert base-16 data to Unicode encoding.
Convert Unicode text to any radix.
Convert any radix data to Unicode.
Convert Unicode text to ISO-8859-1 encoding.
Convert ISO-859-1 encoded data to Unicode.
Convert Unicode text to ISO-8859-2 encoding.
Convert ISO-8859-2 encoded data to Unicode.
Convert Unicode text to Ecoji encoding.
Convert Ecoji encoded data to Unicode.
Convert raw bytes to Unicode.
Check the Unicode version of the given Unicode characters.
Check if the given Unicode has valid encoding.
Encode Unicode text to Punycode encoding.
Decode Punycode encoding to Unicode.
Convert base64 data to Unicode text.
Convert Unicode to a valid data URL.
Convert a valid data URL to Unicode text.
Decode HTML entities to Unicode data.
Decode UTF8 encoding to Unicode.
Decode UTF16 encoding to Unicode.
Decode UTF32 encoding to Unicode.
Convert all Unicode characters to uppercase.
Convert all Unicode characters to lowercase.
Generate a list of all country flag icons.
Generate a list of all Unicode arrows.
Generate a list of all Unicode animals.
Generate a list of all Unicode flowers and plants.
Generate a list of all Unicode block elements.
Generate a list of all Egyptian hieroglyphs.
Generate a list of all currency symbols.
Use Unicode colors to generate a rainbow.
Create a smiley face from Unicode symbols.
Generate a list of random emojis.
Randomize case of all Unicode characters.
Convert all Unicode characters to lowercase.
Encode Unicode to JSON.
Decode JSON to Unicode.
Randomly rearrange the order of input graphemes.
Generate Alt codes for Unicode characters.
Generate Unicode glyphs from Alt codes.
Print statistics about Unicode data and code points.
Extract a part from Unicode data.
Generate waves with Unicode symbols.
Generate graphs using Unicode symbols.
Wrap a message in a Unicode box.
Subscribe to our updates. We'll let you know when we release new tools, features, and organize online workshops.
Enter your email here