This browser-based utility unspoofs Unicode text that contains homoglyphs (symbols that visually resemble or are identical to other symbols) by converting them into regular Latin letters and numbers. You can paste or enter text containing multiple homoglyphs in the input area and it will be transformed into plain text written in ASCII characters in the output. This tool can unspoof letters, punctuation marks, spaces, digits, and it even removes zero-width spaces between individual symbols. Created by encoding gurus from team Browserling.
This browser-based utility unspoofs Unicode text that contains homoglyphs (symbols that visually resemble or are identical to other symbols) by converting them into regular Latin letters and numbers. You can paste or enter text containing multiple homoglyphs in the input area and it will be transformed into plain text written in ASCII characters in the output. This tool can unspoof letters, punctuation marks, spaces, digits, and it even removes zero-width spaces between individual symbols. Created by encoding gurus from team Browserling.
This online utility detects unusual letters in the text and replaces them with regular English letters from the ASCII character set. It performs the opposite operation of the Spoof Unicode Text utility, which creates spoofed text by replacing English letters with Unicode homoglyphs. Homoglyphs are characters that look visually the same or are completely identical to other characters but have different Unicode code points. For example, the Latin letter "A" (U+0041), the Cyrillic letter "А" (U+0410), and the Greek capital letter alpha "Α" (U+0391) look almost the same but have different code points and are located in different Unicode blocks. The term "homoglyph" comes from the fact that these characters have a similar appearance (called a glyph), but they are not necessarily related in terms of their meaning or origin. Homoglyphs can appear in different writing systems, including Latin, Cyrillic, Greek, Turkish, and Arabic, among others. In addition to letters, our tool can also handle the homoglyphs of other characters, such as punctuation marks and spaces. For example, it can replace the single "Low-9" quotation mark "‚" (U+201A) back to the regular comma symbol "," (U+002C). The program recognizes and replaces over 15 different whitespace characters, including the four-per-em space " " (U+2005, also called mid space), em space " " (U+2003, also called mutton), and the thin space " " (U+2009). The program is also able to remove invisible zero-width spaces that are often inserted between characters to mix up the text. Such spaces are zero-width space (U+200B), zero-width no-break space (U+FEFF), and Mongolian vowel separator (U+180E). If you want to find exactly which letters and symbols in the text are spoofed, you can use the Check Spoofed Unicode Text tool that we also created.
This online utility detects unusual letters in the text and replaces them with regular English letters from the ASCII character set. It performs the opposite operation of the Spoof Unicode Text utility, which creates spoofed text by replacing English letters with Unicode homoglyphs. Homoglyphs are characters that look visually the same or are completely identical to other characters but have different Unicode code points. For example, the Latin letter "A" (U+0041), the Cyrillic letter "А" (U+0410), and the Greek capital letter alpha "Α" (U+0391) look almost the same but have different code points and are located in different Unicode blocks. The term "homoglyph" comes from the fact that these characters have a similar appearance (called a glyph), but they are not necessarily related in terms of their meaning or origin. Homoglyphs can appear in different writing systems, including Latin, Cyrillic, Greek, Turkish, and Arabic, among others. In addition to letters, our tool can also handle the homoglyphs of other characters, such as punctuation marks and spaces. For example, it can replace the single "Low-9" quotation mark "‚" (U+201A) back to the regular comma symbol "," (U+002C). The program recognizes and replaces over 15 different whitespace characters, including the four-per-em space " " (U+2005, also called mid space), em space " " (U+2003, also called mutton), and the thin space " " (U+2009). The program is also able to remove invisible zero-width spaces that are often inserted between characters to mix up the text. Such spaces are zero-width space (U+200B), zero-width no-break space (U+FEFF), and Mongolian vowel separator (U+180E). If you want to find exactly which letters and symbols in the text are spoofed, you can use the Check Spoofed Unicode Text tool that we also created.
This is an example from our commenting system. Someone posted a quote from Abraham Lincoln in our comments and it was not detected by our plagiarism checker. It turns out that the quote used multiple homoglyphs (here it's the letters "а", "е", "у", and "о") and when our checker scanned the text, it did not know it was regular English letters. Now that we run it through our program, it turns all these homoglyphs into regular English letters from the ASCII character set.
In this example, we fix letter and word spacing in a small text fragment. Although it visually appears that all the letters are joined together and the same space character is used between the words throughout the text, this is actually not the case. Different space characters are used between words. For instance, between the first and second word, there is an "En Quad" (U+2000), between the second and third word, there is an "Em Quad" (U+2001), and so on. In addition to visible spaces, there are also many invisible spaces called "Zero Width Space" (U+200B) inserted between letters. But as soon as we insert this text as the input, we get clean text with normal spaces (U+0020) between the words and no zero-width spaces anywhere in the output.
In this example, we use our text unspoofer program to normalize text that we copied from a historical document. After copying a passage of such text from an old file, we notice that the text now contains many homoglyphs for various letters that differ slightly in style and size. To obtain a usable clean text without old letters, we input this document in the tool and get nice ASCII output.
You can pass input to this tool via ?input query argument and it will automatically compute output. Here's how to type it in your browser's address bar. Click to try!
View and edit Unicode in a browser-based editor.
Spell out the names of Unicode characters in the input text.
URL-unescape Unicode text.
Convert base-2 data to Unicode encoding.
Convert base-8 data to Unicode encoding.
Convert base-10 data to Unicode encoding.
Convert base-16 data to Unicode encoding.
Convert Unicode text to any radix.
Convert any radix data to Unicode.
Convert Unicode text to ISO-8859-1 encoding.
Convert ISO-859-1 encoded data to Unicode.
Convert Unicode text to ISO-8859-2 encoding.
Convert ISO-8859-2 encoded data to Unicode.
Convert Unicode text to Ecoji encoding.
Convert Ecoji encoded data to Unicode.
Convert raw bytes to Unicode.
Check the Unicode version of the given Unicode characters.
Check if the given Unicode has valid encoding.
Encode Unicode text to Punycode encoding.
Decode Punycode encoding to Unicode.
Convert base64 data to Unicode text.
Convert Unicode to a valid data URL.
Convert a valid data URL to Unicode text.
Decode HTML entities to Unicode data.
Decode UTF8 encoding to Unicode.
Decode UTF16 encoding to Unicode.
Decode UTF32 encoding to Unicode.
Convert all Unicode characters to uppercase.
Convert all Unicode characters to lowercase.
Generate a list of all country flag icons.
Generate a list of all Unicode arrows.
Generate a list of all Unicode animals.
Generate a list of all Unicode flowers and plants.
Generate a list of all Unicode block elements.
Generate a list of all Egyptian hieroglyphs.
Generate a list of all currency symbols.
Use Unicode colors to generate a rainbow.
Create a smiley face from Unicode symbols.
Generate a list of random emojis.
Randomize case of all Unicode characters.
Convert all Unicode characters to lowercase.
Encode Unicode to JSON.
Decode JSON to Unicode.
Randomly rearrange the order of input graphemes.
Generate Alt codes for Unicode characters.
Generate Unicode glyphs from Alt codes.
Print statistics about Unicode data and code points.
Extract a part from Unicode data.
Generate waves with Unicode symbols.
Generate graphs using Unicode symbols.
Wrap a message in a Unicode box.
Subscribe to our updates. We'll let you know when we release new tools, features, and organize online workshops.
Enter your email here
We're Browserling — a friendly and fun cross-browser testing company powered by alien technology. At Browserling we love to make people's lives easier, so we created this collection of online Unicode tools. Our tools are focused on gettings things done and they have the simplest possible user interface. As soon as you load your Unicode data in the input of any of our tools, you'll instantly get the result in the output. Behind the scenes, our tools are actually powered by our web developer tools that we created over the last couple of years. Check them out!