Character Encoding Detector
Quick Tips
- This tool runs entirely in your browser - your data stays private.
- Press Ctrl+V (Cmd+V on Mac) to quickly paste text.
- Use the Copy button to save your result to clipboard.
- Bookmark this page for quick access!
Detect the character encoding of text (UTF-8, ISO-8859-1, etc.).
Examples
Input: Text with UTF-8 BOM
Result: UTF-8 (BOM detected, high confidence)
Input: Caf\xe9 in bytes
Result: ISO-8859-1 or Windows-1252 (medium confidence)
Input: Plain ASCII text
Result: ASCII/UTF-8 compatible (any encoding works)
Frequently Asked Questions
Text becomes garbled when opened with the wrong encoding. If a file was saved as UTF-8 but opened as ISO-8859-1 (or vice versa), characters display incorrectly. Detecting the actual encoding lets you open it correctly.
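The garbling described above can be reproduced in a few lines. This is a minimal sketch using only Python's built-in codecs; the sample word "Café" is chosen purely for illustration.

```python
# Encode text as UTF-8, then decode the same bytes with the wrong encoding.
original = "Café"
utf8_bytes = original.encode("utf-8")       # 5 bytes: é becomes C3 A9

# Reading UTF-8 bytes as ISO-8859-1 turns each byte into its own character.
garbled = utf8_bytes.decode("iso-8859-1")

# Reversing the wrong step recovers the text, provided no bytes were lost.
repaired = garbled.encode("iso-8859-1").decode("utf-8")

# The opposite mistake: ISO-8859-1 bytes are not valid UTF-8, so a lenient
# decoder substitutes the replacement character U+FFFD.
latin1_bytes = original.encode("iso-8859-1")  # 4 bytes: é becomes E9
replaced = latin1_bytes.decode("utf-8", errors="replace")

print(garbled)
print(repaired)
print(replaced)
```

The round trip only works because ISO-8859-1 maps every byte value to a character; with other wrong encodings some bytes may be dropped or altered, and the original text cannot always be restored.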
UTF-8 is a variable-length encoding that can represent any Unicode character. It's backward-compatible with ASCII and has become the standard for web content and modern systems. Most new text should use UTF-8.
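To make "variable-length" concrete, the sketch below counts how many bytes UTF-8 uses for a few sample characters. The characters are arbitrary picks for illustration.

```python
# Each character is encoded with 1 to 4 bytes depending on its code point.
samples = ["A", "é", "€", "😀"]
lengths = {ch: len(ch.encode("utf-8")) for ch in samples}

for ch, n in lengths.items():
    print(f"U+{ord(ch):04X} uses {n} byte(s)")

# Backward compatibility with ASCII: ASCII-only text yields identical bytes.
same_as_ascii = "hello".encode("utf-8") == "hello".encode("ascii")
print(same_as_ascii)
```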
Detection accuracy depends on text length and character variety. Longer text with special characters (accents, symbols) is easier to identify. Short plain ASCII text is ambiguous since many encodings are identical for basic characters.
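One simple way to approach detection, shown here as a rough sketch and not as the method this tool actually uses, is to try strict decoders in order. The function name `guess_encoding` and its return labels are hypothetical.

```python
def guess_encoding(data: bytes) -> str:
    """Hypothetical helper: return a rough label for the likely encoding."""
    if not data:
        return "empty"
    try:
        # Only bytes 0-127: valid in ASCII, UTF-8, ISO-8859-1 and others.
        data.decode("ascii")
        return "ascii (ambiguous)"
    except UnicodeDecodeError:
        pass
    try:
        # Multi-byte UTF-8 sequences follow strict rules, so random
        # non-ASCII bytes rarely pass this check by accident.
        data.decode("utf-8")
        return "utf-8"
    except UnicodeDecodeError:
        # Single-byte encodings accept almost any byte, so this is a guess.
        return "iso-8859-1 or windows-1252"


print(guess_encoding(b"Plain ASCII text"))
print(guess_encoding("Café".encode("utf-8")))
print(guess_encoding(b"Caf\xe9"))
```

The fallback label is only a guess: a single accented byte such as E9 fits several single-byte encodings, which is why short inputs yield lower confidence.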
A BOM (byte order mark) is a special byte sequence at the start of a file that identifies its encoding. UTF-8's BOM is EF BB BF. UTF-16 uses FF FE (little-endian) or FE FF (big-endian). When present, a BOM is the strongest available indicator of the encoding.
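A BOM check can be sketched with the constants in Python's standard `codecs` module. The helper name `sniff_bom` is hypothetical, and the sketch covers only the UTF-8 and UTF-16 marks mentioned above; it assumes UTF-32 input does not occur.

```python
import codecs

# BOM byte sequences paired with the encoding each one indicates.
BOMS = [
    (codecs.BOM_UTF8, "utf-8"),          # EF BB BF
    (codecs.BOM_UTF16_LE, "utf-16-le"),  # FF FE
    (codecs.BOM_UTF16_BE, "utf-16-be"),  # FE FF
]


def sniff_bom(data: bytes):
    """Hypothetical helper: return the encoding named by a leading BOM."""
    for bom, name in BOMS:
        if data.startswith(bom):
            return name
    return None  # No BOM found; other detection methods are needed.


print(sniff_bom(b"\xef\xbb\xbfhello"))
print(sniff_bom(b"\xff\xfeh\x00i\x00"))
print(sniff_bom(b"hello"))
```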
The same bytes can be valid in more than one encoding. This is especially true for ASCII characters (0-127), which are identical across many encodings. Detection relies on non-ASCII characters to distinguish encodings. Pure ASCII text is compatible with UTF-8, ISO-8859-1, and others.
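The ambiguity is easy to demonstrate. In this sketch, the encoding list and sample bytes are chosen for illustration; `cp1252` is Python's codec name for Windows-1252.

```python
# ASCII-only bytes decode to the same text under every encoding listed.
data = b"Plain ASCII text"
encodings = ("ascii", "utf-8", "iso-8859-1", "cp1252")
decoded = {enc: data.decode(enc) for enc in encodings}
all_same = len(set(decoded.values())) == 1
print(all_same)

# A non-ASCII byte can still be ambiguous: E9 is "é" in both
# ISO-8859-1 and Windows-1252.
e9_matches = b"\xe9".decode("iso-8859-1") == b"\xe9".decode("cp1252")
print(e9_matches)

# Byte 80 is where the two differ: a control character in ISO-8859-1,
# the euro sign in Windows-1252.
differs_at_80 = b"\x80".decode("iso-8859-1") != b"\x80".decode("cp1252")
print(differs_at_80)
```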
Related Articles
Homoglyph Attacks: How Lookalike Characters Threaten Security
Learn how homoglyph attacks use lookalike characters to deceive users and systems. Understand the security risks and how to detect these sophisticated threats.
How to Fix Mojibake and Broken Text Encoding
Learn how to identify and fix mojibake and other text encoding problems. Understand why characters appear garbled and how to restore them to readable text.
Unicode Normalization Explained: A Complete Guide for Developers
Learn how Unicode normalization works and why it matters for text processing. Master NFC, NFD, NFKC, and NFKD forms with practical examples.