Unicode Text Normalizer
Quick Tips
- • This tool runs entirely in your browser - your data stays private.
- • Press Ctrl+V (Cmd+V on Mac) to quickly paste text.
- • Use the Copy button to save your result to clipboard.
- • Bookmark this page for quick access!
Normalize Unicode text to NFC, NFD, NFKC, or NFKD forms.
Examples
e + combining acute (e\u0301)
e with acute as single character (NFC)
fi ligature (U+FB01)
fi as separate letters (NFKC)
Half symbol ½
1/2 (NFKC)
Fullwidth ABC
Standard ABC (NFKC)
Frequently Asked Questions
Unicode normalization converts text to a standard form where characters with multiple possible encodings use just one consistent encoding. This ensures that visually identical text is also identical at the byte level for comparison and storage.
NFC (Composed) is recommended for most purposes: web content, databases, and general text storage. It creates compact text with precomposed characters. Use NFKC if you also want to normalize visual variants like ligatures and symbols.
The same visual character can have multiple Unicode representations. An accented letter might be one codepoint or a base letter plus combining mark. Normalizing both strings to the same form makes them match if they're visually identical.
C (Composed) and D (Decomposed) handle canonical equivalence (same character, different encoding). KC and KD additionally handle compatibility equivalence (visually similar but distinct characters, like ligatures). KC/KD are more aggressive transformations.
NFC and NFD preserve meaning—they only standardize encoding. NFKC and NFKD may change meaning by converting compatibility characters: the "fi" ligature becomes separate letters, which might matter for exact text reproduction.
Related Tools
Duplicate Remover
<p>The Duplicate Remover is an essential text processing too...
Tab to Spaces Converter
<p>Our Tab to Spaces converter replaces tab characters with...
Spaces to Tabs Converter
<p>Our Spaces to Tabs converter replaces space-based indenta...
Remove Punctuation
<p>Our Remove Punctuation tool strips punctuation marks from...
Remove Duplicate Lines
Remove duplicate lines from your text while preserving the o...
Remove Empty Lines
Remove all empty or blank lines from your text. Clean up mes...
Related Articles
Homoglyph Attacks: How Lookalike Characters Threaten Security
Learn how homoglyph attacks use lookalike characters to deceive users and systems. Understand the security risks and how to detect these sophisticated threats.
Read moreHow to Fix Mojibake and Broken Text Encoding
Learn how to identify and fix mojibake and other text encoding problems. Understand why characters appear garbled and how to restore them to readable text.
Read moreHow to Wrap Lines in Quotes for Code and Data Processing
Learn how to efficiently wrap text lines in quotes for programming arrays, SQL queries, and data processing. Complete guide with practical examples.
Read more