International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

ICU  >  Demonstrations  > 

Normalization Browser




Decomposition exclusions:
Unicode version:
(Help)

Normalization Results
ModeQuick CheckNormalizedText
Input c3a4 c3ab c3af 00e1 00e2 00e3쎤쎫쎯áâã
NFDNO110a 1167 11ab 110a 1167 11b2 110a 1167 11b6 0061 0301 0061 0302 0061 0303 쎤쎫쎯áâã
NFCYESc3a4 c3ab c3af 00e1 00e2 00e3 쎤쎫쎯áâã
NFKDNO110a 1167 11ab 110a 1167 11b2 110a 1167 11b6 0061 0301 0061 0302 0061 0303 쎤쎫쎯áâã
NFKCYESc3a4 c3ab c3af 00e1 00e2 00e3 쎤쎫쎯áâã
FCDYESc3a4 c3ab c3af 00e1 00e2 00e3 쎤쎫쎯áâã

About this demo

The options flags are for a prototype to demonstrate tailored normalization as mentioned as Unicode public review issue 7. Uncheck all of these options for regular Unicode Normalization.

Hangul excludes AC00..D7A3. CJK Compat. excludes CJK Compatibility Ideographs (those with a canonical decomposition).

The Unicode 3.2 option performs normalization according to Unicode 3.2 (except for NormalizationCorrections) even if ICU otherwise supports a higher version.

FCD is not a normalization form but a test for whether text is canonically ordered. "Normalizing to FCD" does not generate a unique form but only one of potentially many that are canonically ordered. See UTN #5 Canonical Equivalence in Applications.


Unicode version 15.1 — ICU 74.1