International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
IANA
UTF-8 UTF-8


Codepage Layout

Currently showing the codepage starting with the bytes F3A8A1

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󨡀
E8840
󨡁
E8841
󨡂
E8842
󨡃
E8843
󨡄
E8844
󨡅
E8845
󨡆
E8846
󨡇
E8847
󨡈
E8848
󨡉
E8849
󨡊
E884A
󨡋
E884B
󨡌
E884C
󨡍
E884D
󨡎
E884E
󨡏
E884F
80
90
󨡐
E8850
󨡑
E8851
󨡒
E8852
󨡓
E8853
󨡔
E8854
󨡕
E8855
󨡖
E8856
󨡗
E8857
󨡘
E8858
󨡙
E8859
󨡚
E885A
󨡛
E885B
󨡜
E885C
󨡝
E885D
󨡞
E885E
󨡟
E885F
90
A0
󨡠
E8860
󨡡
E8861
󨡢
E8862
󨡣
E8863
󨡤
E8864
󨡥
E8865
󨡦
E8866
󨡧
E8867
󨡨
E8868
󨡩
E8869
󨡪
E886A
󨡫
E886B
󨡬
E886C
󨡭
E886D
󨡮
E886E
󨡯
E886F
A0
B0
󨡰
E8870
󨡱
E8871
󨡲
E8872
󨡳
E8873
󨡴
E8874
󨡵
E8875
󨡶
E8876
󨡷
E8877
󨡸
E8878
󨡹
E8879
󨡺
E887A
󨡻
E887B
󨡼
E887C
󨡽
E887D
󨡾
E887E
󨡿
E887F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]