International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
IANA
UTF-8 UTF-8


Codepage Layout

Currently showing the codepage starting with the bytes F198A1

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
񘡀
58840
񘡁
58841
񘡂
58842
񘡃
58843
񘡄
58844
񘡅
58845
񘡆
58846
񘡇
58847
񘡈
58848
񘡉
58849
񘡊
5884A
񘡋
5884B
񘡌
5884C
񘡍
5884D
񘡎
5884E
񘡏
5884F
80
90
񘡐
58850
񘡑
58851
񘡒
58852
񘡓
58853
񘡔
58854
񘡕
58855
񘡖
58856
񘡗
58857
񘡘
58858
񘡙
58859
񘡚
5885A
񘡛
5885B
񘡜
5885C
񘡝
5885D
񘡞
5885E
񘡟
5885F
90
A0
񘡠
58860
񘡡
58861
񘡢
58862
񘡣
58863
񘡤
58864
񘡥
58865
񘡦
58866
񘡧
58867
񘡨
58868
񘡩
58869
񘡪
5886A
񘡫
5886B
񘡬
5886C
񘡭
5886D
񘡮
5886E
񘡯
5886F
A0
B0
񘡰
58870
񘡱
58871
񘡲
58872
񘡳
58873
񘡴
58874
񘡵
58875
񘡶
58876
񘡷
58877
񘡸
58878
񘡹
58879
񘡺
5887A
񘡻
5887B
񘡼
5887C
񘡽
5887D
񘡾
5887E
񘡿
5887F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]