International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
IANA
UTF-8 UTF-8


Codepage Layout

Currently showing the codepage starting with the bytes F193A0

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
񓠀
53800
񓠁
53801
񓠂
53802
񓠃
53803
񓠄
53804
񓠅
53805
񓠆
53806
񓠇
53807
񓠈
53808
񓠉
53809
񓠊
5380A
񓠋
5380B
񓠌
5380C
񓠍
5380D
񓠎
5380E
񓠏
5380F
80
90
񓠐
53810
񓠑
53811
񓠒
53812
񓠓
53813
񓠔
53814
񓠕
53815
񓠖
53816
񓠗
53817
񓠘
53818
񓠙
53819
񓠚
5381A
񓠛
5381B
񓠜
5381C
񓠝
5381D
񓠞
5381E
񓠟
5381F
90
A0
񓠠
53820
񓠡
53821
񓠢
53822
񓠣
53823
񓠤
53824
񓠥
53825
񓠦
53826
񓠧
53827
񓠨
53828
񓠩
53829
񓠪
5382A
񓠫
5382B
񓠬
5382C
񓠭
5382D
񓠮
5382E
񓠯
5382F
A0
B0
񓠰
53830
񓠱
53831
񓠲
53832
񓠳
53833
񓠴
53834
񓠵
53835
񓠶
53836
񓠷
53837
񓠸
53838
񓠹
53839
񓠺
5383A
񓠻
5383B
񓠼
5383C
񓠽
5383D
񓠾
5383E
񓠿
5383F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]