International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
IANA
UTF-8 UTF-8


Codepage Layout

Currently showing the codepage starting with the bytes F0B8A0

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
𸠀
38800
𸠁
38801
𸠂
38802
𸠃
38803
𸠄
38804
𸠅
38805
𸠆
38806
𸠇
38807
𸠈
38808
𸠉
38809
𸠊
3880A
𸠋
3880B
𸠌
3880C
𸠍
3880D
𸠎
3880E
𸠏
3880F
80
90
𸠐
38810
𸠑
38811
𸠒
38812
𸠓
38813
𸠔
38814
𸠕
38815
𸠖
38816
𸠗
38817
𸠘
38818
𸠙
38819
𸠚
3881A
𸠛
3881B
𸠜
3881C
𸠝
3881D
𸠞
3881E
𸠟
3881F
90
A0
𸠠
38820
𸠡
38821
𸠢
38822
𸠣
38823
𸠤
38824
𸠥
38825
𸠦
38826
𸠧
38827
𸠨
38828
𸠩
38829
𸠪
3882A
𸠫
3882B
𸠬
3882C
𸠭
3882D
𸠮
3882E
𸠯
3882F
A0
B0
𸠰
38830
𸠱
38831
𸠲
38832
𸠳
38833
𸠴
38834
𸠵
38835
𸠶
38836
𸠷
38837
𸠸
38838
𸠹
38839
𸠺
3883A
𸠻
3883B
𸠼
3883C
𸠽
3883D
𸠾
3883E
𸠿
3883F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]