International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
IANA
UTF-8 UTF-8


Codepage Layout

Currently showing the codepage starting with the bytes F0998C

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
𙌀
19300
𙌁
19301
𙌂
19302
𙌃
19303
𙌄
19304
𙌅
19305
𙌆
19306
𙌇
19307
𙌈
19308
𙌉
19309
𙌊
1930A
𙌋
1930B
𙌌
1930C
𙌍
1930D
𙌎
1930E
𙌏
1930F
80
90
𙌐
19310
𙌑
19311
𙌒
19312
𙌓
19313
𙌔
19314
𙌕
19315
𙌖
19316
𙌗
19317
𙌘
19318
𙌙
19319
𙌚
1931A
𙌛
1931B
𙌜
1931C
𙌝
1931D
𙌞
1931E
𙌟
1931F
90
A0
𙌠
19320
𙌡
19321
𙌢
19322
𙌣
19323
𙌤
19324
𙌥
19325
𙌦
19326
𙌧
19327
𙌨
19328
𙌩
19329
𙌪
1932A
𙌫
1932B
𙌬
1932C
𙌭
1932D
𙌮
1932E
𙌯
1932F
A0
B0
𙌰
19330
𙌱
19331
𙌲
19332
𙌳
19333
𙌴
19334
𙌵
19335
𙌶
19336
𙌷
19337
𙌸
19338
𙌹
19339
𙌺
1933A
𙌻
1933B
𙌼
1933C
𙌽
1933D
𙌾
1933E
𙌿
1933F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]