International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F0BA91

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
𺑀
3A440
𺑁
3A441
𺑂
3A442
𺑃
3A443
𺑄
3A444
𺑅
3A445
𺑆
3A446
𺑇
3A447
𺑈
3A448
𺑉
3A449
𺑊
3A44A
𺑋
3A44B
𺑌
3A44C
𺑍
3A44D
𺑎
3A44E
𺑏
3A44F
80
90
𺑐
3A450
𺑑
3A451
𺑒
3A452
𺑓
3A453
𺑔
3A454
𺑕
3A455
𺑖
3A456
𺑗
3A457
𺑘
3A458
𺑙
3A459
𺑚
3A45A
𺑛
3A45B
𺑜
3A45C
𺑝
3A45D
𺑞
3A45E
𺑟
3A45F
90
A0
𺑠
3A460
𺑡
3A461
𺑢
3A462
𺑣
3A463
𺑤
3A464
𺑥
3A465
𺑦
3A466
𺑧
3A467
𺑨
3A468
𺑩
3A469
𺑪
3A46A
𺑫
3A46B
𺑬
3A46C
𺑭
3A46D
𺑮
3A46E
𺑯
3A46F
A0
B0
𺑰
3A470
𺑱
3A471
𺑲
3A472
𺑳
3A473
𺑴
3A474
𺑵
3A475
𺑶
3A476
𺑷
3A477
𺑸
3A478
𺑹
3A479
𺑺
3A47A
𺑻
3A47B
𺑼
3A47C
𺑽
3A47D
𺑾
3A47E
𺑿
3A47F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]