International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F0AA90

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
𪐀
2A400
𪐁
2A401
𪐂
2A402
𪐃
2A403
𪐄
2A404
𪐅
2A405
𪐆
2A406
𪐇
2A407
𪐈
2A408
𪐉
2A409
𪐊
2A40A
𪐋
2A40B
𪐌
2A40C
𪐍
2A40D
𪐎
2A40E
𪐏
2A40F
80
90
𪐐
2A410
𪐑
2A411
𪐒
2A412
𪐓
2A413
𪐔
2A414
𪐕
2A415
𪐖
2A416
𪐗
2A417
𪐘
2A418
𪐙
2A419
𪐚
2A41A
𪐛
2A41B
𪐜
2A41C
𪐝
2A41D
𪐞
2A41E
𪐟
2A41F
90
A0
𪐠
2A420
𪐡
2A421
𪐢
2A422
𪐣
2A423
𪐤
2A424
𪐥
2A425
𪐦
2A426
𪐧
2A427
𪐨
2A428
𪐩
2A429
𪐪
2A42A
𪐫
2A42B
𪐬
2A42C
𪐭
2A42D
𪐮
2A42E
𪐯
2A42F
A0
B0
𪐰
2A430
𪐱
2A431
𪐲
2A432
𪐳
2A433
𪐴
2A434
𪐵
2A435
𪐶
2A436
𪐷
2A437
𪐸
2A438
𪐹
2A439
𪐺
2A43A
𪐻
2A43B
𪐼
2A43C
𪐽
2A43D
𪐾
2A43E
𪐿
2A43F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]