International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F39995

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󙕀
D9540
󙕁
D9541
󙕂
D9542
󙕃
D9543
󙕄
D9544
󙕅
D9545
󙕆
D9546
󙕇
D9547
󙕈
D9548
󙕉
D9549
󙕊
D954A
󙕋
D954B
󙕌
D954C
󙕍
D954D
󙕎
D954E
󙕏
D954F
80
90
󙕐
D9550
󙕑
D9551
󙕒
D9552
󙕓
D9553
󙕔
D9554
󙕕
D9555
󙕖
D9556
󙕗
D9557
󙕘
D9558
󙕙
D9559
󙕚
D955A
󙕛
D955B
󙕜
D955C
󙕝
D955D
󙕞
D955E
󙕟
D955F
90
A0
󙕠
D9560
󙕡
D9561
󙕢
D9562
󙕣
D9563
󙕤
D9564
󙕥
D9565
󙕦
D9566
󙕧
D9567
󙕨
D9568
󙕩
D9569
󙕪
D956A
󙕫
D956B
󙕬
D956C
󙕭
D956D
󙕮
D956E
󙕯
D956F
A0
B0
󙕰
D9570
󙕱
D9571
󙕲
D9572
󙕳
D9573
󙕴
D9574
󙕵
D9575
󙕶
D9576
󙕷
D9577
󙕸
D9578
󙕹
D9579
󙕺
D957A
󙕻
D957B
󙕼
D957C
󙕽
D957D
󙕾
D957E
󙕿
D957F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]