International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F2A692

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򦒀
A6480
򦒁
A6481
򦒂
A6482
򦒃
A6483
򦒄
A6484
򦒅
A6485
򦒆
A6486
򦒇
A6487
򦒈
A6488
򦒉
A6489
򦒊
A648A
򦒋
A648B
򦒌
A648C
򦒍
A648D
򦒎
A648E
򦒏
A648F
80
90
򦒐
A6490
򦒑
A6491
򦒒
A6492
򦒓
A6493
򦒔
A6494
򦒕
A6495
򦒖
A6496
򦒗
A6497
򦒘
A6498
򦒙
A6499
򦒚
A649A
򦒛
A649B
򦒜
A649C
򦒝
A649D
򦒞
A649E
򦒟
A649F
90
A0
򦒠
A64A0
򦒡
A64A1
򦒢
A64A2
򦒣
A64A3
򦒤
A64A4
򦒥
A64A5
򦒦
A64A6
򦒧
A64A7
򦒨
A64A8
򦒩
A64A9
򦒪
A64AA
򦒫
A64AB
򦒬
A64AC
򦒭
A64AD
򦒮
A64AE
򦒯
A64AF
A0
B0
򦒰
A64B0
򦒱
A64B1
򦒲
A64B2
򦒳
A64B3
򦒴
A64B4
򦒵
A64B5
򦒶
A64B6
򦒷
A64B7
򦒸
A64B8
򦒹
A64B9
򦒺
A64BA
򦒻
A64BB
򦒼
A64BC
򦒽
A64BD
򦒾
A64BE
򦒿
A64BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]