International Components for Unicode

UTF-8

List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8
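
As a rough illustration of how an alias list like this might be used, the sketch below maps each alias to the internal converter name. The helper names (`normalize`, `resolve`) are hypothetical, and the loose matching rule (ignore case, '-', '_' and spaces) is an assumption made for this example; it is not ICU's actual lookup code.

```python
# Aliases copied from the list above. The matching rule below is an
# assumption for illustration, not ICU's implementation.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name: str) -> str:
    # Hypothetical helper: lowercase the name and drop '-', '_' and spaces.
    return "".join(ch for ch in name.lower() if ch not in "-_ ")


# Every normalized alias points at the internal converter name.
ALIAS_TABLE = {normalize(alias): "UTF-8" for alias in UTF8_ALIASES}


def resolve(name: str):
    # Return the internal converter name, or None if the alias is unknown.
    return ALIAS_TABLE.get(normalize(name))


print(resolve("IBM_1208"))       # UTF-8
print(resolve("Windows-65001"))  # UTF-8
print(resolve("latin-1"))        # None
```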


Codepage Layout

Currently showing the codepage starting with the bytes F39992

Each cell gives the code point (hexadecimal) of the character encoded by F3 99 92 followed by the byte formed from the row and column values. Rows 00 to 70 and C0 to F0 are empty.

      00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80    D9480 D9481 D9482 D9483 D9484 D9485 D9486 D9487 D9488 D9489 D948A D948B D948C D948D D948E D948F
90    D9490 D9491 D9492 D9493 D9494 D9495 D9496 D9497 D9498 D9499 D949A D949B D949C D949D D949E D949F
A0    D94A0 D94A1 D94A2 D94A3 D94A4 D94A5 D94A6 D94A7 D94A8 D94A9 D94AA D94AB D94AC D94AD D94AE D94AF
B0    D94B0 D94B1 D94B2 D94B3 D94B4 D94B5 D94B6 D94B7 D94B8 D94B9 D94BA D94BB D94BC D94BD D94BE D94BF
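
The mapping from the bytes F3 99 92 xx to the code points D9480 through D94BF follows from the standard four-byte UTF-8 bit layout. The sketch below works through that arithmetic by hand and cross-checks it against Python's built-in decoder; the function name `decode_four_byte` is made up for this example.

```python
def decode_four_byte(b: bytes) -> int:
    # Combine the payload bits of a 4-byte UTF-8 sequence into a code point.
    # Layout: 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx
    if len(b) != 4 or b[0] >> 3 != 0b11110:
        raise ValueError("not a 4-byte UTF-8 lead byte")
    if any(x >> 6 != 0b10 for x in b[1:]):
        raise ValueError("bad continuation byte")
    return (
        ((b[0] & 0x07) << 18)
        | ((b[1] & 0x3F) << 12)
        | ((b[2] & 0x3F) << 6)
        | (b[3] & 0x3F)
    )


first = decode_four_byte(bytes.fromhex("F3999280"))
last = decode_four_byte(bytes.fromhex("F39992BF"))
print(f"{first:X} {last:X}")  # D9480 D94BF

# Cross-check against the built-in UTF-8 decoder.
assert bytes.fromhex("F3999280").decode("utf-8") == chr(first)
```

Only the trailing bytes 80 through BF are valid continuation bytes, which is why the other rows of the layout are empty.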

Information About This Converter

Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
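
The byte counts above are per UChar, which in ICU is a 16-bit UTF-16 code unit, not per code point. A minimal sketch, using only Python's standard codecs (not ICU itself), shows why the maximum is 3 even though some characters take 4 bytes in UTF-8, and that the substitution bytes are the encoding of U+FFFD. The helper name `utf8_bytes_per_uchar` is hypothetical.

```python
def utf8_bytes_per_uchar(ch: str) -> float:
    # Ratio of UTF-8 bytes to UTF-16 code units for a single character.
    utf8_len = len(ch.encode("utf-8"))
    uchars = len(ch.encode("utf-16-le")) // 2  # 2 bytes per UTF-16 code unit
    return utf8_len / uchars


print(utf8_bytes_per_uchar("A"))           # 1.0 (1 byte, 1 code unit)
print(utf8_bytes_per_uchar("\uFFFD"))      # 3.0 (3 bytes, 1 code unit)
print(utf8_bytes_per_uchar(chr(0xD9480)))  # 2.0 (4 bytes, 2 code units)

# The substitution character \xEF\xBF\xBD is U+FFFD in UTF-8.
assert "\uFFFD".encode("utf-8") == b"\xef\xbf\xbd"

# Python's "replace" error handler substitutes the same character
# when it meets an invalid byte.
assert b"\xff".decode("utf-8", errors="replace") == "\uFFFD"
```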

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
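
The set above excludes only the surrogate range D800 to DFFF. The sketch below expresses that rule as a predicate and compares it with the behaviour of Python's strict UTF-8 encoder, which rejects lone surrogates. Both function names are made up for this example, and the comparison says nothing about how ICU implements the check.

```python
def is_representable(cp: int) -> bool:
    # Membership test for [^\uD800-\uDFFF] over the Unicode code space:
    # any code point is allowed except the surrogates.
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def python_can_encode(cp: int) -> bool:
    # True if Python's strict UTF-8 encoder accepts the code point.
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


for cp in (0x41, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0xD9480, 0x10FFFF):
    assert is_representable(cp) == python_can_encode(cp)
    print(f"U+{cp:04X}: {is_representable(cp)}")
```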