International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F09D98

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
𝘀
1D600
𝘁
1D601
𝘂
1D602
𝘃
1D603
𝘄
1D604
𝘅
1D605
𝘆
1D606
𝘇
1D607
𝘈
1D608
𝘉
1D609
𝘊
1D60A
𝘋
1D60B
𝘌
1D60C
𝘍
1D60D
𝘎
1D60E
𝘏
1D60F
80
90
𝘐
1D610
𝘑
1D611
𝘒
1D612
𝘓
1D613
𝘔
1D614
𝘕
1D615
𝘖
1D616
𝘗
1D617
𝘘
1D618
𝘙
1D619
𝘚
1D61A
𝘛
1D61B
𝘜
1D61C
𝘝
1D61D
𝘞
1D61E
𝘟
1D61F
90
A0
𝘠
1D620
𝘡
1D621
𝘢
1D622
𝘣
1D623
𝘤
1D624
𝘥
1D625
𝘦
1D626
𝘧
1D627
𝘨
1D628
𝘩
1D629
𝘪
1D62A
𝘫
1D62B
𝘬
1D62C
𝘭
1D62D
𝘮
1D62E
𝘯
1D62F
A0
B0
𝘰
1D630
𝘱
1D631
𝘲
1D632
𝘳
1D633
𝘴
1D634
𝘵
1D635
𝘶
1D636
𝘷
1D637
𝘸
1D638
𝘹
1D639
𝘺
1D63A
𝘻
1D63B
𝘼
1D63C
𝘽
1D63D
𝘾
1D63E
𝘿
1D63F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]