International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F28AA1

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򊡀
8A840
򊡁
8A841
򊡂
8A842
򊡃
8A843
򊡄
8A844
򊡅
8A845
򊡆
8A846
򊡇
8A847
򊡈
8A848
򊡉
8A849
򊡊
8A84A
򊡋
8A84B
򊡌
8A84C
򊡍
8A84D
򊡎
8A84E
򊡏
8A84F
80
90
򊡐
8A850
򊡑
8A851
򊡒
8A852
򊡓
8A853
򊡔
8A854
򊡕
8A855
򊡖
8A856
򊡗
8A857
򊡘
8A858
򊡙
8A859
򊡚
8A85A
򊡛
8A85B
򊡜
8A85C
򊡝
8A85D
򊡞
8A85E
򊡟
8A85F
90
A0
򊡠
8A860
򊡡
8A861
򊡢
8A862
򊡣
8A863
򊡤
8A864
򊡥
8A865
򊡦
8A866
򊡧
8A867
򊡨
8A868
򊡩
8A869
򊡪
8A86A
򊡫
8A86B
򊡬
8A86C
򊡭
8A86D
򊡮
8A86E
򊡯
8A86F
A0
B0
򊡰
8A870
򊡱
8A871
򊡲
8A872
򊡳
8A873
򊡴
8A874
򊡵
8A875
򊡶
8A876
򊡷
8A877
򊡸
8A878
򊡹
8A879
򊡺
8A87A
򊡻
8A87B
򊡼
8A87C
򊡽
8A87D
򊡾
8A87E
򊡿
8A87F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]