International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F1A48B

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
񤋀
642C0
񤋁
642C1
񤋂
642C2
񤋃
642C3
񤋄
642C4
񤋅
642C5
񤋆
642C6
񤋇
642C7
񤋈
642C8
񤋉
642C9
񤋊
642CA
񤋋
642CB
񤋌
642CC
񤋍
642CD
񤋎
642CE
񤋏
642CF
80
90
񤋐
642D0
񤋑
642D1
񤋒
642D2
񤋓
642D3
񤋔
642D4
񤋕
642D5
񤋖
642D6
񤋗
642D7
񤋘
642D8
񤋙
642D9
񤋚
642DA
񤋛
642DB
񤋜
642DC
񤋝
642DD
񤋞
642DE
񤋟
642DF
90
A0
񤋠
642E0
񤋡
642E1
񤋢
642E2
񤋣
642E3
񤋤
642E4
񤋥
642E5
񤋦
642E6
񤋧
642E7
񤋨
642E8
񤋩
642E9
񤋪
642EA
񤋫
642EB
񤋬
642EC
񤋭
642ED
񤋮
642EE
񤋯
642EF
A0
B0
񤋰
642F0
񤋱
642F1
񤋲
642F2
񤋳
642F3
񤋴
642F4
񤋵
642F5
񤋶
642F6
񤋷
642F7
񤋸
642F8
񤋹
642F9
񤋺
642FA
񤋻
642FB
񤋼
642FC
񤋽
642FD
񤋾
642FE
񤋿
642FF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]