International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F2BF99

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򿙀
BF640
򿙁
BF641
򿙂
BF642
򿙃
BF643
򿙄
BF644
򿙅
BF645
򿙆
BF646
򿙇
BF647
򿙈
BF648
򿙉
BF649
򿙊
BF64A
򿙋
BF64B
򿙌
BF64C
򿙍
BF64D
򿙎
BF64E
򿙏
BF64F
80
90
򿙐
BF650
򿙑
BF651
򿙒
BF652
򿙓
BF653
򿙔
BF654
򿙕
BF655
򿙖
BF656
򿙗
BF657
򿙘
BF658
򿙙
BF659
򿙚
BF65A
򿙛
BF65B
򿙜
BF65C
򿙝
BF65D
򿙞
BF65E
򿙟
BF65F
90
A0
򿙠
BF660
򿙡
BF661
򿙢
BF662
򿙣
BF663
򿙤
BF664
򿙥
BF665
򿙦
BF666
򿙧
BF667
򿙨
BF668
򿙩
BF669
򿙪
BF66A
򿙫
BF66B
򿙬
BF66C
򿙭
BF66D
򿙮
BF66E
򿙯
BF66F
A0
B0
򿙰
BF670
򿙱
BF671
򿙲
BF672
򿙳
BF673
򿙴
BF674
򿙵
BF675
򿙶
BF676
򿙷
BF677
򿙸
BF678
򿙹
BF679
򿙺
BF67A
򿙻
BF67B
򿙼
BF67C
򿙽
BF67D
򿙾
BF67E
򿙿
BF67F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]