International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
IANA
UTF-8 UTF-8


Codepage Layout

Currently showing the codepage starting with the bytes F1A499

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
񤙀
64640
񤙁
64641
񤙂
64642
񤙃
64643
񤙄
64644
񤙅
64645
񤙆
64646
񤙇
64647
񤙈
64648
񤙉
64649
񤙊
6464A
񤙋
6464B
񤙌
6464C
񤙍
6464D
񤙎
6464E
񤙏
6464F
80
90
񤙐
64650
񤙑
64651
񤙒
64652
񤙓
64653
񤙔
64654
񤙕
64655
񤙖
64656
񤙗
64657
񤙘
64658
񤙙
64659
񤙚
6465A
񤙛
6465B
񤙜
6465C
񤙝
6465D
񤙞
6465E
񤙟
6465F
90
A0
񤙠
64660
񤙡
64661
񤙢
64662
񤙣
64663
񤙤
64664
񤙥
64665
񤙦
64666
񤙧
64667
񤙨
64668
񤙩
64669
񤙪
6466A
񤙫
6466B
񤙬
6466C
񤙭
6466D
񤙮
6466E
񤙯
6466F
A0
B0
񤙰
64670
񤙱
64671
񤙲
64672
񤙳
64673
񤙴
64674
񤙵
64675
񤙶
64676
񤙷
64677
񤙸
64678
񤙹
64679
񤙺
6467A
񤙻
6467B
񤙼
6467C
񤙽
6467D
񤙾
6467E
񤙿
6467F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]