International Components for Unicode


UTF-8


List of Converter Aliases
Internal Converter Name    IANA
UTF-8                      UTF-8


Codepage Layout

Currently showing the codepage starting with the bytes F19898

Rows give the high nibble of the final byte, columns the low nibble. Each filled cell shows the character above its code point.

       00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70  (empty)
80     񘘀     񘘁     񘘂     񘘃     񘘄     񘘅     񘘆     񘘇     񘘈     񘘉     񘘊     񘘋     񘘌     񘘍     񘘎     񘘏
       58600 58601 58602 58603 58604 58605 58606 58607 58608 58609 5860A 5860B 5860C 5860D 5860E 5860F
90     񘘐     񘘑     񘘒     񘘓     񘘔     񘘕     񘘖     񘘗     񘘘     񘘙     񘘚     񘘛     񘘜     񘘝     񘘞     񘘟
       58610 58611 58612 58613 58614 58615 58616 58617 58618 58619 5861A 5861B 5861C 5861D 5861E 5861F
A0     񘘠     񘘡     񘘢     񘘣     񘘤     񘘥     񘘦     񘘧     񘘨     񘘩     񘘪     񘘫     񘘬     񘘭     񘘮     񘘯
       58620 58621 58622 58623 58624 58625 58626 58627 58628 58629 5862A 5862B 5862C 5862D 5862E 5862F
B0     񘘰     񘘱     񘘲     񘘳     񘘴     񘘵     񘘶     񘘷     񘘸     񘘹     񘘺     񘘻     񘘼     񘘽     񘘾     񘘿
       58630 58631 58632 58633 58634 58635 58636 58637 58638 58639 5863A 5863B 5863C 5863D 5863E 5863F
C0-F0  (empty)
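
The code points in this table follow from the standard UTF-8 bit layout for four-byte sequences. As a minimal sketch in plain Python (this is not ICU's converter code, and the helper name `decode_four_byte` is hypothetical), the lead bytes F1 98 98 combined with a final byte in the range 80 to BF can be decoded by hand:

```python
def decode_four_byte(b0: int, b1: int, b2: int, b3: int) -> int:
    """Decode one four-byte UTF-8 sequence into a code point.

    Only the bit patterns are checked; overlong forms and values
    above U+10FFFF are not rejected in this sketch.
    """
    if b0 & 0xF8 != 0xF0:
        raise ValueError("not a four-byte lead byte")
    for b in (b1, b2, b3):
        if b & 0xC0 != 0x80:
            raise ValueError("not a continuation byte")
    # 3 payload bits from the lead byte, 6 from each continuation byte.
    return ((b0 & 0x07) << 18) | ((b1 & 0x3F) << 12) | ((b2 & 0x3F) << 6) | (b3 & 0x3F)


first = decode_four_byte(0xF1, 0x98, 0x98, 0x80)
last = decode_four_byte(0xF1, 0x98, 0x98, 0xBF)
print(f"{first:04X}..{last:04X}")  # 58600..5863F

# Cross-check against Python's built-in UTF-8 decoder.
assert chr(first) == bytes([0xF1, 0x98, 0x98, 0x80]).decode("utf-8")
```

Final bytes outside 80 to BF are not continuation bytes, which is why the other rows of the table are empty.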

Information About This Converter
Type of converter:                       UCNV_UTF8
Minimum number of bytes per UChar:       1
Maximum number of bytes per UChar:       3
Substitution character:                  \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?         TRUE
Is ASCII [\u0020-\u007E] ambiguous?      FALSE
Contains ambiguous aliases?              FALSE
Always generates Unicode NFC?            UNKNOWN
Contains BiDi characters?                TRUE
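
The byte counts above are per UChar, which in ICU4C is a 16-bit UTF-16 code unit, not a whole code point. That is why the maximum is 3 even though supplementary characters take four bytes in UTF-8: they also take two UChars. The sketch below uses only Python's built-in codecs, not the ICU API, and the helper name `bytes_per_utf16_unit` is hypothetical:

```python
def bytes_per_utf16_unit(ch: str) -> float:
    """UTF-8 byte count divided by UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


samples = ["A", "\u00E9", "\uFFFD", "\U00058600"]
ratios = [bytes_per_utf16_unit(c) for c in samples]
# A: 1 byte / 1 unit, U+00E9: 2 / 1, U+FFFD: 3 / 1, U+58600: 4 / 2
print(ratios)  # [1.0, 2.0, 3.0, 2.0]

# The substitution bytes decode to U+FFFD REPLACEMENT CHARACTER.
substitution = b"\xEF\xBF\xBD".decode("utf-8")
print(f"U+{ord(substitution):04X}")  # U+FFFD
```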

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
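
The set above reads as "every code point except the surrogates U+D800 to U+DFFF". As a hedged illustration, the sketch below compares that rule with what Python's strict UTF-8 encoder accepts. Python's codec stands in for ICU here, and both helper names are hypothetical:

```python
def is_representable(code_point: int) -> bool:
    """True if the code point is a valid scalar value, i.e. not a surrogate."""
    in_unicode_range = 0 <= code_point <= 0x10FFFF
    is_surrogate = 0xD800 <= code_point <= 0xDFFF
    return in_unicode_range and not is_surrogate


def encodes_as_utf8(code_point: int) -> bool:
    """True if Python's strict UTF-8 encoder accepts the code point."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Boundary values on both sides of the surrogate block, plus one table entry.
for cp in (0x0041, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x58600, 0x10FFFF):
    assert is_representable(cp) == encodes_as_utf8(cp)
```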