International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
IANA
UTF-8 UTF-8


Codepage Layout

Currently showing the codepage starting with the bytes F18EA2

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
񎢀
4E880
񎢁
4E881
񎢂
4E882
񎢃
4E883
񎢄
4E884
񎢅
4E885
񎢆
4E886
񎢇
4E887
񎢈
4E888
񎢉
4E889
񎢊
4E88A
񎢋
4E88B
񎢌
4E88C
񎢍
4E88D
񎢎
4E88E
񎢏
4E88F
80
90
񎢐
4E890
񎢑
4E891
񎢒
4E892
񎢓
4E893
񎢔
4E894
񎢕
4E895
񎢖
4E896
񎢗
4E897
񎢘
4E898
񎢙
4E899
񎢚
4E89A
񎢛
4E89B
񎢜
4E89C
񎢝
4E89D
񎢞
4E89E
񎢟
4E89F
90
A0
񎢠
4E8A0
񎢡
4E8A1
񎢢
4E8A2
񎢣
4E8A3
񎢤
4E8A4
񎢥
4E8A5
񎢦
4E8A6
񎢧
4E8A7
񎢨
4E8A8
񎢩
4E8A9
񎢪
4E8AA
񎢫
4E8AB
񎢬
4E8AC
񎢭
4E8AD
񎢮
4E8AE
񎢯
4E8AF
A0
B0
񎢰
4E8B0
񎢱
4E8B1
񎢲
4E8B2
񎢳
4E8B3
񎢴
4E8B4
񎢵
4E8B5
񎢶
4E8B6
񎢷
4E8B7
񎢸
4E8B8
񎢹
4E8B9
񎢺
4E8BA
񎢻
4E8BB
񎢼
4E8BC
񎢽
4E8BD
񎢾
4E8BE
񎢿
4E8BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]