International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
IANA
UTF-8 UTF-8


Codepage Layout

Currently showing the codepage starting with the bytes F0A080

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
𠀀
20000
𠀁
20001
𠀂
20002
𠀃
20003
𠀄
20004
𠀅
20005
𠀆
20006
𠀇
20007
𠀈
20008
𠀉
20009
𠀊
2000A
𠀋
2000B
𠀌
2000C
𠀍
2000D
𠀎
2000E
𠀏
2000F
80
90
𠀐
20010
𠀑
20011
𠀒
20012
𠀓
20013
𠀔
20014
𠀕
20015
𠀖
20016
𠀗
20017
𠀘
20018
𠀙
20019
𠀚
2001A
𠀛
2001B
𠀜
2001C
𠀝
2001D
𠀞
2001E
𠀟
2001F
90
A0
𠀠
20020
𠀡
20021
𠀢
20022
𠀣
20023
𠀤
20024
𠀥
20025
𠀦
20026
𠀧
20027
𠀨
20028
𠀩
20029
𠀪
2002A
𠀫
2002B
𠀬
2002C
𠀭
2002D
𠀮
2002E
𠀯
2002F
A0
B0
𠀰
20030
𠀱
20031
𠀲
20032
𠀳
20033
𠀴
20034
𠀵
20035
𠀶
20036
𠀷
20037
𠀸
20038
𠀹
20039
𠀺
2003A
𠀻
2003B
𠀼
2003C
𠀽
2003D
𠀾
2003E
𠀿
2003F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]