International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
IANA
UTF-8 UTF-8


Codepage Layout

Currently showing the codepage starting with the bytes F38DB2

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󍲀
CDC80
󍲁
CDC81
󍲂
CDC82
󍲃
CDC83
󍲄
CDC84
󍲅
CDC85
󍲆
CDC86
󍲇
CDC87
󍲈
CDC88
󍲉
CDC89
󍲊
CDC8A
󍲋
CDC8B
󍲌
CDC8C
󍲍
CDC8D
󍲎
CDC8E
󍲏
CDC8F
80
90
󍲐
CDC90
󍲑
CDC91
󍲒
CDC92
󍲓
CDC93
󍲔
CDC94
󍲕
CDC95
󍲖
CDC96
󍲗
CDC97
󍲘
CDC98
󍲙
CDC99
󍲚
CDC9A
󍲛
CDC9B
󍲜
CDC9C
󍲝
CDC9D
󍲞
CDC9E
󍲟
CDC9F
90
A0
󍲠
CDCA0
󍲡
CDCA1
󍲢
CDCA2
󍲣
CDCA3
󍲤
CDCA4
󍲥
CDCA5
󍲦
CDCA6
󍲧
CDCA7
󍲨
CDCA8
󍲩
CDCA9
󍲪
CDCAA
󍲫
CDCAB
󍲬
CDCAC
󍲭
CDCAD
󍲮
CDCAE
󍲯
CDCAF
A0
B0
󍲰
CDCB0
󍲱
CDCB1
󍲲
CDCB2
󍲳
CDCB3
󍲴
CDCB4
󍲵
CDCB5
󍲶
CDCB6
󍲷
CDCB7
󍲸
CDCB8
󍲹
CDCB9
󍲺
CDCBA
󍲻
CDCBB
󍲼
CDCBC
󍲽
CDCBD
󍲾
CDCBE
󍲿
CDCBF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]