International Components for Unicode

UTF-8

List of Converter Aliases

Internal Converter Name   IANA
UTF-8                     UTF-8


Codepage Layout

Currently showing the codepage starting with the bytes F2 A1 84.

Each cell gives the Unicode code point (hexadecimal) for the trailing byte formed by adding the row and column labels. Rows 00 through 70 and C0 through F0 are empty.

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80  A1100 A1101 A1102 A1103 A1104 A1105 A1106 A1107 A1108 A1109 A110A A110B A110C A110D A110E A110F
90  A1110 A1111 A1112 A1113 A1114 A1115 A1116 A1117 A1118 A1119 A111A A111B A111C A111D A111E A111F
A0  A1120 A1121 A1122 A1123 A1124 A1125 A1126 A1127 A1128 A1129 A112A A112B A112C A112D A112E A112F
B0  A1130 A1131 A1132 A1133 A1134 A1135 A1136 A1137 A1138 A1139 A113A A113B A113C A113D A113E A113F
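
The code points in the layout above follow from the standard four-byte UTF-8 form, in which the lead byte carries 3 payload bits and each continuation byte carries 6. The following is a minimal sketch in Python, not ICU code; the helper name `decode_utf8_4byte` is hypothetical and it checks only the shape of the sequence, not overlong forms or range limits.

```python
def decode_utf8_4byte(b: bytes) -> int:
    """Decode one four-byte UTF-8 sequence into a code point.

    Only the byte pattern 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx is checked;
    overlong forms and values above U+10FFFF are not rejected here.
    """
    if len(b) != 4 or (b[0] & 0xF8) != 0xF0:
        raise ValueError("not a four-byte UTF-8 sequence")
    if any((x & 0xC0) != 0x80 for x in b[1:]):
        raise ValueError("bad continuation byte")
    return (
        ((b[0] & 0x07) << 18)    # 3 payload bits from the lead byte
        | ((b[1] & 0x3F) << 12)  # 6 payload bits from each continuation byte
        | ((b[2] & 0x3F) << 6)
        | (b[3] & 0x3F)
    )


# The three leading bytes shown on this page.
prefix = bytes([0xF2, 0xA1, 0x84])

# Only 0x80..0xBF are continuation bytes, which is why only rows 80 to B0 are filled.
first = decode_utf8_4byte(prefix + b"\x80")
last = decode_utf8_4byte(prefix + b"\xBF")

print(f"{first:X} .. {last:X}")
```

The built-in decoder can serve as a cross-check: `ord((prefix + b"\x80").decode("utf-8"))` should give the same value as `first`.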

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
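
The per-UChar byte counts above can be read against the fact that a UChar is a 16-bit UTF-16 code unit: a supplementary character takes four UTF-8 bytes but also two UChars. The sketch below uses plain Python rather than ICU, and the helper name `bytes_per_utf16_unit` and the sample characters are chosen here for illustration only.

```python
def bytes_per_utf16_unit(ch: str) -> float:
    """UTF-8 byte count divided by UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2  # two bytes per code unit
    return utf8_len / utf16_units


# One sample per UTF-8 length: 1, 2, 3 and 4 bytes.
samples = ["A", "\u00E9", "\u20AC", "\U000A1100"]
ratios = [bytes_per_utf16_unit(c) for c in samples]

# The substitution bytes listed above are the UTF-8 form of U+FFFD.
substitution = b"\xEF\xBF\xBD".decode("utf-8")

for ch, r in zip(samples, ratios):
    print(f"U+{ord(ch):04X}: {r} bytes per UTF-16 code unit")
print(f"substitution character: U+{ord(substitution):04X}")
```

In this sample the ratio peaks at 3 for a three-byte character in the Basic Multilingual Plane, while the four-byte character averages 2 because it occupies a surrogate pair.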

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
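
The set above excludes only the surrogate range U+D800 to U+DFFF. A minimal Python sketch of that membership test follows, compared against Python's own strict UTF-8 encoder; the function names are hypothetical and this is not how ICU evaluates the set.

```python
def utf8_representable(cp: int) -> bool:
    """True if cp is a code point outside the surrogate range U+D800..U+DFFF."""
    if not 0 <= cp <= 0x10FFFF:
        return False
    return not (0xD800 <= cp <= 0xDFFF)


def python_can_encode(cp: int) -> bool:
    """Ask Python's strict UTF-8 encoder whether it accepts the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Boundary cases around the excluded range, plus one code point from this page.
boundary = [0xD7FF, 0xD800, 0xDFFF, 0xE000, 0xA1100]
results = {cp: utf8_representable(cp) for cp in boundary}

for cp, ok in results.items():
    print(f"U+{cp:04X}: {'representable' if ok else 'excluded'}")
```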