International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
IANA
UTF-8 UTF-8


Codepage Layout

Currently showing the codepage starting with the bytes F19888

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
񘈀
58200
񘈁
58201
񘈂
58202
񘈃
58203
񘈄
58204
񘈅
58205
񘈆
58206
񘈇
58207
񘈈
58208
񘈉
58209
񘈊
5820A
񘈋
5820B
񘈌
5820C
񘈍
5820D
񘈎
5820E
񘈏
5820F
80
90
񘈐
58210
񘈑
58211
񘈒
58212
񘈓
58213
񘈔
58214
񘈕
58215
񘈖
58216
񘈗
58217
񘈘
58218
񘈙
58219
񘈚
5821A
񘈛
5821B
񘈜
5821C
񘈝
5821D
񘈞
5821E
񘈟
5821F
90
A0
񘈠
58220
񘈡
58221
񘈢
58222
񘈣
58223
񘈤
58224
񘈥
58225
񘈦
58226
񘈧
58227
񘈨
58228
񘈩
58229
񘈪
5822A
񘈫
5822B
񘈬
5822C
񘈭
5822D
񘈮
5822E
񘈯
5822F
A0
B0
񘈰
58230
񘈱
58231
񘈲
58232
񘈳
58233
񘈴
58234
񘈵
58235
񘈶
58236
񘈷
58237
񘈸
58238
񘈹
58239
񘈺
5823A
񘈻
5823B
񘈼
5823C
񘈽
5823D
񘈾
5823E
񘈿
5823F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]