International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
IANA
UTF-8 UTF-8


Codepage Layout

Currently showing the codepage starting with the bytes F18888

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
񈈀
48200
񈈁
48201
񈈂
48202
񈈃
48203
񈈄
48204
񈈅
48205
񈈆
48206
񈈇
48207
񈈈
48208
񈈉
48209
񈈊
4820A
񈈋
4820B
񈈌
4820C
񈈍
4820D
񈈎
4820E
񈈏
4820F
80
90
񈈐
48210
񈈑
48211
񈈒
48212
񈈓
48213
񈈔
48214
񈈕
48215
񈈖
48216
񈈗
48217
񈈘
48218
񈈙
48219
񈈚
4821A
񈈛
4821B
񈈜
4821C
񈈝
4821D
񈈞
4821E
񈈟
4821F
90
A0
񈈠
48220
񈈡
48221
񈈢
48222
񈈣
48223
񈈤
48224
񈈥
48225
񈈦
48226
񈈧
48227
񈈨
48228
񈈩
48229
񈈪
4822A
񈈫
4822B
񈈬
4822C
񈈭
4822D
񈈮
4822E
񈈯
4822F
A0
B0
񈈰
48230
񈈱
48231
񈈲
48232
񈈳
48233
񈈴
48234
񈈵
48235
񈈶
48236
񈈷
48237
񈈸
48238
񈈹
48239
񈈺
4823A
񈈻
4823B
񈈼
4823C
񈈽
4823D
񈈾
4823E
񈈿
4823F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]