International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
IANA
UTF-8 UTF-8


Codepage Layout

Currently showing the codepage starting with the bytes F3818C

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󁌀
C1300
󁌁
C1301
󁌂
C1302
󁌃
C1303
󁌄
C1304
󁌅
C1305
󁌆
C1306
󁌇
C1307
󁌈
C1308
󁌉
C1309
󁌊
C130A
󁌋
C130B
󁌌
C130C
󁌍
C130D
󁌎
C130E
󁌏
C130F
80
90
󁌐
C1310
󁌑
C1311
󁌒
C1312
󁌓
C1313
󁌔
C1314
󁌕
C1315
󁌖
C1316
󁌗
C1317
󁌘
C1318
󁌙
C1319
󁌚
C131A
󁌛
C131B
󁌜
C131C
󁌝
C131D
󁌞
C131E
󁌟
C131F
90
A0
󁌠
C1320
󁌡
C1321
󁌢
C1322
󁌣
C1323
󁌤
C1324
󁌥
C1325
󁌦
C1326
󁌧
C1327
󁌨
C1328
󁌩
C1329
󁌪
C132A
󁌫
C132B
󁌬
C132C
󁌭
C132D
󁌮
C132E
󁌯
C132F
A0
B0
󁌰
C1330
󁌱
C1331
󁌲
C1332
󁌳
C1333
󁌴
C1334
󁌵
C1335
󁌶
C1336
󁌷
C1337
󁌸
C1338
󁌹
C1339
󁌺
C133A
󁌻
C133B
󁌼
C133C
󁌽
C133D
󁌾
C133E
󁌿
C133F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]