International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
IANA
UTF-8 UTF-8


Codepage Layout

Currently showing the codepage starting with the bytes F38391

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󃑀
C3440
󃑁
C3441
󃑂
C3442
󃑃
C3443
󃑄
C3444
󃑅
C3445
󃑆
C3446
󃑇
C3447
󃑈
C3448
󃑉
C3449
󃑊
C344A
󃑋
C344B
󃑌
C344C
󃑍
C344D
󃑎
C344E
󃑏
C344F
80
90
󃑐
C3450
󃑑
C3451
󃑒
C3452
󃑓
C3453
󃑔
C3454
󃑕
C3455
󃑖
C3456
󃑗
C3457
󃑘
C3458
󃑙
C3459
󃑚
C345A
󃑛
C345B
󃑜
C345C
󃑝
C345D
󃑞
C345E
󃑟
C345F
90
A0
󃑠
C3460
󃑡
C3461
󃑢
C3462
󃑣
C3463
󃑤
C3464
󃑥
C3465
󃑦
C3466
󃑧
C3467
󃑨
C3468
󃑩
C3469
󃑪
C346A
󃑫
C346B
󃑬
C346C
󃑭
C346D
󃑮
C346E
󃑯
C346F
A0
B0
󃑰
C3470
󃑱
C3471
󃑲
C3472
󃑳
C3473
󃑴
C3474
󃑵
C3475
󃑶
C3476
󃑷
C3477
󃑸
C3478
󃑹
C3479
󃑺
C347A
󃑻
C347B
󃑼
C347C
󃑽
C347D
󃑾
C347E
󃑿
C347F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]