International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
IANA
UTF-8 UTF-8


Codepage Layout

Currently showing the codepage starting with the bytes F1A890

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
񨐀
68400
񨐁
68401
񨐂
68402
񨐃
68403
񨐄
68404
񨐅
68405
񨐆
68406
񨐇
68407
񨐈
68408
񨐉
68409
񨐊
6840A
񨐋
6840B
񨐌
6840C
񨐍
6840D
񨐎
6840E
񨐏
6840F
80
90
񨐐
68410
񨐑
68411
񨐒
68412
񨐓
68413
񨐔
68414
񨐕
68415
񨐖
68416
񨐗
68417
񨐘
68418
񨐙
68419
񨐚
6841A
񨐛
6841B
񨐜
6841C
񨐝
6841D
񨐞
6841E
񨐟
6841F
90
A0
񨐠
68420
񨐡
68421
񨐢
68422
񨐣
68423
񨐤
68424
񨐥
68425
񨐦
68426
񨐧
68427
񨐨
68428
񨐩
68429
񨐪
6842A
񨐫
6842B
񨐬
6842C
񨐭
6842D
񨐮
6842E
񨐯
6842F
A0
B0
񨐰
68430
񨐱
68431
񨐲
68432
񨐳
68433
񨐴
68434
񨐵
68435
񨐶
68436
񨐷
68437
񨐸
68438
񨐹
68439
񨐺
6843A
񨐻
6843B
񨐼
6843C
񨐽
6843D
񨐾
6843E
񨐿
6843F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]