International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
IANA
UTF-8 UTF-8


Codepage Layout

Currently showing the codepage starting with the bytes F18E90

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
񎐀
4E400
񎐁
4E401
񎐂
4E402
񎐃
4E403
񎐄
4E404
񎐅
4E405
񎐆
4E406
񎐇
4E407
񎐈
4E408
񎐉
4E409
񎐊
4E40A
񎐋
4E40B
񎐌
4E40C
񎐍
4E40D
񎐎
4E40E
񎐏
4E40F
80
90
񎐐
4E410
񎐑
4E411
񎐒
4E412
񎐓
4E413
񎐔
4E414
񎐕
4E415
񎐖
4E416
񎐗
4E417
񎐘
4E418
񎐙
4E419
񎐚
4E41A
񎐛
4E41B
񎐜
4E41C
񎐝
4E41D
񎐞
4E41E
񎐟
4E41F
90
A0
񎐠
4E420
񎐡
4E421
񎐢
4E422
񎐣
4E423
񎐤
4E424
񎐥
4E425
񎐦
4E426
񎐧
4E427
񎐨
4E428
񎐩
4E429
񎐪
4E42A
񎐫
4E42B
񎐬
4E42C
񎐭
4E42D
񎐮
4E42E
񎐯
4E42F
A0
B0
񎐰
4E430
񎐱
4E431
񎐲
4E432
񎐳
4E433
񎐴
4E434
񎐵
4E435
񎐶
4E436
񎐷
4E437
񎐸
4E438
񎐹
4E439
񎐺
4E43A
񎐻
4E43B
񎐼
4E43C
񎐽
4E43D
񎐾
4E43E
񎐿
4E43F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]