International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
IANA
UTF-8 UTF-8


Codepage Layout

Currently showing the codepage starting with the bytes F1898D

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
񉍀
49340
񉍁
49341
񉍂
49342
񉍃
49343
񉍄
49344
񉍅
49345
񉍆
49346
񉍇
49347
񉍈
49348
񉍉
49349
񉍊
4934A
񉍋
4934B
񉍌
4934C
񉍍
4934D
񉍎
4934E
񉍏
4934F
80
90
񉍐
49350
񉍑
49351
񉍒
49352
񉍓
49353
񉍔
49354
񉍕
49355
񉍖
49356
񉍗
49357
񉍘
49358
񉍙
49359
񉍚
4935A
񉍛
4935B
񉍜
4935C
񉍝
4935D
񉍞
4935E
񉍟
4935F
90
A0
񉍠
49360
񉍡
49361
񉍢
49362
񉍣
49363
񉍤
49364
񉍥
49365
񉍦
49366
񉍧
49367
񉍨
49368
񉍩
49369
񉍪
4936A
񉍫
4936B
񉍬
4936C
񉍭
4936D
񉍮
4936E
񉍯
4936F
A0
B0
񉍰
49370
񉍱
49371
񉍲
49372
񉍳
49373
񉍴
49374
񉍵
49375
񉍶
49376
񉍷
49377
񉍸
49378
񉍹
49379
񉍺
4937A
񉍻
4937B
񉍼
4937C
񉍽
4937D
񉍾
4937E
񉍿
4937F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]