International Components for Unicode

UTF-8


List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases:
UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8
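
As a rough illustration of how the alias list above can be used, the sketch below maps every listed alias to the internal converter name. The table, the `normalize` rule (lowercase, drop `-`, `_` and spaces) and the `canonical_name` helper are all hypothetical; ICU's own alias matching may follow different rules.

```python
from typing import Optional

# Hypothetical alias table copied from the list above; not ICU's data structure.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name: str) -> str:
    """Lowercase and drop '-', '_' and spaces (an assumed, simplified rule)."""
    return "".join(ch for ch in name.lower() if ch not in "-_ ")


# Every alias resolves to the same internal converter name.
ALIAS_TO_CANONICAL = {normalize(alias): "UTF-8" for alias in UTF8_ALIASES}


def canonical_name(alias: str) -> Optional[str]:
    """Return the internal converter name for an alias, or None if unknown."""
    return ALIAS_TO_CANONICAL.get(normalize(alias))


if __name__ == "__main__":
    for probe in ("IBM-1208", "x-utf_8j", "windows-65001", "latin1"):
        print(probe, "->", canonical_name(probe))
```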


Codepage Layout

Currently showing the codepage starting with the bytes F3808E

       00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70  (empty)
80     C0380 C0381 C0382 C0383 C0384 C0385 C0386 C0387 C0388 C0389 C038A C038B C038C C038D C038E C038F
90     C0390 C0391 C0392 C0393 C0394 C0395 C0396 C0397 C0398 C0399 C039A C039B C039C C039D C039E C039F
A0     C03A0 C03A1 C03A2 C03A3 C03A4 C03A5 C03A6 C03A7 C03A8 C03A9 C03AA C03AB C03AC C03AD C03AE C03AF
B0     C03B0 C03B1 C03B2 C03B3 C03B4 C03B5 C03B6 C03B7 C03B8 C03B9 C03BA C03BB C03BC C03BD C03BE C03BF
C0-F0  (empty)
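
The layout above can be reproduced with a short sketch: append each final byte from 80 to BF to the lead bytes F3 80 8E and decode the result as UTF-8. The helper names `decode_cell` and `manual_decode` are illustrative only, and the code uses Python's built-in UTF-8 codec rather than ICU.

```python
LEAD = bytes.fromhex("F3808E")  # the three lead bytes named above


def decode_cell(final_byte: int) -> int:
    """Decode LEAD + final_byte with Python's UTF-8 codec; return the code point."""
    return ord((LEAD + bytes([final_byte])).decode("utf-8"))


def manual_decode(seq: bytes) -> int:
    """Decode a four-byte UTF-8 sequence by hand: 3 + 6 + 6 + 6 payload bits."""
    b0, b1, b2, b3 = seq
    return (
        ((b0 & 0x07) << 18)
        | ((b1 & 0x3F) << 12)
        | ((b2 & 0x3F) << 6)
        | (b3 & 0x3F)
    )


if __name__ == "__main__":
    # Only final bytes 80..BF are valid continuation bytes, so only
    # rows 80, 90, A0 and B0 of the grid are populated.
    for row in range(0x80, 0xC0, 0x10):
        cells = [f"{decode_cell(row + col):05X}" for col in range(16)]
        print(f"{row:02X}", " ".join(cells))
```

Final bytes outside 80 to BF do not form a valid four-byte sequence with these lead bytes, which is consistent with the empty rows 00 to 70 and C0 to F0.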

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
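
The byte counts above are per UChar, which is taken here to mean one UTF-16 code unit. Under that assumption, a character outside the Basic Multilingual Plane takes four UTF-8 bytes but two UChars, so the per-UChar maximum stays at 3. The following sketch checks this with Python's standard codecs; `bytes_per_uchar` is a hypothetical helper, not an ICU function.

```python
def bytes_per_uchar(ch: str) -> float:
    """UTF-8 byte count divided by UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-be")) // 2
    return utf8_len / utf16_units


# The substitution character bytes listed above.
SUBSTITUTION = b"\xEF\xBF\xBD"

if __name__ == "__main__":
    for ch in ("A", "\u00E9", "\uFFFF", "\U000C0380"):
        print(f"U+{ord(ch):04X}: {bytes_per_uchar(ch)} bytes per UChar")
    # The substitution bytes are the UTF-8 form of U+FFFD.
    print(f"U+{ord(SUBSTITUTION.decode('utf-8')):04X}")
```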

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
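
The set above excludes only the surrogate range U+D800 to U+DFFF. A minimal sketch of that membership test follows, with a cross-check against Python's strict UTF-8 encoder; both function names are illustrative, and the upper bound U+10FFFF is the Unicode code space limit rather than something stated in the pattern itself.

```python
def is_representable(code_point: int) -> bool:
    """True if the code point lies in [^\\uD800-\\uDFFF] within U+0000..U+10FFFF."""
    if not 0 <= code_point <= 0x10FFFF:
        return False
    return not 0xD800 <= code_point <= 0xDFFF


def encodes_in_python(code_point: int) -> bool:
    """Cross-check using Python's strict UTF-8 encoder, which rejects surrogates."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


if __name__ == "__main__":
    for cp in (0x41, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0xC0380, 0x10FFFF):
        print(f"U+{cp:04X}", is_representable(cp), encodes_in_python(cp))
```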