International Components for Unicode

UTF-8

List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases:
UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8
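
The alias list above can be read as a lookup table from any alias to the internal converter name. The following is a minimal sketch in Python, not ICU's API: `UTF8_ALIASES`, `_ALIAS_TABLE`, and `canonical_name` are hypothetical names, and the match is assumed to be a plain case-insensitive comparison, which may be stricter than ICU's own name matching.

```python
# Aliases copied from the list above; all resolve to the internal name "UTF-8".
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]

# Hypothetical lookup table: lower-cased alias -> internal converter name.
_ALIAS_TABLE = {alias.lower(): "UTF-8" for alias in UTF8_ALIASES}


def canonical_name(alias):
    """Return the internal converter name for an alias, or None if unknown."""
    return _ALIAS_TABLE.get(alias.strip().lower())


print(canonical_name("IBM-1208"))  # UTF-8
print(canonical_name("latin-1"))   # None
```

A real application would normally ask the conversion library to resolve the name rather than keep its own table; the sketch only illustrates the many-to-one relationship shown on this page.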


Codepage Layout

Currently showing the codepage starting with the bytes F3 99 B3

Rows give the high nibble of the final byte and columns the low nibble; each cell is the resulting code point in hexadecimal. Rows 00-70 and C0-F0 are empty.

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80  D9CC0 D9CC1 D9CC2 D9CC3 D9CC4 D9CC5 D9CC6 D9CC7 D9CC8 D9CC9 D9CCA D9CCB D9CCC D9CCD D9CCE D9CCF
90  D9CD0 D9CD1 D9CD2 D9CD3 D9CD4 D9CD5 D9CD6 D9CD7 D9CD8 D9CD9 D9CDA D9CDB D9CDC D9CDD D9CDE D9CDF
A0  D9CE0 D9CE1 D9CE2 D9CE3 D9CE4 D9CE5 D9CE6 D9CE7 D9CE8 D9CE9 D9CEA D9CEB D9CEC D9CED D9CEE D9CEF
B0  D9CF0 D9CF1 D9CF2 D9CF3 D9CF4 D9CF5 D9CF6 D9CF7 D9CF8 D9CF9 D9CFA D9CFB D9CFC D9CFD D9CFE D9CFF
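
The layout above pairs each final byte in the range 80-BF with a code point from D9CC0 to D9CFF. The sketch below reproduces that mapping in Python, once with the built-in UTF-8 decoder and once by assembling the payload bits by hand. The names `LEAD`, `decode_cell`, and `manual_code_point` are hypothetical and chosen only for this illustration.

```python
LEAD = bytes([0xF3, 0x99, 0xB3])  # the three leading bytes shown on this page


def decode_cell(trail):
    """Decode LEAD plus one final byte and return the code point as an int."""
    return ord((LEAD + bytes([trail])).decode("utf-8"))


def manual_code_point(b0, b1, b2, b3):
    """Assemble a 4-byte UTF-8 sequence by hand: 3 + 6 + 6 + 6 payload bits."""
    return (
        ((b0 & 0x07) << 18)
        | ((b1 & 0x3F) << 12)
        | ((b2 & 0x3F) << 6)
        | (b3 & 0x3F)
    )


for trail in (0x80, 0xBF):
    print(f"{trail:02X} -> U+{decode_cell(trail):05X}")
# 80 -> U+D9CC0
# BF -> U+D9CFF
```

Final bytes outside 80-BF are not continuation bytes, so Python's strict decoder raises `UnicodeDecodeError` for them, which matches the empty rows in the layout.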

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
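
Two of the properties above can be checked with a few lines of Python. A UChar in ICU is a 16-bit UTF-16 code unit, so a supplementary character that takes 4 bytes in UTF-8 spans 2 UChars, which is why the per-UChar maximum is 3 rather than 4. The helper names `utf16_units` and `bytes_per_uchar` are hypothetical, and the sketch uses Python's codecs rather than ICU.

```python
def utf16_units(ch):
    """Number of 16-bit UTF-16 code units (ICU UChars) needed for one character."""
    return len(ch.encode("utf-16-le")) // 2


def bytes_per_uchar(ch):
    """UTF-8 byte count divided by UTF-16 code unit count for one character."""
    return len(ch.encode("utf-8")) / utf16_units(ch)


# The substitution bytes EF BF BD are the UTF-8 form of U+FFFD.
print("\ufffd".encode("utf-8"))  # b'\xef\xbf\xbd'

# One sample character for each UTF-8 length: 1, 2, 3, and 4 bytes.
for ch in ("A", "\u00e9", "\u20ac", "\U000D9CC0"):
    print(f"U+{ord(ch):04X}: {bytes_per_uchar(ch)} bytes per UChar")
```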

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
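
The set above says that every code point except the surrogates D800-DFFF is representable. A minimal sketch in Python, assuming Python's strict UTF-8 encoder as a stand-in for the converter, compares a direct range test with the encoder's behavior. Both function names are hypothetical.

```python
def is_representable(code_point):
    """True if the code point is not a surrogate (U+D800 to U+DFFF)."""
    return not (0xD800 <= code_point <= 0xDFFF)


def encodes_as_utf8(code_point):
    """Check the same property by asking Python's strict UTF-8 encoder."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Sample code points on both sides of the surrogate range.
for cp in (0x0041, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0xD9CC0):
    print(f"U+{cp:04X}", is_representable(cp), encodes_as_utf8(cp))
```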