International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F384B6

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󄶀
C4D80
󄶁
C4D81
󄶂
C4D82
󄶃
C4D83
󄶄
C4D84
󄶅
C4D85
󄶆
C4D86
󄶇
C4D87
󄶈
C4D88
󄶉
C4D89
󄶊
C4D8A
󄶋
C4D8B
󄶌
C4D8C
󄶍
C4D8D
󄶎
C4D8E
󄶏
C4D8F
80
90
󄶐
C4D90
󄶑
C4D91
󄶒
C4D92
󄶓
C4D93
󄶔
C4D94
󄶕
C4D95
󄶖
C4D96
󄶗
C4D97
󄶘
C4D98
󄶙
C4D99
󄶚
C4D9A
󄶛
C4D9B
󄶜
C4D9C
󄶝
C4D9D
󄶞
C4D9E
󄶟
C4D9F
90
A0
󄶠
C4DA0
󄶡
C4DA1
󄶢
C4DA2
󄶣
C4DA3
󄶤
C4DA4
󄶥
C4DA5
󄶦
C4DA6
󄶧
C4DA7
󄶨
C4DA8
󄶩
C4DA9
󄶪
C4DAA
󄶫
C4DAB
󄶬
C4DAC
󄶭
C4DAD
󄶮
C4DAE
󄶯
C4DAF
A0
B0
󄶰
C4DB0
󄶱
C4DB1
󄶲
C4DB2
󄶳
C4DB3
󄶴
C4DB4
󄶵
C4DB5
󄶶
C4DB6
󄶷
C4DB7
󄶸
C4DB8
󄶹
C4DB9
󄶺
C4DBA
󄶻
C4DBB
󄶼
C4DBC
󄶽
C4DBD
󄶾
C4DBE
󄶿
C4DBF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]