International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes E4AF

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
4BC0
4BC1
4BC2
4BC3
4BC4
4BC5
4BC6
4BC7
4BC8
4BC9
4BCA
4BCB
4BCC
4BCD
4BCE
4BCF
80
90
4BD0
4BD1
4BD2
4BD3
4BD4
4BD5
4BD6
4BD7
4BD8
4BD9
4BDA
4BDB
4BDC
4BDD
4BDE
4BDF
90
A0
4BE0
4BE1
4BE2
4BE3
4BE4
4BE5
4BE6
4BE7
4BE8
4BE9
4BEA
4BEB
4BEC
4BED
4BEE
4BEF
A0
B0
4BF0
4BF1
4BF2
4BF3
4BF4
4BF5
4BF6
4BF7
4BF8
4BF9
4BFA
4BFB
4BFC
4BFD
4BFE
䯿
4BFF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]