International Components for Unicode

ICU Converter Explorer

UTF-8

List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases:
UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

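Currently showing the part of the codepage whose byte sequences start with the bytes F2 83 A3. The row label (high nibble) plus the column label (low nibble) gives the final byte of the sequence; each filled cell shows the character and its Unicode code point.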

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00
10
20
30
40
50
60
70
80  򃣀     򃣁     򃣂     򃣃     򃣄     򃣅     򃣆     򃣇     򃣈     򃣉     򃣊     򃣋     򃣌     򃣍     򃣎     򃣏
    838C0 838C1 838C2 838C3 838C4 838C5 838C6 838C7 838C8 838C9 838CA 838CB 838CC 838CD 838CE 838CF
90  򃣐     򃣑     򃣒     򃣓     򃣔     򃣕     򃣖     򃣗     򃣘     򃣙     򃣚     򃣛     򃣜     򃣝     򃣞     򃣟
    838D0 838D1 838D2 838D3 838D4 838D5 838D6 838D7 838D8 838D9 838DA 838DB 838DC 838DD 838DE 838DF
A0  򃣠     򃣡     򃣢     򃣣     򃣤     򃣥     򃣦     򃣧     򃣨     򃣩     򃣪     򃣫     򃣬     򃣭     򃣮     򃣯
    838E0 838E1 838E2 838E3 838E4 838E5 838E6 838E7 838E8 838E9 838EA 838EB 838EC 838ED 838EE 838EF
B0  򃣰     򃣱     򃣲     򃣳     򃣴     򃣵     򃣶     򃣷     򃣸     򃣹     򃣺     򃣻     򃣼     򃣽     򃣾     򃣿
    838F0 838F1 838F2 838F3 838F4 838F5 838F6 838F7 838F8 838F9 838FA 838FB 838FC 838FD 838FE 838FF
C0
D0
E0
F0
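
The layout above can be reproduced outside ICU. The following is a minimal sketch using Python's built-in UTF-8 codec rather than the ICU converter itself; the helper names `decode_cell` and `decode_by_hand` are hypothetical and chosen only for illustration. It shows why only final bytes 80 through BF yield a character after the lead bytes F2 83 A3, and how the code point is assembled from the payload bits of each byte.

```python
# Lead bytes taken from the page: the table shows sequences F2 83 A3 xx.
LEAD = bytes.fromhex("F283A3")


def decode_cell(last_byte):
    """Return the code point (uppercase hex) for LEAD + last_byte,
    or None if the four bytes are not well-formed UTF-8."""
    try:
        text = (LEAD + bytes([last_byte])).decode("utf-8")
    except UnicodeDecodeError:
        return None
    return format(ord(text), "X")


def decode_by_hand(b0, b1, b2, b3):
    """Assemble the payload bits of a 4-byte sequence: 3 + 6 + 6 + 6 bits."""
    return (
        ((b0 & 0x07) << 18)
        | ((b1 & 0x3F) << 12)
        | ((b2 & 0x3F) << 6)
        | (b3 & 0x3F)
    )


if __name__ == "__main__":
    # Print one line per table row; rows without valid final bytes stay empty.
    for row in range(0x00, 0x100, 0x10):
        cells = [decode_cell(row + col) or "" for col in range(16)]
        line = " ".join(cell.ljust(5) for cell in cells).rstrip()
        print(f"{row:02X}  {line}".rstrip())
```

Only the 64 continuation-byte values (80 to BF) produce a character, which matches the four filled rows of the table.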

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
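
The byte counts above are per UChar, which in ICU is a 16-bit UTF-16 code unit, not a whole code point. The sketch below uses Python's standard codecs (not ICU) to illustrate how a maximum of 3 is consistent with 4-byte UTF-8 sequences: a supplementary character takes 4 bytes but also occupies 2 UTF-16 code units. The helper name `utf8_bytes_and_utf16_units` is hypothetical.

```python
# The substitution character bytes listed on the page.
SUBSTITUTION = b"\xEF\xBF\xBD"


def utf8_bytes_and_utf16_units(ch):
    """Return (UTF-8 byte count, UTF-16 code unit count) for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len, utf16_units


if __name__ == "__main__":
    # The substitution bytes are the UTF-8 form of U+FFFD REPLACEMENT CHARACTER.
    print(f"U+{ord(SUBSTITUTION.decode('utf-8')):04X}")
    for ch in ["A", "\u00E9", "\u20AC", "\U000838C0"]:
        n_bytes, n_units = utf8_bytes_and_utf16_units(ch)
        print(f"U+{ord(ch):04X}: {n_bytes} bytes, {n_units} UTF-16 unit(s)")
```

A BMP character such as U+20AC needs 3 bytes for 1 code unit, while U+838C0 needs 4 bytes for 2 code units, so no single code unit accounts for more than 3 bytes.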

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
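
The set above says that every code point except the surrogates U+D800 to U+DFFF is representable. A small check of that claim, again a sketch with Python's strict UTF-8 codec standing in for the ICU converter (the function name `is_representable` is hypothetical):

```python
def is_representable(code_point):
    """True if the code point can be encoded by a strict UTF-8 encoder."""
    try:
        chr(code_point).encode("utf-8")
    except UnicodeEncodeError:
        # Python's strict encoder rejects lone surrogates.
        return False
    return True


if __name__ == "__main__":
    for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF):
        print(f"U+{cp:04X}: {is_representable(cp)}")
```

The boundaries behave as the set notation predicts: U+D7FF and U+E000 encode, while U+D800 and U+DFFF are rejected.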