International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3B3A0

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󳠀
F3800
󳠁
F3801
󳠂
F3802
󳠃
F3803
󳠄
F3804
󳠅
F3805
󳠆
F3806
󳠇
F3807
󳠈
F3808
󳠉
F3809
󳠊
F380A
󳠋
F380B
󳠌
F380C
󳠍
F380D
󳠎
F380E
󳠏
F380F
80
90
󳠐
F3810
󳠑
F3811
󳠒
F3812
󳠓
F3813
󳠔
F3814
󳠕
F3815
󳠖
F3816
󳠗
F3817
󳠘
F3818
󳠙
F3819
󳠚
F381A
󳠛
F381B
󳠜
F381C
󳠝
F381D
󳠞
F381E
󳠟
F381F
90
A0
󳠠
F3820
󳠡
F3821
󳠢
F3822
󳠣
F3823
󳠤
F3824
󳠥
F3825
󳠦
F3826
󳠧
F3827
󳠨
F3828
󳠩
F3829
󳠪
F382A
󳠫
F382B
󳠬
F382C
󳠭
F382D
󳠮
F382E
󳠯
F382F
A0
B0
󳠰
F3830
󳠱
F3831
󳠲
F3832
󳠳
F3833
󳠴
F3834
󳠵
F3835
󳠶
F3836
󳠷
F3837
󳠸
F3838
󳠹
F3839
󳠺
F383A
󳠻
F383B
󳠼
F383C
󳠽
F383D
󳠾
F383E
󳠿
F383F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]