International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F286A0

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򆠀
86800
򆠁
86801
򆠂
86802
򆠃
86803
򆠄
86804
򆠅
86805
򆠆
86806
򆠇
86807
򆠈
86808
򆠉
86809
򆠊
8680A
򆠋
8680B
򆠌
8680C
򆠍
8680D
򆠎
8680E
򆠏
8680F
80
90
򆠐
86810
򆠑
86811
򆠒
86812
򆠓
86813
򆠔
86814
򆠕
86815
򆠖
86816
򆠗
86817
򆠘
86818
򆠙
86819
򆠚
8681A
򆠛
8681B
򆠜
8681C
򆠝
8681D
򆠞
8681E
򆠟
8681F
90
A0
򆠠
86820
򆠡
86821
򆠢
86822
򆠣
86823
򆠤
86824
򆠥
86825
򆠦
86826
򆠧
86827
򆠨
86828
򆠩
86829
򆠪
8682A
򆠫
8682B
򆠬
8682C
򆠭
8682D
򆠮
8682E
򆠯
8682F
A0
B0
򆠰
86830
򆠱
86831
򆠲
86832
򆠳
86833
򆠴
86834
򆠵
86835
򆠶
86836
򆠷
86837
򆠸
86838
򆠹
86839
򆠺
8683A
򆠻
8683B
򆠼
8683C
򆠽
8683D
򆠾
8683E
򆠿
8683F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]