International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
IANA
UTF-8 UTF-8


Codepage Layout

Currently showing the codepage starting with the bytes F48B80

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
􋀀
10B000
􋀁
10B001
􋀂
10B002
􋀃
10B003
􋀄
10B004
􋀅
10B005
􋀆
10B006
􋀇
10B007
􋀈
10B008
􋀉
10B009
􋀊
10B00A
􋀋
10B00B
􋀌
10B00C
􋀍
10B00D
􋀎
10B00E
􋀏
10B00F
80
90
􋀐
10B010
􋀑
10B011
􋀒
10B012
􋀓
10B013
􋀔
10B014
􋀕
10B015
􋀖
10B016
􋀗
10B017
􋀘
10B018
􋀙
10B019
􋀚
10B01A
􋀛
10B01B
􋀜
10B01C
􋀝
10B01D
􋀞
10B01E
􋀟
10B01F
90
A0
􋀠
10B020
􋀡
10B021
􋀢
10B022
􋀣
10B023
􋀤
10B024
􋀥
10B025
􋀦
10B026
􋀧
10B027
􋀨
10B028
􋀩
10B029
􋀪
10B02A
􋀫
10B02B
􋀬
10B02C
􋀭
10B02D
􋀮
10B02E
􋀯
10B02F
A0
B0
􋀰
10B030
􋀱
10B031
􋀲
10B032
􋀳
10B033
􋀴
10B034
􋀵
10B035
􋀶
10B036
􋀷
10B037
􋀸
10B038
􋀹
10B039
􋀺
10B03A
􋀻
10B03B
􋀼
10B03C
􋀽
10B03D
􋀾
10B03E
􋀿
10B03F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]