International Components for Unicode

UTF-8

List of Converter Aliases

Internal Converter Name   IANA
UTF-8                     UTF-8


Codepage Layout

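Currently showing the code points whose UTF-8 encoding starts with the bytes F1 A8 80. The row and column labels together give the fourth (final) byte, and each cell shows the character with its code point in hexadecimal.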

    00       01       02       03       04       05       06       07       08       09       0A       0B       0C       0D       0E       0F
00
10
20
30
40
50
60
70
80  񨀀 68000  񨀁 68001  񨀂 68002  񨀃 68003  񨀄 68004  񨀅 68005  񨀆 68006  񨀇 68007  񨀈 68008  񨀉 68009  񨀊 6800A  񨀋 6800B  񨀌 6800C  񨀍 6800D  񨀎 6800E  񨀏 6800F
90  񨀐 68010  񨀑 68011  񨀒 68012  񨀓 68013  񨀔 68014  񨀕 68015  񨀖 68016  񨀗 68017  񨀘 68018  񨀙 68019  񨀚 6801A  񨀛 6801B  񨀜 6801C  񨀝 6801D  񨀞 6801E  񨀟 6801F
A0  񨀠 68020  񨀡 68021  񨀢 68022  񨀣 68023  񨀤 68024  񨀥 68025  񨀦 68026  񨀧 68027  񨀨 68028  񨀩 68029  񨀪 6802A  񨀫 6802B  񨀬 6802C  񨀭 6802D  񨀮 6802E  񨀯 6802F
B0  񨀰 68030  񨀱 68031  񨀲 68032  񨀳 68033  񨀴 68034  񨀵 68035  񨀶 68036  񨀷 68037  񨀸 68038  񨀹 68039  񨀺 6803A  񨀻 6803B  񨀼 6803C  񨀽 6803D  񨀾 6803E  񨀿 6803F
C0
D0
E0
F0
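
The mapping in the layout above can be checked with a short sketch. It uses Python's built-in UTF-8 codec rather than ICU, and the names `PREFIX`, `code_point_for`, and `manual_decode` are illustrative, not part of the Converter Explorer. The hand-written decoder assumes a well-formed 4-byte sequence and does no validation.

```python
# Lead bytes shown on the page; the table's row and column supply the final byte.
PREFIX = bytes.fromhex("F1A880")


def code_point_for(trail_byte: int) -> int:
    """Decode PREFIX + trail_byte as UTF-8 and return the code point."""
    if not 0x80 <= trail_byte <= 0xBF:
        raise ValueError("UTF-8 continuation bytes must be in 0x80..0xBF")
    return ord((PREFIX + bytes([trail_byte])).decode("utf-8"))


def manual_decode(seq: bytes) -> int:
    """Assemble a 4-byte UTF-8 sequence by hand: 3 + 6 + 6 + 6 payload bits.

    Assumes seq is a well-formed 4-byte sequence; no validation is done.
    """
    b0, b1, b2, b3 = seq
    return ((b0 & 0x07) << 18) | ((b1 & 0x3F) << 12) | ((b2 & 0x3F) << 6) | (b3 & 0x3F)


first = code_point_for(0x80)
last = code_point_for(0xBF)
print(f"{first:X}..{last:X}")  # 68000..6803F
```

Only final bytes 80 through BF are valid continuation bytes, which is why rows 00 to 70 and C0 to F0 are empty.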

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
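
The byte counts above are per UChar, that is, per 16-bit UTF-16 code unit, which is why the maximum is 3 rather than 4: a supplementary character takes 4 UTF-8 bytes but occupies two UChars. A minimal sketch of that arithmetic follows. It assumes Python's built-in codecs as a stand-in and calls no ICU API; the helper names are illustrative.

```python
# Assumption: Python's built-in codecs stand in for ICU here; no ICU API is called.
SUBSTITUTION_BYTES = b"\xEF\xBF\xBD"  # value reported in the table above


def utf16_units(ch: str) -> int:
    """Number of 16-bit UTF-16 code units (UChars) needed for one character."""
    return len(ch.encode("utf-16-le")) // 2


def utf8_bytes_per_uchar(ch: str) -> float:
    """UTF-8 byte count divided by UTF-16 code unit count for one character."""
    return len(ch.encode("utf-8")) / utf16_units(ch)


# ASCII, Latin-1 range, BMP, and a supplementary code point from the layout table.
samples = ["A", "\u00E9", "\u20AC", "\U00068000"]
ratios = [utf8_bytes_per_uchar(ch) for ch in samples]

print(ratios)                    # [1.0, 2.0, 3.0, 2.0]
print(min(ratios), max(ratios))  # 1.0 3.0
print(f"U+{ord(SUBSTITUTION_BYTES.decode('utf-8')):04X}")  # U+FFFD
```

The substitution bytes decode to U+FFFD, the Unicode replacement character.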

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
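
The set above excludes only the surrogate range U+D800 to U+DFFF. As a rough check, the sketch below probes Python's strict UTF-8 codec, which is assumed here to behave like the converter in this respect; `is_representable` is a hypothetical helper, not an ICU function.

```python
# Code points excluded by the set [^\uD800-\uDFFF].
SURROGATES = range(0xD800, 0xE000)


def is_representable(code_point: int) -> bool:
    """Return True if Python's strict UTF-8 codec can encode the code point."""
    if not 0 <= code_point <= 0x10FFFF:
        raise ValueError("not a Unicode code point")
    try:
        chr(code_point).encode("utf-8")
    except UnicodeEncodeError:
        return False
    return True


print(is_representable(0xD7FF), is_representable(0xD800), is_representable(0xE000))
# True False True

# Every code point except the 2048 surrogates is representable.
print(0x110000 - len(SURROGATES))  # 1112064
```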