International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
IANA
UTF-8 UTF-8


Codepage Layout

Currently showing the codepage starting with the bytes F1B1A4

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
񱤀
71900
񱤁
71901
񱤂
71902
񱤃
71903
񱤄
71904
񱤅
71905
񱤆
71906
񱤇
71907
񱤈
71908
񱤉
71909
񱤊
7190A
񱤋
7190B
񱤌
7190C
񱤍
7190D
񱤎
7190E
񱤏
7190F
80
90
񱤐
71910
񱤑
71911
񱤒
71912
񱤓
71913
񱤔
71914
񱤕
71915
񱤖
71916
񱤗
71917
񱤘
71918
񱤙
71919
񱤚
7191A
񱤛
7191B
񱤜
7191C
񱤝
7191D
񱤞
7191E
񱤟
7191F
90
A0
񱤠
71920
񱤡
71921
񱤢
71922
񱤣
71923
񱤤
71924
񱤥
71925
񱤦
71926
񱤧
71927
񱤨
71928
񱤩
71929
񱤪
7192A
񱤫
7192B
񱤬
7192C
񱤭
7192D
񱤮
7192E
񱤯
7192F
A0
B0
񱤰
71930
񱤱
71931
񱤲
71932
񱤳
71933
񱤴
71934
񱤵
71935
񱤶
71936
񱤷
71937
񱤸
71938
񱤹
71939
񱤺
7193A
񱤻
7193B
񱤼
7193C
񱤽
7193D
񱤾
7193E
񱤿
7193F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]