International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
IANA
UTF-8 UTF-8


Codepage Layout

Currently showing the codepage starting with the bytes F3A8A4

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󨤀
E8900
󨤁
E8901
󨤂
E8902
󨤃
E8903
󨤄
E8904
󨤅
E8905
󨤆
E8906
󨤇
E8907
󨤈
E8908
󨤉
E8909
󨤊
E890A
󨤋
E890B
󨤌
E890C
󨤍
E890D
󨤎
E890E
󨤏
E890F
80
90
󨤐
E8910
󨤑
E8911
󨤒
E8912
󨤓
E8913
󨤔
E8914
󨤕
E8915
󨤖
E8916
󨤗
E8917
󨤘
E8918
󨤙
E8919
󨤚
E891A
󨤛
E891B
󨤜
E891C
󨤝
E891D
󨤞
E891E
󨤟
E891F
90
A0
󨤠
E8920
󨤡
E8921
󨤢
E8922
󨤣
E8923
󨤤
E8924
󨤥
E8925
󨤦
E8926
󨤧
E8927
󨤨
E8928
󨤩
E8929
󨤪
E892A
󨤫
E892B
󨤬
E892C
󨤭
E892D
󨤮
E892E
󨤯
E892F
A0
B0
󨤰
E8930
󨤱
E8931
󨤲
E8932
󨤳
E8933
󨤴
E8934
󨤵
E8935
󨤶
E8936
󨤷
E8937
󨤸
E8938
󨤹
E8939
󨤺
E893A
󨤻
E893B
󨤼
E893C
󨤽
E893D
󨤾
E893E
󨤿
E893F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]