International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name    IANA
UTF-8                      UTF-8


Codepage Layout

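Currently showing the codepage for byte sequences that start with the bytes F2 84 9A. Row and column labels combine to give the final byte; each cell shows the character and its Unicode code point in hexadecimal.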

    00       01       02       03       04       05       06       07       08       09       0A       0B       0C       0D       0E       0F
00
10
20
30
40
50
60
70
80  򄚀 84680  򄚁 84681  򄚂 84682  򄚃 84683  򄚄 84684  򄚅 84685  򄚆 84686  򄚇 84687  򄚈 84688  򄚉 84689  򄚊 8468A  򄚋 8468B  򄚌 8468C  򄚍 8468D  򄚎 8468E  򄚏 8468F
90  򄚐 84690  򄚑 84691  򄚒 84692  򄚓 84693  򄚔 84694  򄚕 84695  򄚖 84696  򄚗 84697  򄚘 84698  򄚙 84699  򄚚 8469A  򄚛 8469B  򄚜 8469C  򄚝 8469D  򄚞 8469E  򄚟 8469F
A0  򄚠 846A0  򄚡 846A1  򄚢 846A2  򄚣 846A3  򄚤 846A4  򄚥 846A5  򄚦 846A6  򄚧 846A7  򄚨 846A8  򄚩 846A9  򄚪 846AA  򄚫 846AB  򄚬 846AC  򄚭 846AD  򄚮 846AE  򄚯 846AF
B0  򄚰 846B0  򄚱 846B1  򄚲 846B2  򄚳 846B3  򄚴 846B4  򄚵 846B5  򄚶 846B6  򄚷 846B7  򄚸 846B8  򄚹 846B9  򄚺 846BA  򄚻 846BB  򄚼 846BC  򄚽 846BD  򄚾 846BE  򄚿 846BF
C0
D0
E0
F0
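
The layout above follows from the standard UTF-8 bit packing: a 4-byte sequence carries 3 payload bits in the lead byte and 6 in each trail byte, which is why only final bytes 80 through BF are populated. A minimal Python sketch of that arithmetic follows; the helper name `decode_utf8_4byte` is hypothetical and is not part of ICU, and the sketch checks only the bit patterns, not overlong forms or range limits.

```python
def decode_utf8_4byte(b: bytes) -> int:
    """Decode one 4-byte UTF-8 sequence to a code point (bit-pattern check only)."""
    if len(b) != 4 or (b[0] & 0xF8) != 0xF0:
        raise ValueError("lead byte is not of the form 11110xxx")
    if any((x & 0xC0) != 0x80 for x in b[1:]):
        raise ValueError("trail bytes must be of the form 10xxxxxx")
    return (
        ((b[0] & 0x07) << 18)    # 3 payload bits from the lead byte
        | ((b[1] & 0x3F) << 12)  # 6 payload bits from each trail byte
        | ((b[2] & 0x3F) << 6)
        | (b[3] & 0x3F)
    )


prefix = bytes([0xF2, 0x84, 0x9A])  # the prefix shown in the layout
first = decode_utf8_4byte(prefix + b"\x80")
last = decode_utf8_4byte(prefix + b"\xBF")
print(f"U+{first:X}..U+{last:X}")

# Cross-check against Python's built-in UTF-8 decoder.
assert (prefix + b"\x80").decode("utf-8") == chr(first)
```

Final bytes outside 80 through BF do not match the `10xxxxxx` trail pattern, so the helper rejects them, matching the empty rows in the grid.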

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
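
The bytes-per-UChar figures are easier to read once a UChar is taken as one 16-bit UTF-16 code unit: a supplementary character needs 4 UTF-8 bytes but also 2 UChars, so it averages 2 bytes per UChar. The sketch below illustrates this with Python's own codecs rather than ICU; the function name `utf8_bytes_per_utf16_unit` and the sample characters are assumptions chosen for illustration, and a handful of samples does not prove the bounds.

```python
def utf8_bytes_per_utf16_unit(ch: str) -> float:
    """Ratio of UTF-8 bytes to UTF-16 code units for a single character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2  # 2 bytes per code unit
    return utf8_len / utf16_units


# One sample from each UTF-8 length class (1, 2, 3 and 4 bytes).
samples = ["A", "\u00E9", "\u20AC", "\U00084680"]
ratios = {f"U+{ord(c):04X}": utf8_bytes_per_utf16_unit(c) for c in samples}
for name, ratio in ratios.items():
    print(name, ratio)

# The listed substitution bytes are the UTF-8 encoding of U+FFFD.
substitution = b"\xEF\xBF\xBD"
assert substitution.decode("utf-8") == "\uFFFD"
```

Among these samples the ratio ranges from 1 (ASCII) to 3 (a BMP character such as U+20AC), consistent with the minimum and maximum listed above.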

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
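
The set pattern above excludes only the surrogate range U+D800 through U+DFFF; every other code point up to U+10FFFF is representable. As a rough illustration, the Python sketch below expresses that set as a predicate and compares it with the behavior of Python's strict UTF-8 encoder at the range boundaries. The function names are hypothetical, and Python's codec stands in for ICU here, which is an assumption rather than a statement about ICU's implementation.

```python
def in_representable_set(cp: int) -> bool:
    """True if cp is a code point in [^\\uD800-\\uDFFF]."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def python_can_encode(cp: int) -> bool:
    """True if Python's strict UTF-8 encoder accepts the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Check the boundaries of the excluded surrogate range and the top of the code space.
boundary_points = [0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF]
for cp in boundary_points:
    assert in_representable_set(cp) == python_can_encode(cp)
    print(f"U+{cp:04X}", in_representable_set(cp))
```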