International Components for Unicode


UTF-8


List of Converter Aliases
Internal Converter Name    IANA
UTF-8                      UTF-8


Codepage Layout

Currently showing the code page for byte sequences starting with the bytes F1 8E A6.

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00
10
20
30
40
50
60
70
80  񎦀     񎦁     񎦂     񎦃     񎦄     񎦅     񎦆     񎦇     񎦈     񎦉     񎦊     񎦋     񎦌     񎦍     񎦎     񎦏
    4E980 4E981 4E982 4E983 4E984 4E985 4E986 4E987 4E988 4E989 4E98A 4E98B 4E98C 4E98D 4E98E 4E98F
90  񎦐     񎦑     񎦒     񎦓     񎦔     񎦕     񎦖     񎦗     񎦘     񎦙     񎦚     񎦛     񎦜     񎦝     񎦞     񎦟
    4E990 4E991 4E992 4E993 4E994 4E995 4E996 4E997 4E998 4E999 4E99A 4E99B 4E99C 4E99D 4E99E 4E99F
A0  񎦠     񎦡     񎦢     񎦣     񎦤     񎦥     񎦦     񎦧     񎦨     񎦩     񎦪     񎦫     񎦬     񎦭     񎦮     񎦯
    4E9A0 4E9A1 4E9A2 4E9A3 4E9A4 4E9A5 4E9A6 4E9A7 4E9A8 4E9A9 4E9AA 4E9AB 4E9AC 4E9AD 4E9AE 4E9AF
B0  񎦰     񎦱     񎦲     񎦳     񎦴     񎦵     񎦶     񎦷     񎦸     񎦹     񎦺     񎦻     񎦼     񎦽     񎦾     񎦿
    4E9B0 4E9B1 4E9B2 4E9B3 4E9B4 4E9B5 4E9B6 4E9B7 4E9B8 4E9B9 4E9BA 4E9BB 4E9BC 4E9BD 4E9BE 4E9BF
C0
D0
E0
F0
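
The grid can be cross-checked by hand: a 4-byte UTF-8 sequence carries 3 payload bits in the lead byte and 6 in each trail byte. The following is a minimal Python sketch of that arithmetic, not ICU code; the helper name decode_utf8_4byte is hypothetical, and it only checks the shape of the sequence (it does not reject overlong forms or values above 10FFFF).

```python
def decode_utf8_4byte(b: bytes) -> int:
    """Decode one 4-byte UTF-8 sequence into a code point (shape checks only)."""
    if len(b) != 4 or (b[0] & 0xF8) != 0xF0:
        raise ValueError("expected 4 bytes with a lead byte of the form 11110xxx")
    if any((t & 0xC0) != 0x80 for t in b[1:]):
        raise ValueError("trail bytes must be in the range 80-BF")
    # 3 payload bits from the lead byte, then 6 bits from each trail byte.
    return (
        ((b[0] & 0x07) << 18)
        | ((b[1] & 0x3F) << 12)
        | ((b[2] & 0x3F) << 6)
        | (b[3] & 0x3F)
    )


PREFIX = bytes.fromhex("F18EA6")  # the three leading bytes shown on this page

first = decode_utf8_4byte(PREFIX + b"\x80")  # first populated cell (row 80, column 00)
last = decode_utf8_4byte(PREFIX + b"\xBF")   # last populated cell (row B0, column 0F)
print(f"{first:X} {last:X}")  # 4E980 4E9BF
```

Only final bytes 80 through BF are valid trail bytes, which matches the four populated rows in the grid.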

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
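
Several of these properties can be checked without ICU. The sketch below uses Python's built-in UTF-8 codec as a stand-in for the ICU converter (an assumption; the two are separate implementations), and the helper name utf8_bytes_and_utf16_units is hypothetical. It assumes a UChar is one UTF-16 code unit.

```python
def utf8_bytes_and_utf16_units(ch: str):
    """Return (UTF-8 byte count, UTF-16 code unit count) for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len, utf16_units


# The substitution character bytes EF BF BD are U+FFFD encoded in UTF-8.
substitution = "\uFFFD".encode("utf-8")

# ASCII compatibility: each of 0x20-0x7E encodes to the same single byte.
ascii_compatible = all(
    chr(c).encode("utf-8") == bytes([c]) for c in range(0x20, 0x7F)
)

print(utf8_bytes_and_utf16_units("A"))           # 1 byte, 1 code unit
print(utf8_bytes_and_utf16_units("\uFFFD"))      # 3 bytes, 1 code unit
print(utf8_bytes_and_utf16_units(chr(0x4E980)))  # 4 bytes, 2 code units
```

A supplementary character such as U+4E980 takes 4 bytes but occupies two UTF-16 code units, so the per-code-unit count stays within the 1 to 3 range listed above.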


Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
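
The set above excludes only the surrogate code points D800 through DFFF. A minimal Python sketch of the same boundary follows, again using Python's built-in strict UTF-8 codec rather than ICU (an assumption); the helper name is_representable is hypothetical.

```python
def is_representable(code_point: int) -> bool:
    """True if Python's strict UTF-8 codec can encode the code point."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Raised for lone surrogates, U+D800 through U+DFFF.
        return False


for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x4E980, 0x10FFFF):
    print(f"U+{cp:04X}", is_representable(cp))
```

Surrogates are reserved for UTF-16 pairs and have no well-formed UTF-8 encoding, so only the two values inside D800 to DFFF report False here.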