UTF-32LE
List of Converter Aliases
Internal Converter Name | JAVA |
UTF-32LE | |
Codepage layout information is not available for this converter at this time.
Information About This Converter
Type of converter | UCNV_UTF32_LittleEndian |
Minimum number of bytes per UChar | 4 |
Maximum number of bytes per UChar | 4 |
Substitution character | \xFD\xFF\x00\x00 (See note below) |
Is ASCII [\x20-\x7E] compatible? | FALSE |
Is ASCII [\u0020-\u007E] ambiguous? | FALSE |
Contains ambiguous aliases? | FALSE |
Always generates Unicode NFC? | UNKNOWN |
Contains BiDi characters? | TRUE |
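The fixed width of 4 bytes per character and the lack of ASCII compatibility listed above can be checked with a short sketch. It uses Python's built-in `utf-32-le` codec as a stand-in for the ICU converter. That substitution is an assumption: the byte layout should match, but this is not ICU's API. The helper name `encode_utf32le` is hypothetical.

```python
# Sketch only: Python's "utf-32-le" codec stands in for the ICU converter (assumption).

def encode_utf32le(text: str) -> bytes:
    """Encode text as UTF-32LE: 4 bytes per code point, low byte first, no BOM."""
    return text.encode("utf-32-le")

# One ASCII letter, one Latin-1 letter, one BMP symbol, one supplementary character.
samples = ["A", "\u00e9", "\u20ac", "\U0001F600"]

for ch in samples:
    data = encode_utf32le(ch)
    # Every code point occupies exactly 4 bytes, regardless of its value.
    print(f"U+{ord(ch):04X} -> {data.hex(' ')} ({len(data)} bytes)")

# "A" becomes 41 00 00 00 rather than the single byte 41,
# which is why the converter is not ASCII compatible.
```

Because the width is constant, the byte length of an encoded string is always four times its code point count.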
List of Languages Representable By This Codepage
Locale | Locale Name |
af | Afrikaans |
agq | Aghem |
ak | Akan |
am | Amharic |
ar | Arabic |
as | Assamese |
asa | Asu |
ast | Asturian |
az | Azerbaijani |
az_Cyrl | Azerbaijani (Cyrillic) |
bas | Basaa |
be | Belarusian |
bem | Bemba |
bez | Bena |
bg | Bulgarian |
bgc | Haryanvi |
bho | Bhojpuri |
blo | Anii |
bm | Bambara |
bn | Bangla |
bo | Tibetan |
br | Breton |
brx | Bodo |
bs | Bosnian |
bs_Cyrl | Bosnian (Cyrillic) |
ca | Catalan |
ccp | Chakma |
ce | Chechen |
ceb | Cebuano |
cgg | Chiga |
chr | Cherokee |
ckb | Central Kurdish |
cs | Czech |
csw | Swampy Cree |
cv | Chuvash |
cy | Welsh |
da | Danish |
dav | Taita |
de | German |
de_CH | German (Switzerland) |
dje | Zarma |
doi | Dogri |
dsb | Lower Sorbian |
dua | Duala |
dyo | Jola-Fonyi |
dz | Dzongkha |
ebu | Embu |
ee | Ewe |
el | Greek |
en | English |
eo | Esperanto |
es | Spanish |
et | Estonian |
eu | Basque |
ewo | Ewondo |
fa | Persian |
ff | Fula |
ff_Adlm | Fula (Adlam) |
fi | Finnish |
fil | Filipino |
fo | Faroese |
fr | French |
fur | Friulian |
fy | Western Frisian |
ga | Irish |
gd | Scottish Gaelic |
gl | Galician |
gsw | Swiss German |
gu | Gujarati |
guz | Gusii |
gv | Manx |
ha | Hausa |
haw | Hawaiian |
he | Hebrew |
hi | Hindi |
hr | Croatian |
hsb | Upper Sorbian |
hu | Hungarian |
hy | Armenian |
ia | Interlingua |
id | Indonesian |
ie | Interlingue |
ig | Igbo |
ii | Sichuan Yi |
is | Icelandic |
it | Italian |
ja | Japanese |
jgo | Ngomba |
jmc | Machame |
jv | Javanese |
ka | Georgian |
kab | Kabyle |
kam | Kamba |
kde | Makonde |
kea | Kabuverdianu |
kgp | Kaingang |
khq | Koyra Chiini |
ki | Kikuyu |
kk | Kazakh |
kkj | Kako |
kl | Kalaallisut |
kln | Kalenjin |
km | Khmer |
kn | Kannada |
ko | Korean |
kok | Konkani |
ks | Kashmiri |
ks_Deva | Kashmiri (Devanagari) |
ksb | Shambala |
ksf | Bafia |
ksh | Colognian |
ku | Kurdish |
kw | Cornish |
kxv | Kuvi |
kxv_Deva | Kuvi (Devanagari) |
kxv_Orya | Kuvi (Odia) |
kxv_Telu | Kuvi (Telugu) |
ky | Kyrgyz |
lag | Langi |
lb | Luxembourgish |
lg | Ganda |
lij | Ligurian |
lkt | Lakota |
lmo | Lombard |
ln | Lingala |
lo | Lao |
lrc | Northern Luri |
lt | Lithuanian |
lu | Luba-Katanga |
luo | Luo |
luy | Luyia |
lv | Latvian |
mai | Maithili |
mas | Masai |
mer | Meru |
mfe | Morisyen |
mg | Malagasy |
mgh | Makhuwa-Meetto |
mgo | Metaʼ |
mi | Māori |
mk | Macedonian |
ml | Malayalam |
mn | Mongolian |
mni | Manipuri |
mr | Marathi |
ms | Malay |
mt | Maltese |
mua | Mundang |
my | Burmese |
mzn | Mazanderani |
naq | Nama |
nd | North Ndebele |
nds | Low German |
ne | Nepali |
nl | Dutch |
nmg | Kwasio |
nnh | Ngiemboon |
no | Norwegian |
nqo | N’Ko |
nus | Nuer |
nyn | Nyankole |
oc | Occitan |
om | Oromo |
or | Odia |
os | Ossetic |
pa | Punjabi |
pa_Arab | Punjabi (Arabic) |
pcm | Nigerian Pidgin |
pl | Polish |
prg | Prussian |
ps | Pashto |
ps_PK | Pashto (Pakistan) |
pt | Portuguese |
qu | Quechua |
raj | Rajasthani |
rm | Romansh |
rn | Rundi |
ro | Romanian |
rof | Rombo |
ru | Russian |
rw | Kinyarwanda |
rwk | Rwa |
sa | Sanskrit |
sah | Yakut |
saq | Samburu |
sat | Santali |
sbp | Sangu |
sc | Sardinian |
sd | Sindhi |
sd_Deva | Sindhi (Devanagari) |
se | Northern Sami |
seh | Sena |
ses | Koyraboro Senni |
sg | Sango |
shi | Tachelhit |
shi_Latn | Tachelhit (Latin) |
si | Sinhala |
sk | Slovak |
sl | Slovenian |
smn | Inari Sami |
sn | Shona |
so | Somali |
sq | Albanian |
sr | Serbian |
sr_Latn | Serbian (Latin) |
su | Sundanese |
sv | Swedish |
sw | Swahili |
sw_CD | Swahili (Congo - Kinshasa) |
sw_KE | Swahili (Kenya) |
syr | Syriac |
szl | Silesian |
ta | Tamil |
te | Telugu |
teo | Teso |
tg | Tajik |
th | Thai |
ti | Tigrinya |
tk | Turkmen |
to | Tongan |
tok | Toki Pona |
tr | Turkish |
tt | Tatar |
twq | Tasawaq |
tzm | Central Atlas Tamazight |
ug | Uyghur |
uk | Ukrainian |
ur | Urdu |
uz | Uzbek |
uz_Arab | Uzbek (Arabic) |
uz_Cyrl | Uzbek (Cyrillic) |
vai | Vai |
vai_Latn | Vai (Latin) |
vec | Venetian |
vi | Vietnamese |
vmw | Makhuwa |
vun | Vunjo |
wae | Walser |
wo | Wolof |
xh | Xhosa |
xnr | Kangri |
xog | Soga |
yav | Yangben |
yi | Yiddish |
yo | Yoruba |
yo_BJ | Yoruba (Benin) |
yrl | Nheengatu |
yue | Cantonese |
yue_Hans | Cantonese (Simplified) |
za | Zhuang |
zgh | Standard Moroccan Tamazight |
zh | Chinese |
zh_Hant | Chinese (Traditional) |
zu | Zulu |
Set of Unicode Characters Representable By This Codepage
[^\uD800-\uDFFF]
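The set above covers every code point except the surrogate range U+D800 to U+DFFF. A minimal sketch of that membership test follows, cross-checked against Python's `utf-32-le` codec, which is assumed here to behave like the ICU converter for this purpose. The function names `is_representable` and `codec_accepts` are hypothetical.

```python
# Sketch only: membership test for the set [^\uD800-\uDFFF], checked against
# Python's "utf-32-le" codec as a stand-in for ICU (assumption).

def is_representable(code_point: int) -> bool:
    """True for any Unicode code point outside the surrogate range U+D800..U+DFFF."""
    in_unicode_range = 0 <= code_point <= 0x10FFFF
    is_surrogate = 0xD800 <= code_point <= 0xDFFF
    return in_unicode_range and not is_surrogate

def codec_accepts(code_point: int) -> bool:
    """True if the codec encodes the code point without raising an error."""
    try:
        chr(code_point).encode("utf-32-le")
        return True
    except UnicodeEncodeError:
        return False

# Boundary values on both sides of the surrogate range.
for cp in (0x0041, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF):
    print(f"U+{cp:04X}: set={is_representable(cp)} codec={codec_accepts(cp)}")
```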
Note: The substitution byte sequence can be platform dependent.
It depends on the endianness of the platform.
Please see the Unicode FAQ for details.
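To illustrate how byte order changes the substitution sequence, the sketch below encodes U+FFFD in both byte orders. It again assumes Python's built-in `utf-32-le` and `utf-32-be` codecs as stand-ins, and it shows only the effect of byte order, not how ICU selects the sequence on a given platform.

```python
# Sketch only: shows how byte order affects the bytes of U+FFFD.
# Python's codecs stand in for the ICU converters (assumption).

SUBSTITUTION = "\uFFFD"  # U+FFFD REPLACEMENT CHARACTER

little = SUBSTITUTION.encode("utf-32-le")  # least significant byte first
big = SUBSTITUTION.encode("utf-32-be")     # most significant byte first

print("little-endian:", little.hex(" "))
print("big-endian:   ", big.hex(" "))

# Both sequences hold the same 32-bit value; only the byte order differs.
assert int.from_bytes(little, "little") == int.from_bytes(big, "big") == 0xFFFD
```

The little-endian result matches the `\xFD\xFF\x00\x00` sequence listed for this converter.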