HTML Character Sets


Character sets determine how the bytes that represent the text of your HTML document are translated to readable characters. A Web browser interprets the bytes in your document according to the applied character set translations. It interprets numeric character references, whether decimal or hexadecimal ("&#12345;" or "&#x1234;"), as ISO 10646 code points, consistent with the Unicode Standard, version 2.0, and independent of the chosen character set. Named entities (such as "&amp;") are likewise displayed independently of the chosen character set. Displaying an arbitrary numeric character reference requires a font on the user's system that can display that particular character. Accordingly, the content in the first column of the following tables may not render as expected on all systems.
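The difference between character references and raw bytes can be illustrated with a short Python sketch. It uses only the standard library; the sample byte value 0xE9 and the two charsets chosen for it are illustrative assumptions, not taken from the text above.

```python
import html

# Character references name Unicode code points directly, so they
# resolve the same way whatever character set the document uses.
decimal_ref = html.unescape("&#12345;")   # decimal numeric reference
hex_ref = html.unescape("&#x1234;")       # hexadecimal numeric reference
named_ref = html.unescape("&amp;")        # named entity

# A raw byte, by contrast, depends on the character set applied to it.
raw = b"\xe9"
as_latin1 = raw.decode("iso-8859-1")      # Latin small letter e with acute
as_cp1251 = raw.decode("windows-1251")    # Cyrillic small letter short i

print(ord(decimal_ref), hex(ord(hex_ref)), named_ref)
print(as_latin1, as_cp1251)
```

The same byte yields two different characters, while each reference always yields the same code point.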

ISO Latin-1 Character Set

The following table contains the complete ISO Latin-1 character set, corresponding to the first 256 entries of the Unicode character repertoire. The table provides each character, its decimal code, its named entity reference for HTML, and also a brief description.

Character

Decimal code

Named entity

Description

---

&#0;

---

Unused

---

&#1;

---

Unused

---

&#2;

---

Unused

---

&#3;

---

Unused

---

&#4;

---

Unused

---

&#5;

---

Unused

---

&#6;

---

Unused

---

&#7;

---

Unused

---

&#8;

---

Unused

---

&#9;

---

Horizontal tab

---

&#10;

---

Line feed

---

&#11;

---

Unused

---

&#12;

---

Unused

---

&#13;

---

Carriage Return

---

&#14;

---

Unused

---

&#15;

---

Unused

---

&#16;

---

Unused

---

&#17;

---

Unused

---

&#18;

---

Unused

---

&#19;

---

Unused

---

&#20;

---

Unused

---

&#21;

---

Unused

---

&#22;

---

Unused

---

&#23;

---

Unused

---

&#24;

---

Unused

---

&#25;

---

Unused

---

&#26;

---

Unused

---

&#27;

---

Unused

---

&#28;

---

Unused

---

&#29;

---

Unused

---

&#30;

---

Unused

---

&#31;

---

Unused

 

&#32;

---

Space

!

&#33;

---

Exclamation mark

"

&#34;

&quot;

Quotation mark

#

&#35;

---

Number sign

$

&#36;

---

Dollar sign

%

&#37;

---

Percent sign

&

&#38;

&amp;

Ampersand

'

&#39;

---

Apostrophe

(

&#40;

---

Left parenthesis

)

&#41;

---

Right parenthesis

*

&#42;

---

Asterisk

+

&#43;

---

Plus sign

,

&#44;

---

Comma

-

&#45;

---

Hyphen

.

&#46;

---

Period (fullstop)

/

&#47;

---

Solidus (slash)

0

&#48;

---

Digit 0

1

&#49;

---

Digit 1

2

&#50;

---

Digit 2

3

&#51;

---

Digit 3

4

&#52;

---

Digit 4

5

&#53;

---

Digit 5

6

&#54;

---

Digit 6

7

&#55;

---

Digit 7

8

&#56;

---

Digit 8

9

&#57;

---

Digit 9

:

&#58;

---

Colon

;

&#59;

---

Semicolon

<

&#60;

&lt;

Less than

=

&#61;

---

Equals sign

>

&#62;

&gt;

Greater than

?

&#63;

---

Question mark

@

&#64;

---

Commercial at

A

&#65;

---

Capital A

B

&#66;

---

Capital B

C

&#67;

---

Capital C

D

&#68;

---

Capital D

E

&#69;

---

Capital E

F

&#70;

---

Capital F

G

&#71;

---

Capital G

H

&#72;

---

Capital H

I

&#73;

---

Capital I

J

&#74;

---

Capital J

K

&#75;

---

Capital K

L

&#76;

---

Capital L

M

&#77;

---

Capital M

N

&#78;

---

Capital N

O

&#79;

---

Capital O

P

&#80;

---

Capital P

Q

&#81;

---

Capital Q

R

&#82;

---

Capital R

S

&#83;

---

Capital S

T

&#84;

---

Capital T

U

&#85;

---

Capital U

V

&#86;

---

Capital V

W

&#87;

---

Capital W

X

&#88;

---

Capital X

Y

&#89;

---

Capital Y

Z

&#90;

---

Capital Z

[

&#91;

---

Left square bracket

\

&#92;

---

Reverse solidus (backslash)

]

&#93;

---

Right square bracket

^

&#94;

---

Caret

_

&#95;

---

Horizontal bar (underscore)

`

&#96;

---

Grave accent

a

&#97;

---

Small a

b

&#98;

---

Small b

c

&#99;

---

Small c

d

&#100;

---

Small d

e

&#101;

---

Small e

f

&#102;

---

Small f

g

&#103;

---

Small g

h

&#104;

---

Small h

i

&#105;

---

Small i

j

&#106;

---

Small j

k

&#107;

---

Small k

l

&#108;

---

Small l

m

&#109;

---

Small m

n

&#110;

---

Small n

o

&#111;

---

Small o

p

&#112;

---

Small p

q

&#113;

---

Small q

r

&#114;

---

Small r

s

&#115;

---

Small s

t

&#116;

---

Small t

u

&#117;

---

Small u

v

&#118;

---

Small v

w

&#119;

---

Small w

x

&#120;

---

Small x

y

&#121;

---

Small y

z

&#122;

---

Small z

{

&#123;

---

Left curly brace

|

&#124;

---

Vertical bar

}

&#125;

---

Right curly brace

~

&#126;

---

Tilde

---

&#127;

---

Unused

 

&#160;

&nbsp;

Nonbreaking space

¡

&#161;

&iexcl;

Inverted exclamation

¢

&#162;

&cent;

Cent sign

£

&#163;

&pound;

Pound sterling

¤

&#164;

&curren;

General currency sign

¥

&#165;

&yen;

Yen sign

¦

&#166;

&brvbar; or &brkbar;

Broken vertical bar

§

&#167;

&sect;

Section sign

¨

&#168;

&uml; or &die;

Diæresis / Umlaut

©

&#169;

&copy;

Copyright

ª

&#170;

&ordf;

Feminine ordinal

«

&#171;

&laquo;

Left angle quote, guillemet left

¬

&#172;

&not;

Not sign

-

&#173;

&shy;

Soft hyphen

®

&#174;

&reg;

Registered trademark

¯

&#175;

&macr; or &hibar;

Macron accent

°

&#176;

&deg;

Degree sign

±

&#177;

&plusmn;

Plus or minus

²

&#178;

&sup2;

Superscript two

³

&#179;

&sup3;

Superscript three

´

&#180;

&acute;

Acute accent

µ

&#181;

&micro;

Micro sign

¶

&#182;

&para;

Paragraph sign

·

&#183;

&middot;

Middle dot

¸

&#184;

&cedil;

Cedilla

¹

&#185;

&sup1;

Superscript one

º

&#186;

&ordm;

Masculine ordinal

»

&#187;

&raquo;

Right angle quote, guillemet right

¼

&#188;

&frac14;

Fraction one-fourth

½

&#189;

&frac12;

Fraction one-half

¾

&#190;

&frac34;

Fraction three-fourths

¿

&#191;

&iquest;

Inverted question mark

À

&#192;

&Agrave;

Capital A, grave accent

Á

&#193;

&Aacute;

Capital A, acute accent

Â

&#194;

&Acirc;

Capital A, circumflex

Ã

&#195;

&Atilde;

Capital A, tilde

Ä

&#196;

&Auml;

Capital A, diæresis / umlaut

Å

&#197;

&Aring;

Capital A, ring

Æ

&#198;

&AElig;

Capital AE ligature

Ç

&#199;

&Ccedil;

Capital C, cedilla

È

&#200;

&Egrave;

Capital E, grave accent

É

&#201;

&Eacute;

Capital E, acute accent

Ê

&#202;

&Ecirc;

Capital E, circumflex

Ë

&#203;

&Euml;

Capital E, diæresis / umlaut

Ì

&#204;

&Igrave;

Capital I, grave accent

Í

&#205;

&Iacute;

Capital I, acute accent

Î

&#206;

&Icirc;

Capital I, circumflex

Ï

&#207;

&Iuml;

Capital I, diæresis / umlaut

Ð

&#208;

&ETH;

Capital Eth, Icelandic

Ñ

&#209;

&Ntilde;

Capital N, tilde

Ò

&#210;

&Ograve;

Capital O, grave accent

Ó

&#211;

&Oacute;

Capital O, acute accent

Ô

&#212;

&Ocirc;

Capital O, circumflex

Õ

&#213;

&Otilde;

Capital O, tilde

Ö

&#214;

&Ouml;

Capital O, diæresis / umlaut

×

&#215;

&times;

Multiply sign

Ø

&#216;

&Oslash;

Capital O, slash

Ù

&#217;

&Ugrave;

Capital U, grave accent

Ú

&#218;

&Uacute;

Capital U, acute accent

Û

&#219;

&Ucirc;

Capital U, circumflex

Ü

&#220;

&Uuml;

Capital U, diæresis / umlaut

Ý

&#221;

&Yacute;

Capital Y, acute accent

Þ

&#222;

&THORN;

Capital Thorn, Icelandic

ß

&#223;

&szlig;

Small sharp s, German sz

à

&#224;

&agrave;

Small a, grave accent

á

&#225;

&aacute;

Small a, acute accent

â

&#226;

&acirc;

Small a, circumflex

ã

&#227;

&atilde;

Small a, tilde

ä

&#228;

&auml;

Small a, diæresis / umlaut

å

&#229;

&aring;

Small a, ring

æ

&#230;

&aelig;

Small ae ligature

ç

&#231;

&ccedil;

Small c, cedilla

è

&#232;

&egrave;

Small e, grave accent

é

&#233;

&eacute;

Small e, acute accent

ê

&#234;

&ecirc;

Small e, circumflex

ë

&#235;

&euml;

Small e, diæresis / umlaut

ì

&#236;

&igrave;

Small i, grave accent

í

&#237;

&iacute;

Small i, acute accent

î

&#238;

&icirc;

Small i, circumflex

ï

&#239;

&iuml;

Small i, diæresis / umlaut

ð

&#240;

&eth;

Small eth, Icelandic

ñ

&#241;

&ntilde;

Small n, tilde

ò

&#242;

&ograve;

Small o, grave accent

ó

&#243;

&oacute;

Small o, acute accent

ô

&#244;

&ocirc;

Small o, circumflex

õ

&#245;

&otilde;

Small o, tilde

ö

&#246;

&ouml;

Small o, diæresis / umlaut

÷

&#247;

&divide;

Division sign

ø

&#248;

&oslash;

Small o, slash

ù

&#249;

&ugrave;

Small u, grave accent

ú

&#250;

&uacute;

Small u, acute accent

û

&#251;

&ucirc;

Small u, circumflex

ü

&#252;

&uuml;

Small u, diæresis / umlaut

ý

&#253;

&yacute;

Small y, acute accent

þ

&#254;

&thorn;

Small thorn, Icelandic

ÿ

&#255;

&yuml;

Small y, diæresis / umlaut
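The rows of the table above can be regenerated programmatically. The following is a minimal sketch that assumes Python's standard `html.entities.codepoint2name` mapping (which covers the HTML 4 entity names) is an acceptable stand-in for the named-entity column; the helper name `latin1_row` is hypothetical.

```python
from html.entities import codepoint2name


def latin1_row(code):
    """Return (character, decimal reference, named entity or '---')."""
    name = codepoint2name.get(code)
    entity = f"&{name};" if name else "---"
    return chr(code), f"&#{code};", entity


# Print a few sample rows: ampersand, capital A, copyright, e acute.
for code in (38, 65, 169, 233):
    print(*latin1_row(code), sep="\t")
```

Code points without an HTML 4 entity name, such as capital A, get the same "---" placeholder the table uses.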

Additional Named Entities for HTML

The following table contains additional named entities, their numeric character references, and a description of each. With the exception of the left-pointing and right-pointing angle brackets (&#9001; and &#9002;), these entities are rendered using Lucida Sans Unicode.

Character

Named entity

Numeric character reference

Description

Latin Extended-B

     

ƒ

&fnof;

&#402;

Latin small f with hook, =function, =florin, U0192 ISOtech

Greek

     

Α

&Alpha;

&#913;

Greek capital letter alpha, U0391

Β

&Beta;

&#914;

Greek capital letter beta, U0392

Γ

&Gamma;

&#915;

Greek capital letter gamma, U0393 ISOgrk3

Δ

&Delta;

&#916;

Greek capital letter delta, U0394 ISOgrk3

Ε

&Epsilon;

&#917;

Greek capital letter epsilon, U0395

Ζ

&Zeta;

&#918;

Greek capital letter zeta, U0396

Η

&Eta;

&#919;

Greek capital letter eta, U0397

Θ

&Theta;

&#920;

Greek capital letter theta, U0398 ISOgrk3

Ι

&Iota;

&#921;

Greek capital letter iota, U0399

Κ

&Kappa;

&#922;

Greek capital letter kappa, U039A

Λ

&Lambda;

&#923;

Greek capital letter lambda, U039B ISOgrk3

Μ

&Mu;

&#924;

Greek capital letter mu, U039C

Ν

&Nu;

&#925;

Greek capital letter nu, U039D

Ξ

&Xi;

&#926;

Greek capital letter xi, U039E ISOgrk3

Ο

&Omicron;

&#927;

Greek capital letter omicron, U039F

Π

&Pi;

&#928;

Greek capital letter pi, U03A0 ISOgrk3

Ρ

&Rho;

&#929;

Greek capital letter rho, U03A1

Σ

&Sigma;

&#931;

Greek capital letter sigma, U03A3 ISOgrk3

Τ

&Tau;

&#932;

Greek capital letter tau, U03A4

Υ

&Upsilon;

&#933;

Greek capital letter upsilon, U03A5 ISOgrk3

Φ

&Phi;

&#934;

Greek capital letter phi, U03A6 ISOgrk3

Χ

&Chi;

&#935;

Greek capital letter chi, U03A7

Ψ

&Psi;

&#936;

Greek capital letter psi, U03A8 ISOgrk3

Ω

&Omega;

&#937;

Greek capital letter omega, U03A9 ISOgrk3

α

&alpha;

&#945;

Greek small letter alpha, U03B1 ISOgrk3

β

&beta;

&#946;

Greek small letter beta, U03B2 ISOgrk3

γ

&gamma;

&#947;

Greek small letter gamma, U03B3 ISOgrk3

δ

&delta;

&#948;

Greek small letter delta, U03B4 ISOgrk3

ε

&epsilon;

&#949;

Greek small letter epsilon, U03B5 ISOgrk3

ζ

&zeta;

&#950;

Greek small letter zeta, U03B6 ISOgrk3

η

&eta;

&#951;

Greek small letter eta, U03B7 ISOgrk3

θ

&theta;

&#952;

Greek small letter theta, U03B8 ISOgrk3

ι

&iota;

&#953;

Greek small letter iota, U03B9 ISOgrk3

κ

&kappa;

&#954;

Greek small letter kappa, U03BA ISOgrk3

λ

&lambda;

&#955;

Greek small letter lambda, U03BB ISOgrk3

μ

&mu;

&#956;

Greek small letter mu, U03BC ISOgrk3

ν

&nu;

&#957;

Greek small letter nu, U03BD ISOgrk3

ξ

&xi;

&#958;

Greek small letter xi, U03BE ISOgrk3

ο

&omicron;

&#959;

Greek small letter omicron, U03BF NEW

π

&pi;

&#960;

Greek small letter pi, U03C0 ISOgrk3

ρ

&rho;

&#961;

Greek small letter rho, U03C1 ISOgrk3

ς

&sigmaf;

&#962;

Greek small letter final sigma, U03C2 ISOgrk3

σ

&sigma;

&#963;

Greek small letter sigma, U03C3 ISOgrk3

τ

&tau;

&#964;

Greek small letter tau, U03C4 ISOgrk3

υ

&upsilon;

&#965;

Greek small letter upsilon, U03C5 ISOgrk3

φ

&phi;

&#966;

Greek small letter phi, U03C6 ISOgrk3

χ

&chi;

&#967;

Greek small letter chi, U03C7 ISOgrk3

ψ

&psi;

&#968;

Greek small letter psi, U03C8 ISOgrk3

ω

&omega;

&#969;

Greek small letter omega, U03C9 ISOgrk3

ϑ

&thetasym;

&#977;

Greek small letter theta symbol, U03D1 NEW

ϒ

&upsih;

&#978;

Greek upsilon with hook symbol, U03D2 NEW

ϖ

&piv;

&#982;

Greek pi symbol, U03D6 ISOgrk3

General Punctuation

     

•

&bull;

&#8226;

bullet, =black small circle, U2022 ISOpub

…

&hellip;

&#8230;

horizontal ellipsis, =three dot leader, U2026 ISOpub

′

&prime;

&#8242;

prime, =minutes, =feet, U2032 ISOtech

″

&Prime;

&#8243;

double prime, =seconds, =inches, U2033 ISOtech

‾

&oline;

&#8254;

overline, =spacing overscore, U203E NEW

⁄

&frasl;

&#8260;

fraction slash, U2044 NEW

Letterlike Symbols

     

℘

&weierp;

&#8472;

script capital P, =power set, =Weierstrass p, U2118 ISOamso

ℑ

&image;

&#8465;

blackletter capital I, =imaginary part, U2111 ISOamso

ℜ

&real;

&#8476;

blackletter capital R, =real part symbol, U211C ISOamso

™

&trade;

&#8482;

trade mark sign, U2122 ISOnum

ℵ

&alefsym;

&#8501;

alef symbol, =first transfinite cardinal, U2135 NEW

Arrows

     
 

←

&larr;

&#8592;

leftward arrow, U2190 ISOnum

↑

&uarr;

&#8593;

upward arrow, U2191 ISOnum

→

&rarr;

&#8594;

rightward arrow, U2192 ISOnum

↓

&darr;

&#8595;

downward arrow, U2193 ISOnum

↔

&harr;

&#8596;

left right arrow, U2194 ISOamsa

↵

&crarr;

&#8629;

downward arrow with corner leftward, =carriage return, U21B5 NEW

⇐

&lArr;

&#8656;

leftward double arrow, U21D0 ISOtech

⇑

&uArr;

&#8657;

upward double arrow, U21D1 ISOamsa

⇒

&rArr;

&#8658;

rightward double arrow, U21D2 ISOtech

⇓

&dArr;

&#8659;

downward double arrow, U21D3 ISOamsa

⇔

&hArr;

&#8660;

left right double arrow, U21D4 ISOamsa

Mathematical Operators

     

∀

&forall;

&#8704;

for all, U2200 ISOtech

∂

&part;

&#8706;

partial differential, U2202 ISOtech

∃

&exist;

&#8707;

there exists, U2203 ISOtech

∅

&empty;

&#8709;

empty set, =null set, =diameter, U2205 ISOamso

∇

&nabla;

&#8711;

nabla, =backward difference, U2207 ISOtech

∈

&isin;

&#8712;

element of, U2208 ISOtech

∉

&notin;

&#8713;

not an element of, U2209 ISOtech

∋

&ni;

&#8715;

contains as member, U220B ISOtech

∏

&prod;

&#8719;

n-ary product, =product sign, U220F ISOamsb

∑

&sum;

&#8721;

n-ary summation, U2211 ISOamsb

−

&minus;

&#8722;

minus sign, U2212 ISOtech

∗

&lowast;

&#8727;

asterisk operator, U2217 ISOtech

√

&radic;

&#8730;

square root, =radical sign, U221A ISOtech

∝

&prop;

&#8733;

proportional to, U221D ISOtech

∞

&infin;

&#8734;

infinity, U221E ISOtech

∠

&ang;

&#8736;

angle, U2220 ISOamso

∧

&and;

&#8743;

logical and, =wedge, U2227 ISOtech

∨

&or;

&#8744;

logical or, =vee, U2228 ISOtech

∩

&cap;

&#8745;

intersection, =cap, U2229 ISOtech

∪

&cup;

&#8746;

union, =cup, U222A ISOtech

∫

&int;

&#8747;

integral, U222B ISOtech

∴

&there4;

&#8756;

therefore, U2234 ISOtech

∼

&sim;

&#8764;

tilde operator, =varies with, =similar to, U223C ISOtech

≅

&cong;

&#8773;

approximately equal to, U2245 ISOtech

≈

&asymp;

&#8776;

almost equal to, =asymptotic to, U2248 ISOamsr

≠

&ne;

&#8800;

not equal to, U2260 ISOtech

≡

&equiv;

&#8801;

identical to, U2261 ISOtech

≤

&le;

&#8804;

less-than or equal to, U2264 ISOtech

≥

&ge;

&#8805;

greater-than or equal to, U2265 ISOtech

⊂

&sub;

&#8834;

subset of, U2282 ISOtech

⊃

&sup;

&#8835;

superset of, U2283 ISOtech

⊄

&nsub;

&#8836;

not a subset of, U2284 ISOamsn

⊆

&sube;

&#8838;

subset of or equal to, U2286 ISOtech

⊇

&supe;

&#8839;

superset of or equal to, U2287 ISOtech

⊕

&oplus;

&#8853;

circled plus, =direct sum, U2295 ISOamsb

⊗

&otimes;

&#8855;

circled times, =vector product, U2297 ISOamsb

⊥

&perp;

&#8869;

up tack, =orthogonal to, =perpendicular, U22A5 ISOtech

⋅

&sdot;

&#8901;

dot operator, U22C5 ISOamsb

Miscellaneous Technical

     

⌈

&lceil;

&#8968;

left ceiling, =apl upstile, U2308, ISOamsc

⌉

&rceil;

&#8969;

right ceiling, U2309, ISOamsc

⌊

&lfloor;

&#8970;

left floor, =apl downstile, U230A, ISOamsc

⌋

&rfloor;

&#8971;

right floor, U230B, ISOamsc

〈

&lang;

&#9001;

left-pointing angle bracket, =bra, U2329 ISOtech

〉

&rang;

&#9002;

right-pointing angle bracket, =ket, U232A ISOtech

Geometric Shapes

     

◊

&loz;

&#9674;

lozenge, U25CA ISOpub

Miscellaneous Symbols

     

♠

&spades;

&#9824;

black spade suit, U2660 ISOpub

♣

&clubs;

&#9827;

black club suit, =shamrock, U2663 ISOpub

♥

&hearts;

&#9829;

black heart suit, =valentine, U2665 ISOpub

♦

&diams;

&#9830;

black diamond suit, U2666 ISOpub
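The relationship between a named entity, its decimal reference, and the "Uxxxx" notation used in the descriptions can be checked mechanically. This sketch assumes Python's standard `html.entities.name2codepoint` mapping as the reference data; the helper name `describe` is hypothetical.

```python
from html.entities import name2codepoint


def describe(name):
    """Return (named entity, decimal reference, Uxxxx notation)."""
    code = name2codepoint[name]
    return f"&{name};", f"&#{code};", f"U{code:04X}"


# Spot-check a few entities from different blocks of the table.
for name in ("fnof", "Alpha", "bull", "sum", "minus", "diams"):
    print(*describe(name))
```

The decimal reference is simply the hexadecimal "U" value written in base 10, so the two columns can always be cross-checked against each other.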

Character Entities for Special Symbols and BIDI Text

Character

Named entity (NE)

Numeric character reference (NCR)

Description

C0 Controls and Basic Latin

     

"

&quot;

&#34;

quotation mark, =apl quote, U0022 ISOnum

&

&amp;

&#38;

ampersand, U0026 ISOnum

<

&lt;

&#60;

less-than sign, U003C ISOnum

>

&gt;

&#62;

greater-than sign, U003E ISOnum

Latin Extended-A

     

Œ

&OElig;

&#338;

Latin capital ligature oe, U0152 ISOlat2

œ

&oelig;

&#339;

Latin small ligature oe, U0153 ISOlat2

Š

&Scaron;

&#352;

Latin capital letter s with caron, U0160 ISOlat2

š

&scaron;

&#353;

Latin small letter s with caron, U0161 ISOlat2

Ÿ

&Yuml;

&#376;

Latin capital letter y with diaeresis, U0178 ISOlat2

Spacing Modifier Letters

     

ˆ

&circ;

&#710;

modifier letter circumflex accent, U02C6 ISOpub

˜

&tilde;

&#732;

small tilde, U02DC ISOdia

General Punctuation

     
 

&ensp;

&#8194;

en space, U2002 ISOpub

 

&emsp;

&#8195;

em space, U2003 ISOpub

 

&thinsp;

&#8201;

thin space, U2009 ISOpub

‌

&zwnj;

&#8204;

zero width non-joiner, U200C NEW RFC 2070

‍

&zwj;

&#8205;

zero width joiner, U200D NEW RFC 2070

‎

&lrm;

&#8206;

left-to-right mark, U200E NEW RFC 2070

‏

&rlm;

&#8207;

right-to-left mark, U200F NEW RFC 2070

–

&ndash;

&#8211;

en dash, U2013 ISOpub

—

&mdash;

&#8212;

em dash, U2014 ISOpub

‘

&lsquo;

&#8216;

left single quotation mark, U2018 ISOnum

’

&rsquo;

&#8217;

right single quotation mark, U2019 ISOnum

‚

&sbquo;

&#8218;

single low-9 quotation mark, U201A NEW

“

&ldquo;

&#8220;

left double quotation mark, U201C ISOnum

”

&rdquo;

&#8221;

right double quotation mark, U201D ISOnum

„

&bdquo;

&#8222;

double low-9 quotation mark, U201E NEW

†

&dagger;

&#8224;

dagger, U2020 ISOpub

‡

&Dagger;

&#8225;

double dagger, U2021 ISOpub

‰

&permil;

&#8240;

per mille sign, U2030 ISOtech

‹

&lsaquo;

&#8249;

single left-pointing angle quotation mark, U2039 ISO proposed

›

&rsaquo;

&#8250;

single right-pointing angle quotation mark, U203A ISO proposed
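Several entries in the table above are invisible formatting characters, so it can help to inspect them by code point instead of by appearance. The sketch below uses Python's standard `html` and `unicodedata` modules; the helper name `mark_info` is hypothetical, and the bidirectional class shown is the one recorded in the Unicode data bundled with Python.

```python
import html
import unicodedata


def mark_info(name):
    """Return (code point, Unicode name, bidirectional class) for an entity."""
    ch = html.unescape(f"&{name};")
    return f"U+{ord(ch):04X}", unicodedata.name(ch), unicodedata.bidirectional(ch)


# The joiners and directional marks used for BIDI text.
for name in ("zwnj", "zwj", "lrm", "rlm"):
    print(name, *mark_info(name))
```

The left-to-right and right-to-left marks report the strong classes "L" and "R", which is what lets them influence the direction of neighboring text without being visible themselves.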

Character Set Recognition

Microsoft Internet Explorer uses the character set specified for a document to determine how to translate the bytes in the document into characters on the screen or on paper. By default, Internet Explorer uses the character set specified in the HTTP content type returned by the server to determine this translation. If this parameter is not given, Internet Explorer uses the character set specified by the META element in the document. It uses the user's preferences if no META element is given.
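The order of precedence described above can be sketched as a small function. This is only an illustration of that order, not Internet Explorer's actual implementation; the function name `resolve_charset` and its three parameters are assumptions introduced for the example.

```python
def resolve_charset(http_content_type, meta_charset, user_default):
    """Pick a charset: HTTP Content-Type first, then META, then user preference."""
    if http_content_type:
        # Look for a charset parameter, e.g. "text/html; charset=windows-1251".
        for part in http_content_type.split(";")[1:]:
            key, _, value = part.strip().partition("=")
            if key.lower() == "charset" and value:
                return value.strip("\"' ").lower()
    if meta_charset:
        # No charset in the HTTP header: fall back to the META element.
        return meta_charset.lower()
    # Neither source named a charset: use the user's preference.
    return user_default.lower()


print(resolve_charset("text/html; charset=windows-1251", "utf-8", "iso-8859-1"))
print(resolve_charset("text/html", "utf-8", "iso-8859-1"))
print(resolve_charset(None, None, "iso-8859-1"))
```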

You can use the META element to explicitly set the character set for a document. In this case, you set the HTTP-EQUIV= attribute to "Content-Type" and specify a character set identifier in the CONTENT= attribute. For example, the following META element identifies windows-1251 as the character set for the document.

<META HTTP-EQUIV="Content-Type"

CONTENT="text/html; CHARSET=windows-1251">

As long as you place the META element before the BODY element, it affects the whole document, including the TITLE element. For clarity, it should appear as the first element after HEAD so that all readers know the encoding before the first element that can be displayed is parsed. Note that the META element applies to the document containing it. This means, for example, that a compound document (a document consisting of two or more documents in a set of frames) can use different character sets in different frames.
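To see how a tool might read this declaration, here is a minimal sketch that extracts the charset from a META element using Python's standard `html.parser` module. The class name `MetaCharsetFinder` and the helper `find_meta_charset` are hypothetical, and a real browser's detection is considerably more involved.

```python
from html.parser import HTMLParser


class MetaCharsetFinder(HTMLParser):
    """Record the charset named by the first Content-Type META element."""

    def __init__(self):
        super().__init__()
        self.charset = None

    def handle_starttag(self, tag, attrs):
        if tag != "meta" or self.charset is not None:
            return
        attributes = dict(attrs)  # tag and attribute names arrive lowercased
        if (attributes.get("http-equiv") or "").lower() != "content-type":
            return
        content = attributes.get("content") or ""
        for part in content.split(";"):
            key, _, value = part.strip().partition("=")
            if key.lower() == "charset" and value:
                self.charset = value.strip().lower()


def find_meta_charset(markup):
    finder = MetaCharsetFinder()
    finder.feed(markup)
    return finder.charset


sample = ('<HEAD><META HTTP-EQUIV="Content-Type" '
          'CONTENT="text/html; CHARSET=windows-1251"><TITLE>x</TITLE></HEAD>')
print(find_meta_charset(sample))
```

Because the parser only sees the META element once it reaches it, placing the element first inside HEAD keeps everything before it limited to markup that does not depend on the encoding.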

The following table contains information concerning the character sets supported by Internet Explorer 5. The information provided is:

1. Display Name — the name used to refer to the character set.

2. Preferred Charset ID — the most common identifier used to set character sets in Internet Explorer. For example, in the previous code sample windows-1251 is the Charset ID.

3. Additional Aliases — other identifiers that may be used to set character sets.

4. MLang Code Pages — numeric value of the code pages used by the Internet Explorer MLang API.

5. Supported by Version — the versions of Internet Explorer that support the listed character sets.

Note: CS indicates that the version of Internet Explorer must support complex scripts such as Arabic, Hebrew, or Thai.

Charsets in Microsoft Internet Explorer 5

Display Name

Preferred Charset ID

Additional Aliases

MLang Code Page

Supported by Versions

Arabic ASMO-708

ASMO-708

 

708

4CS, 5

Arabic (DOS)

DOS-720

 

720

4CS, 5

Arabic (ISO)

iso-8859-6

ISO_8859-6:1987, iso-ir-127, ISO_8859-6, ECMA-114, arabic, csISOLatinArabic

28596

4CS, 5

Arabic (Windows)

windows-1256

 

1256

4CS, 5

Baltic (ISO)

iso-8859-4

csISOLatin4, iso-ir-110, ISO_8859-4, ISO_8859-4:1988, l4, latin4

28594

4, 5

Baltic (Windows)

Windows-1257

 

1257

4, 5

Central European (DOS)

ibm852

cp852

852

4, 5

Central European (ISO)

iso-8859-2

csISOLatin2, iso-ir-101, iso8859-2, iso_8859-2, iso_8859-2:1987, l2, latin2

28592

3, 4, 5

Central European (Windows)

windows-1250

x-cp1250

1250

3, 4, 5

Chinese Simplified (GB2312)

gb2312

chinese, csGB2312, csISO58GB231280, GB2312, GBK, GB_2312-80, iso-ir-58

936

3, 4, 5

Chinese Simplified (HZ)

hz-gb-2312

 

52936

4, 5

Chinese Traditional

big5

csbig5, x-x-big5

950

3, 4, 5

Cyrillic (DOS)

cp866

ibm866

866

4, 5

Cyrillic (ISO)

iso-8859-5

csISOLatinCyrillic, cyrillic, iso-ir-144, ISO_8859-5, ISO_8859-5:1988

28595

4, 5

Cyrillic (KOI8-R)

koi8-r

csKOI8R, koi

20866

3, 4, 5

Cyrillic (Windows)

windows-1251

x-cp1251

1251

3, 4, 5

Greek (ISO)

iso-8859-7

csISOLatinGreek, ECMA-118, ELOT_928, greek, greek8, iso-ir-126, ISO_8859-7, ISO_8859-7:1987

28597

3, 4, 5

Greek (Windows)

Windows-1253

windows-1253

1253

5

Hebrew (DOS)

DOS-862

 

862

4CS, 5

Hebrew (ISO)

iso-8859-8

csISOLatinHebrew, hebrew, iso-ir-138, ISO_8859-8, visual, ISO-8859-8 Visual

28598

4CS, 5

Hebrew (Windows)

windows-1255

logical, ISO_8859-8:1988, iso-ir-138

1255

3CS, 4CS, 5

Japanese (JIS)

iso-2022-jp

csISO2022JP

50220

4, 5

Japanese (JIS-Allow 1-byte Kana)

csISO2022JP

iso-2022-jp

50221

4, 5

Japanese (JIS-Allow 1-byte Kana - SO/SI)

iso-2022-jp

csISO2022JP

50222

3, 4, 5

Japanese (EUC)

euc-jp

csEUCPkdFmtJapanese, Extended_UNIX_Code_Packed_Format_for_Japanese, x-euc, x-euc-jp

51932

3, 4, 5

Japanese (Shift-JIS)

shift_jis

csShiftJIS, csWindows31J, ms_Kanji, shift-jis, x-ms-cp932, x-sjis

932

3, 4, 5

Korean

ks_c_5601-1987

csKSC56011987, euc-kr, korean, ks_c_5601

949

3, 4, 5

Korean (ISO)

iso-2022-kr

csISO2022KR

50225

3, 4, 5

Latin 3 (ISO)

iso-8859-3

 

28593

4, 5

Thai (Windows)

iso-8859-11

windows-874

874

3, 4, 5

Turkish (Windows)

Windows-1254

windows-1254

1254

3, 4, 5

Turkish (ISO)

iso-8859-9

csISOLatin5, ISO_8859-9, ISO_8859-9:1989, iso-ir-148, l5, latin5

28599

3, 4, 5

Ukrainian (KOI8-U)

koi8-u

 

21866

4, 5

Unicode (UTF-7)

utf-7

csUnicode11UTF7, unicode-1-1-utf-7, x-unicode-2-0-utf-7

65000

4, 5

Unicode (UTF-8)

utf-8

unicode-1-1-utf-8, unicode-2-0-utf-8, x-unicode-2-0-utf-8

65001

4, 5

Vietnamese (Windows)

windows-1258

 

1258

3, 4, 5

Western European (Windows)

Windows-1252

 

1252

5

Western European (ISO)

iso-8859-1

ANSI_X3.4-1968, ANSI_X3.4-1986, ascii, cp367, cp819, csASCII, IBM367, ibm819, iso-ir-100, iso-ir-6, ISO646-US, iso8859-1, ISO_646.irv:1991, iso_8859-1, iso_8859-1:1987, latin1, us, us-ascii, x-ansi

1252

3, 4, 5
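A table like the one above lends itself to a case-insensitive alias lookup. The following sketch copies a handful of rows from the table into a dictionary; the names `CHARSETS`, `ALIAS_INDEX`, and `lookup` are hypothetical, and this is not how MLang itself stores the data.

```python
# Preferred charset ID -> (MLang code page, additional aliases).
# Only a few rows from the table are included, for illustration.
CHARSETS = {
    "windows-1251": (1251, ["x-cp1251"]),
    "koi8-r": (20866, ["csKOI8R", "koi"]),
    "shift_jis": (932, ["csShiftJIS", "csWindows31J", "ms_Kanji",
                        "shift-jis", "x-ms-cp932", "x-sjis"]),
    "iso-8859-2": (28592, ["csISOLatin2", "iso-ir-101", "iso8859-2",
                           "iso_8859-2", "iso_8859-2:1987", "l2", "latin2"]),
}

# Index every label (preferred ID and aliases) in lowercase.
ALIAS_INDEX = {}
for preferred, (code_page, aliases) in CHARSETS.items():
    for label in [preferred, *aliases]:
        ALIAS_INDEX[label.lower()] = (preferred, code_page)


def lookup(label):
    """Return (preferred charset ID, code page), or None if the label is unknown."""
    return ALIAS_INDEX.get(label.strip().lower())


print(lookup("X-SJIS"))
print(lookup("latin2"))
print(lookup("no-such-charset"))
```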

Nonstandard Charsets with Special Meaning Inside Internet Explorer and MLang

These character sets are not to be used for labeling documents.

Display Name

Preferred Charset ID

Additional Aliases

MLang Code Page

Supported by Versions

Japanese (Auto Select)

_autodetect

 

50932

3, 4, 5

Korean (Auto Select)

_autodetect_kr

 

50949

4, 5

Unicode

unicode

 

1200

4, 5

Unicode (BigEndian)

unicodeFEFF

 

1201

4, 5

User Defined

x-user-defined

 

50000

4, 5

©2003 Microsoft Corporation. All rights reserved.

Original source: https://www.cnblogs.com/rock_chen/p/2014427.html