ISO Latin-1 Character Set & Corresponding HTML Entity Codes

Standard ASCII Character Set
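DEC | HEX | OCT | GLYPH | DESC | KEY | HTML 2.0+ | HTML 3.2+
0 | 0x0 | 00 | (nul) | null | Ctrl-@ | &#0; |
1 | 0x1 | 01 | (soh) | start-of-heading | Ctrl-A | &#1; |
2 | 0x2 | 02 | (stx) | start-of-text | Ctrl-B | &#2; |
3 | 0x3 | 03 | (etx) | end-of-text | Ctrl-C | &#3; |
4 | 0x4 | 04 | (eot) | end-of-transmit | Ctrl-D | &#4; |
5 | 0x5 | 05 | (enq) | enquiry | Ctrl-E | &#5; |
6 | 0x6 | 06 | (ack) | acknowledge | Ctrl-F | &#6; |
7 | 0x7 | 07 | (bel) | bell | Ctrl-G | &#7; |
8 | 0x8 | 010 | (bs) | back-space | Ctrl-H | &#8; |
9 | 0x9 | 011 | (ht) | horizontal-tab | Ctrl-I | &#9; |
10 | 0xa | 012 | (lf) or (nl) | line-feed or new-line | Ctrl-J or Ctrl-Enter | &#10; |
11 | 0xb | 013 | (vt) | vertical-tab | Ctrl-K | &#11; |
12 | 0xc | 014 | (np) | form-feed | Ctrl-L | &#12; |
13 | 0xd | 015 | (cr) | carriage-return | Ctrl-M | &#13; |
14 | 0xe | 016 | (so) | shift-out | Ctrl-N | &#14; |
15 | 0xf | 017 | (si) | shift-in | Ctrl-O | &#15; |
16 | 0x10 | 020 | (dle) | data-link-escape | Ctrl-P | &#16; |
17 | 0x11 | 021 | (dc1) | device-control-1 | Ctrl-Q | &#17; |
18 | 0x12 | 022 | (dc2) | device-control-2 | Ctrl-R | &#18; |
19 | 0x13 | 023 | (dc3) | device-control-3 | Ctrl-S | &#19; |
20 | 0x14 | 024 | (dc4) | device-control-4 | Ctrl-T | &#20; |
21 | 0x15 | 025 | (nak) | negative-ack | Ctrl-U | &#21; |
22 | 0x16 | 026 | (syn) | synchronous-idle | Ctrl-V | &#22; |
23 | 0x17 | 027 | (etb) | end-trans-block | Ctrl-W | &#23; |
24 | 0x18 | 030 | (can) | cancel | Ctrl-X | &#24; |
25 | 0x19 | 031 | (em) | end-medium | Ctrl-Y | &#25; |
26 | 0x1a | 032 | (sub) or (eof) | substitute or end-of-file | Ctrl-Z | &#26; |
27 | 0x1b | 033 | (esc) | escape | Ctrl-[ | &#27; |
28 | 0x1c | 034 | (fs) | file-separator | Ctrl-\ | &#28; |
29 | 0x1d | 035 | (gs) | group-separator | Ctrl-] | &#29; |
30 | 0x1e | 036 | (rs) | record-separator | Ctrl-^ | &#30; |
31 | 0x1f | 037 | (us) | unit-separator | Ctrl-_ | &#31; |
32 | 0x20 | 040 | (sp) | blank-space | Spacebar | &#32; | &sp;
33 | 0x21 | 041 | ! | | ! | &#33; | &excl;
34 | 0x22 | 042 | " | | " | &quot; or &#34; | &quot;
35 | 0x23 | 043 | # | | # | &#35; | &num;
36 | 0x24 | 044 | $ | | $ | &#36; | &dollar;
37 | 0x25 | 045 | % | | % | &#37; | &percnt;
38 | 0x26 | 046 | & | | & | &amp; or &#38; | &amp;
39 | 0x27 | 047 | ' | | ' | &#39; | &apos;
40 | 0x28 | 050 | ( | | ( | &#40; | &lpar;
41 | 0x29 | 051 | ) | | ) | &#41; | &rpar;
42 | 0x2a | 052 | * | | * | &#42; | &ast;
43 | 0x2b | 053 | + | | + | &#43; | &plus;
44 | 0x2c | 054 | , | | , | &#44; | &comma;
45 | 0x2d | 055 | - | | - | &#45; | ‐ or −
46 | 0x2e | 056 | . | | . | &#46; | &period;
47 | 0x2f | 057 | / | | / | &#47; | &sol;
48 | 0x30 | 060 | 0 | | 0 | &#48; |
49 | 0x31 | 061 | 1 | | 1 | &#49; |
50 | 0x32 | 062 | 2 | | 2 | &#50; |
51 | 0x33 | 063 | 3 | | 3 | &#51; |
52 | 0x34 | 064 | 4 | | 4 | &#52; |
53 | 0x35 | 065 | 5 | | 5 | &#53; |
54 | 0x36 | 066 | 6 | | 6 | &#54; |
55 | 0x37 | 067 | 7 | | 7 | &#55; |
56 | 0x38 | 070 | 8 | | 8 | &#56; |
57 | 0x39 | 071 | 9 | | 9 | &#57; |
58 | 0x3a | 072 | : | | : | &#58; | &colon;
59 | 0x3b | 073 | ; | | ; | &#59; | &semi;
60 | 0x3c | 074 | < | | < | &lt; or &#60; | &lt;
61 | 0x3d | 075 | = | | = | &#61; | &equals;
62 | 0x3e | 076 | > | | > | &gt; or &#62; | &gt;
63 | 0x3f | 077 | ? | | ? | &#63; | &quest;
64 | 0x40 | 0100 | @ | | @ | &#64; | &commat;
65 | 0x41 | 0101 | A | | A | &#65; |
66 | 0x42 | 0102 | B | | B | &#66; |
67 | 0x43 | 0103 | C | | C | &#67; |
68 | 0x44 | 0104 | D | | D | &#68; |
69 | 0x45 | 0105 | E | | E | &#69; |
70 | 0x46 | 0106 | F | | F | &#70; |
71 | 0x47 | 0107 | G | | G | &#71; |
72 | 0x48 | 0110 | H | | H | &#72; |
73 | 0x49 | 0111 | I | | I | &#73; |
74 | 0x4a | 0112 | J | | J | &#74; |
75 | 0x4b | 0113 | K | | K | &#75; |
76 | 0x4c | 0114 | L | | L | &#76; |
77 | 0x4d | 0115 | M | | M | &#77; |
78 | 0x4e | 0116 | N | | N | &#78; |
79 | 0x4f | 0117 | O | | O | &#79; |
80 | 0x50 | 0120 | P | | P | &#80; |
81 | 0x51 | 0121 | Q | | Q | &#81; |
82 | 0x52 | 0122 | R | | R | &#82; |
83 | 0x53 | 0123 | S | | S | &#83; |
84 | 0x54 | 0124 | T | | T | &#84; |
85 | 0x55 | 0125 | U | | U | &#85; |
86 | 0x56 | 0126 | V | | V | &#86; |
87 | 0x57 | 0127 | W | | W | &#87; |
88 | 0x58 | 0130 | X | | X | &#88; |
89 | 0x59 | 0131 | Y | | Y | &#89; |
90 | 0x5a | 0132 | Z | | Z | &#90; |
91 | 0x5b | 0133 | [ | | [ | &#91; | &lsqb;
92 | 0x5c | 0134 | \ | | \ | &#92; | &bsol;
93 | 0x5d | 0135 | ] | | ] | &#93; | &rsqb;
94 | 0x5e | 0136 | ^ | | ^ | &#94; | &circ;
95 | 0x5f | 0137 | _ | | _ | &#95; | &lowbar; or &horbar;
96 | 0x60 | 0140 | ` | | ` | &#96; | &grave;
97 | 0x61 | 0141 | a | | a | &#97; |
98 | 0x62 | 0142 | b | | b | &#98; |
99 | 0x63 | 0143 | c | | c | &#99; |
100 | 0x64 | 0144 | d | | d | &#100; |
101 | 0x65 | 0145 | e | | e | &#101; |
102 | 0x66 | 0146 | f | | f | &#102; |
103 | 0x67 | 0147 | g | | g | &#103; |
104 | 0x68 | 0150 | h | | h | &#104; |
105 | 0x69 | 0151 | i | | i | &#105; |
106 | 0x6a | 0152 | j | | j | &#106; |
107 | 0x6b | 0153 | k | | k | &#107; |
108 | 0x6c | 0154 | l | | l | &#108; |
109 | 0x6d | 0155 | m | | m | &#109; |
110 | 0x6e | 0156 | n | | n | &#110; |
111 | 0x6f | 0157 | o | | o | &#111; |
112 | 0x70 | 0160 | p | | p | &#112; |
113 | 0x71 | 0161 | q | | q | &#113; |
114 | 0x72 | 0162 | r | | r | &#114; |
115 | 0x73 | 0163 | s | | s | &#115; |
116 | 0x74 | 0164 | t | | t | &#116; |
117 | 0x75 | 0165 | u | | u | &#117; |
118 | 0x76 | 0166 | v | | v | &#118; |
119 | 0x77 | 0167 | w | | w | &#119; |
120 | 0x78 | 0170 | x | | x | &#120; |
121 | 0x79 | 0171 | y | | y | &#121; |
122 | 0x7a | 0172 | z | | z | &#122; |
123 | 0x7b | 0173 | { | | { | &#123; | &lcub;
124 | 0x7c | 0174 | | | | | | &#124; | &verbar;
125 | 0x7d | 0175 | } | | } | &#125; | &rcub;
126 | 0x7e | 0176 | ~ | | ~ | &#126; | &tilde;
127 | 0x7f | 0177 | (del) | delete | Ctrl-? | &#127; |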
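
The DEC, HEX, OCT, and HTML 2.0+ columns above all follow mechanically from the character code, and the Ctrl-key names for the control codes follow a simple bit pattern. The Python sketch below illustrates both. The function names `numeric_columns` and `control_key` are hypothetical, chosen only for this example, and the sketch does not reproduce the table's alternative key names such as Ctrl-Enter.

```python
def numeric_columns(code: int) -> tuple:
    """Return the DEC, HEX, OCT and HTML 2.0+ cells for one character code."""
    if not 0 <= code <= 255:
        raise ValueError("code must be in the range 0-255")
    dec = str(code)
    hexadecimal = format(code, "#x")   # lowercase with 0x prefix, e.g. 0x41
    octal = "0" + format(code, "o")    # C-style leading zero, e.g. 0101
    reference = f"&#{code};"           # numeric character reference
    return dec, hexadecimal, octal, reference


def control_key(code: int) -> str:
    """Return the Ctrl-key name for an ASCII control code (0-31 or 127)."""
    if not (0 <= code <= 31 or code == 127):
        raise ValueError("not an ASCII control code")
    # Flipping bit 6 maps 0 to '@', 1 to 'A', ..., 31 to '_', and 127 to '?'.
    return "Ctrl-" + chr(code ^ 0x40)


if __name__ == "__main__":
    print(numeric_columns(65))
    print(control_key(27))
```

Calling `numeric_columns(10)` yields the cells of the line-feed row, and `control_key(127)` yields the key listed for delete.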

Characters Added By ISO Latin-1 (ISO-8859-1)
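DEC | HEX | OCT | GLYPH | DESC | KEY | HTML 2.0+ | HTML 3.2+
128 | 0x80 | 0200 | | | | &#128; |
129 | 0x81 | 0201 | | | | &#129; |
130 | 0x82 | 0202 | ‚ | | ‚ | &#130; | &lsquor;
131 | 0x83 | 0203 | ƒ | | ƒ | &#131; | &fnof;
132 | 0x84 | 0204 | „ | | „ | &#132; | &bdquo;
133 | 0x85 | 0205 | … | | … | &#133; | &hellip;
134 | 0x86 | 0206 | † | | † | &#134; | &dagger;
135 | 0x87 | 0207 | ‡ | | ‡ | &#135; | &Dagger;
136 | 0x88 | 0210 | ˆ | | ˆ | &#136; | &circ;
137 | 0x89 | 0211 | ‰ | | ‰ | &#137; | &permil;
138 | 0x8a | 0212 | Š | | Š | &#138; | &Scaron;
139 | 0x8b | 0213 | ‹ | | ‹ | &#139; | &lsaquo;
140 | 0x8c | 0214 | Œ | | Œ | &#140; | &OElig;
141 | 0x8d | 0215 | | | | &#141; |
142 | 0x8e | 0216 | Ž | | Ž | &#142; |
143 | 0x8f | 0217 | | | | &#143; |
144 | 0x90 | 0220 | | | | &#144; |
145 | 0x91 | 0221 | ‘ | | ‘ | &#145; | &lsquo; or &rsquor;
146 | 0x92 | 0222 | ’ | | ’ | &#146; | &rsquo;
147 | 0x93 | 0223 | “ | | “ | &#147; | &ldquo; or &rdquor;
148 | 0x94 | 0224 | ” | | ” | &#148; | &rdquo;
149 | 0x95 | 0225 | • | | • | &#149; | &bull;
150 | 0x96 | 0226 | – | | – | &#150; | &ndash; or &endash;
151 | 0x97 | 0227 | — | | — | &#151; | &mdash; or &emdash;
152 | 0x98 | 0230 | ˜ | | ˜ | &#152; | &tilde;
153 | 0x99 | 0231 | ™ | | ™ | &#153; | &trade;
154 | 0x9a | 0232 | š | | š | &#154; | &scaron;
155 | 0x9b | 0233 | › | | › | &#155; | &rsaquo;
156 | 0x9c | 0234 | œ | | œ | &#156; | &oelig;
157 | 0x9d | 0235 | | | | &#157; |
158 | 0x9e | 0236 | ž | | ž | &#158; |
159 | 0x9f | 0237 | Ÿ | | Ÿ | &#159; | &Yuml;
160 | 0xa0 | 0240 | | | | &#160; | &nbsp;
161 | 0xa1 | 0241 | ¡ | | ¡ | &#161; | &iexcl;
162 | 0xa2 | 0242 | ¢ | | ¢ | &#162; | &cent;
163 | 0xa3 | 0243 | £ | | £ | &#163; | &pound;
164 | 0xa4 | 0244 | ¤ | | ¤ | &#164; | &curren;
165 | 0xa5 | 0245 | ¥ | | ¥ | &#165; | &yen;
166 | 0xa6 | 0246 | ¦ | | ¦ | &#166; | &brvbar;
167 | 0xa7 | 0247 | § | | § | &#167; | &sect;
168 | 0xa8 | 0250 | ¨ | | ¨ | &#168; | &uml;
169 | 0xa9 | 0251 | © | | © | &#169; | &copy;
170 | 0xaa | 0252 | ª | | ª | &#170; | &ordf;
171 | 0xab | 0253 | « | | « | &#171; | &laquo;
172 | 0xac | 0254 | ¬ | | ¬ | &#172; | &not;
173 | 0xad | 0255 | ­ | | ­ | &#173; | &shy;
174 | 0xae | 0256 | ® | | ® | &#174; | &reg;
175 | 0xaf | 0257 | ¯ | | ¯ | &#175; | &macr;
176 | 0xb0 | 0260 | ° | | ° | &#176; | &deg;
177 | 0xb1 | 0261 | ± | | ± | &#177; | &plusmn;
178 | 0xb2 | 0262 | ² | | ² | &#178; | &sup2;
179 | 0xb3 | 0263 | ³ | | ³ | &#179; | &sup3;
180 | 0xb4 | 0264 | ´ | | ´ | &#180; | &acute;
181 | 0xb5 | 0265 | µ | | µ | &#181; | &micro;
182 | 0xb6 | 0266 | ¶ | | ¶ | &#182; | &para;
183 | 0xb7 | 0267 | · | | · | &#183; | &middot;
184 | 0xb8 | 0270 | ¸ | | ¸ | &#184; | &cedil;
185 | 0xb9 | 0271 | ¹ | | ¹ | &#185; | &sup1;
186 | 0xba | 0272 | º | | º | &#186; | &ordm;
187 | 0xbb | 0273 | » | | » | &#187; | &raquo;
188 | 0xbc | 0274 | ¼ | | ¼ | &#188; | &frac14;
189 | 0xbd | 0275 | ½ | | ½ | &#189; | &frac12;
190 | 0xbe | 0276 | ¾ | | ¾ | &#190; | &frac34;
191 | 0xbf | 0277 | ¿ | | ¿ | &#191; | &iquest;
192 | 0xc0 | 0300 | À | | À | &#192; | &Agrave;
193 | 0xc1 | 0301 | Á | | Á | &#193; | &Aacute;
194 | 0xc2 | 0302 | Â | | Â | &#194; | &Acirc;
195 | 0xc3 | 0303 | Ã | | Ã | &#195; | &Atilde;
196 | 0xc4 | 0304 | Ä | | Ä | &#196; | &Auml;
197 | 0xc5 | 0305 | Å | | Å | &#197; | &Aring;
198 | 0xc6 | 0306 | Æ | | Æ | &#198; | &AElig;
199 | 0xc7 | 0307 | Ç | | Ç | &#199; | &Ccedil;
200 | 0xc8 | 0310 | È | | È | &#200; | &Egrave;
201 | 0xc9 | 0311 | É | | É | &#201; | &Eacute;
202 | 0xca | 0312 | Ê | | Ê | &#202; | &Ecirc;
203 | 0xcb | 0313 | Ë | | Ë | &#203; | &Euml;
204 | 0xcc | 0314 | Ì | | Ì | &#204; | &Igrave;
205 | 0xcd | 0315 | Í | | Í | &#205; | &Iacute;
206 | 0xce | 0316 | Î | | Î | &#206; | &Icirc;
207 | 0xcf | 0317 | Ï | | Ï | &#207; | &Iuml;
208 | 0xd0 | 0320 | Ð | | Ð | &#208; | &ETH;
209 | 0xd1 | 0321 | Ñ | | Ñ | &#209; | &Ntilde;
210 | 0xd2 | 0322 | Ò | | Ò | &#210; | &Ograve;
211 | 0xd3 | 0323 | Ó | | Ó | &#211; | &Oacute;
212 | 0xd4 | 0324 | Ô | | Ô | &#212; | &Ocirc;
213 | 0xd5 | 0325 | Õ | | Õ | &#213; | &Otilde;
214 | 0xd6 | 0326 | Ö | | Ö | &#214; | &Ouml;
215 | 0xd7 | 0327 | × | | × | &#215; | &times;
216 | 0xd8 | 0330 | Ø | | Ø | &#216; | &Oslash;
217 | 0xd9 | 0331 | Ù | | Ù | &#217; | &Ugrave;
218 | 0xda | 0332 | Ú | | Ú | &#218; | &Uacute;
219 | 0xdb | 0333 | Û | | Û | &#219; | &Ucirc;
220 | 0xdc | 0334 | Ü | | Ü | &#220; | &Uuml;
221 | 0xdd | 0335 | Ý | | Ý | &#221; | &Yacute;
222 | 0xde | 0336 | Þ | | Þ | &#222; | &THORN;
223 | 0xdf | 0337 | ß | | ß | &#223; | &szlig;
224 | 0xe0 | 0340 | à | | à | &#224; | &agrave;
225 | 0xe1 | 0341 | á | | á | &#225; | &aacute;
226 | 0xe2 | 0342 | â | | â | &#226; | &acirc;
227 | 0xe3 | 0343 | ã | | ã | &#227; | &atilde;
228 | 0xe4 | 0344 | ä | | ä | &#228; | &auml;
229 | 0xe5 | 0345 | å | | å | &#229; | &aring;
230 | 0xe6 | 0346 | æ | | æ | &#230; | &aelig;
231 | 0xe7 | 0347 | ç | | ç | &#231; | &ccedil;
232 | 0xe8 | 0350 | è | | è | &#232; | &egrave;
233 | 0xe9 | 0351 | é | | é | &#233; | &eacute;
234 | 0xea | 0352 | ê | | ê | &#234; | &ecirc;
235 | 0xeb | 0353 | ë | | ë | &#235; | &euml;
236 | 0xec | 0354 | ì | | ì | &#236; | &igrave;
237 | 0xed | 0355 | í | | í | &#237; | &iacute;
238 | 0xee | 0356 | î | | î | &#238; | &icirc;
239 | 0xef | 0357 | ï | | ï | &#239; | &iuml;
240 | 0xf0 | 0360 | ð | | ð | &#240; | &eth;
241 | 0xf1 | 0361 | ñ | | ñ | &#241; | &ntilde;
242 | 0xf2 | 0362 | ò | | ò | &#242; | &ograve;
243 | 0xf3 | 0363 | ó | | ó | &#243; | &oacute;
244 | 0xf4 | 0364 | ô | | ô | &#244; | &ocirc;
245 | 0xf5 | 0365 | õ | | õ | &#245; | &otilde;
246 | 0xf6 | 0366 | ö | | ö | &#246; | &ouml;
247 | 0xf7 | 0367 | ÷ | | ÷ | &#247; | &divide;
248 | 0xf8 | 0370 | ø | | ø | &#248; | &oslash;
249 | 0xf9 | 0371 | ù | | ù | &#249; | &ugrave;
250 | 0xfa | 0372 | ú | | ú | &#250; | &uacute;
251 | 0xfb | 0373 | û | | û | &#251; | &ucirc;
252 | 0xfc | 0374 | ü | | ü | &#252; | &uuml;
253 | 0xfd | 0375 | ý | | ý | &#253; | &yacute;
254 | 0xfe | 0376 | þ | | þ | &#254; | &thorn;
255 | 0xff | 0377 | ÿ | | ÿ | &#255; | &yuml;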
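
As a rough cross-check of the entity columns, the sketch below uses Python's standard `html` module to produce the numeric and named forms of a character and to decode them again. The helper name `entity_forms` is hypothetical. `html.entities.codepoint2name` covers the HTML 4.01 entity names, so older aliases listed in the table (for example `&endash;`) are not expected to appear there.

```python
import html
from html.entities import codepoint2name
from typing import Optional, Tuple


def entity_forms(ch: str) -> Tuple[str, Optional[str]]:
    """Return (numeric reference, named entity or None) for one character."""
    code = ord(ch)
    numeric = f"&#{code};"
    name = codepoint2name.get(code)          # None when no HTML 4.01 name exists
    named = f"&{name};" if name else None
    return numeric, named


if __name__ == "__main__":
    print(entity_forms("é"))
    # Both forms decode back to the same character.
    print(html.unescape("&#233;"), html.unescape("&eacute;"))
    # Numeric references in the 128-159 range are decoded the way browsers
    # do it, as Windows code page 1252 characters (here an en dash).
    print(html.unescape("&#150;"))
```

The last call is worth noting for the 128-159 rows: the decoder treats those numeric references as their code page 1252 characters rather than as ISO Latin-1 control codes.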

Extended ASCII, ANSI, and Unicode in DOS and Windows

MS-DOS uses the IBM PC Extended Character Set (ECS), often referred to as the Extended ASCII character set. The lower 128 values are the same as the ASCII character set, but values 128-255 map to European characters differently from other character sets, including the one used by Windows.

One odd thing about ECS is that codes 1-31, which ASCII assigns to non-printing control functions such as Return and Line feed, and codes 128-159, which ISO Latin-1 reserves for control functions, are also assigned printable graphics for use in contexts where control functions are not needed. That is how ECS displays get happy faces and arrows in the low codes and European characters in the 128-159 range, alongside the line-drawing characters found higher in the upper half.
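
The difference in the 128-159 range can be checked with Python's built-in codecs. This is a small sketch that assumes the `cp437` codec is an adequate stand-in for ECS and `latin-1` for ISO Latin-1. Python's `cp437` codec decodes bytes 1-31 as plain control characters, so the happy faces and arrows are not reproduced here.

```python
import unicodedata

# Bytes 128-159, the range ISO Latin-1 reserves for control codes.
upper_block = bytes(range(128, 160))

as_ecs = upper_block.decode("cp437")       # DOS code page 437
as_latin1 = upper_block.decode("latin-1")  # ISO-8859-1

# Unicode general category "Cc" marks a control character.
printable_in_ecs = all(unicodedata.category(c) != "Cc" for c in as_ecs)
control_in_latin1 = all(unicodedata.category(c) == "Cc" for c in as_latin1)

if __name__ == "__main__":
    print(as_ecs)
    print(printable_in_ecs, control_in_latin1)
```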

Extended ASCII Character Set (ECS)

Windows 3.x (3.0, 3.1, and WFW 3.11) and Windows 9x (95, 98, and ME) use the ANSI character set (for US English, Microsoft code page 1252), which includes 256 characters, numbered 0 to 255. Values 0 to 127 are the same as in the ASCII character set. Values 128 to 255 are similar to the ISO Latin-1 character set, but with extensions and incompatibilities: most notably, Windows assigns printable characters such as curly quotes and dashes to codes 128-159, which ISO Latin-1 reserves for control functions. You can enter any ANSI character into a Windows application. If you see it on the keyboard, just press it. Even if it isn't on your keyboard, you can still enter it using a special Alt-Num key sequence: hold down Alt and type 0 followed by the character's decimal code on the numeric keypad (for example, Alt+0163 for the British pound symbol).
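
To illustrate the Alt-Num sequence, the sketch below derives it from a character's code page 1252 value. It assumes the usual Windows convention of holding Alt and typing a leading 0 followed by the decimal ANSI code on the numeric keypad. The helper name `alt_sequence` is hypothetical.

```python
def alt_sequence(ch: str) -> str:
    """Return the Alt-Num sequence for one character in code page 1252."""
    if len(ch) != 1:
        raise ValueError("expected a single character")
    # Raises UnicodeEncodeError if the character is not in code page 1252.
    code = ch.encode("cp1252")[0]
    # Four digits with a leading zero, e.g. 163 becomes 0163.
    return f"Alt+{code:04d}"


if __name__ == "__main__":
    for ch in "£é€":
        print(ch, alt_sequence(ch))
```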

Unfortunately, this ANSI character set is incompatible with the ECS character set used by DOS. For example, the British pound symbol is code 163 in ANSI, but code 156 in ECS. Worse still, ANSI does not include many of the ECS symbols, particularly the line drawing characters. If you try to display a DOS document in Windows, Windows attempts to convert the ECS characters to ANSI. Any character that does not convert is replaced by an arbitrary graphic. Therefore a round-trip conversion from DOS to Windows and back will not re-create the original document.
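
The pound-sign mismatch and the lossy round trip can be reproduced with Python's `cp437` and `cp1252` codecs, used here as stand-ins for ECS and ANSI. This is only a sketch: Python substitutes a question mark for characters that do not convert, whereas Windows picks its own replacement graphic.

```python
# The pound sign sits at different codes in the two character sets.
pound_ansi = "£".encode("cp1252")[0]
pound_ecs = "£".encode("cp437")[0]

# A small DOS "document": three line-drawing characters, a space, a pound sign.
dos_bytes = bytes([0xC9, 0xCD, 0xBB, 0x20, 0x9C])
text = dos_bytes.decode("cp437")

# DOS to Windows: characters with no ANSI equivalent are replaced.
ansi_bytes = text.encode("cp1252", errors="replace")

# Windows back to DOS: the replaced characters cannot be recovered.
round_trip = ansi_bytes.decode("cp1252").encode("cp437", errors="replace")

if __name__ == "__main__":
    print(pound_ansi, pound_ecs)
    print(dos_bytes, round_trip, round_trip == dos_bytes)
```

The pound sign survives the trip because both sets contain it, while the line-drawing characters come back as substitutes.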

ANSI Character Set

Windows NT (NT 3.5, NT 4.0, 2000, XP, and 2003) uses the 16-bit Unicode character set internally. Unicode covers the scripts used by major living languages, including those of the Far East, as well as dead languages that are in widespread scholarly use. For backward compatibility, NT also supports ANSI and, for DOS-style console applications, the OEM code page (for US English, Microsoft code page 437). Unicode support can be added to Windows 9x by installing the Microsoft Layer for Unicode on Windows 9x Systems.
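
One practical consequence is that every ISO Latin-1 code in the tables above is also the Unicode code point of the same character, and each fits in a single 16-bit unit. The sketch below checks this with Python's `latin-1` and `utf-16-le` codecs. The choice of little-endian byte order is an assumption made for the example.

```python
# Every ISO Latin-1 byte value decodes to the Unicode code point of the same number.
latin1_bytes = bytes(range(256))
text = latin1_bytes.decode("latin-1")
same_codes = all(ord(ch) == byte for ch, byte in zip(text, latin1_bytes))

# In 16-bit Unicode the pound sign is one 16-bit unit holding the value 163 (0xa3).
pound_utf16 = "£".encode("utf-16-le")

# The DOS (code page 437) byte for the same character is different.
pound_oem = "£".encode("cp437")

if __name__ == "__main__":
    print(same_codes)
    print(pound_utf16, pound_oem)
```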
