Description                         Code         Entity name
=================================== ============ ==============
quotation mark                      &#34; --> "  &quot; --> "
ampersand                           &#38; --> &  &amp; --> &
less-than sign                      &#60; --> <  &lt; --> <
greater-than sign                   &#62; --> >  &gt; --> >

Description                         Char Code         Entity name
=================================== ==== ============ ==============
non-breaking space                       &#160; -->   &nbsp; -->  
inverted exclamation mark           ¡    &#161; --> ¡ &iexcl; --> ¡
cent sign                           ¢    &#162; --> ¢ &cent; --> ¢
pound sign                          £    &#163; --> £ &pound; --> £
currency sign                       ¤    &#164; --> ¤ &curren; --> ¤
yen sign                            ¥    &#165; --> ¥ &yen; --> ¥
broken vertical bar                 ¦    &#166; --> ¦ &brvbar; --> ¦
                                                      &brkbar; --> &brkbar;
section sign                        §    &#167; --> § &sect; --> §
spacing dieresis                    ¨    &#168; --> ¨ &uml; --> ¨
copyright sign                      ©    &#169; --> © &copy; --> ©
feminine ordinal indicator          ª    &#170; --> ª &ordf; --> ª
angle quotation mark, left          «    &#171; --> « &laquo; --> «
negation sign                       ¬    &#172; --> ¬ &not; --> ¬
soft hyphen                         ­    &#173; --> ­ &shy; --> ­
circled r registered sign           ®    &#174; --> ® &reg; --> ®
spacing macron                      ¯    &#175; --> ¯ &hibar; --> &hibar;
degree sign                         °    &#176; --> ° &deg; --> °
plus-or-minus sign                  ±    &#177; --> ± &plusmn; --> ±
superscript 2                       ²    &#178; --> ² &sup2; --> ²
superscript 3                       ³    &#179; --> ³ &sup3; --> ³
spacing acute                       ´    &#180; --> ´ &acute; --> ´
micro sign                          µ    &#181; --> µ &micro; --> µ
paragraph sign                      ¶    &#182; --> ¶ &para; --> ¶
middle dot                          ·    &#183; --> · &middot; --> ·
spacing cedilla                     ¸    &#184; --> ¸ &cedil; --> ¸
superscript 1                       ¹    &#185; --> ¹ &sup1; --> ¹
masculine ordinal indicator         º    &#186; --> º &ordm; --> º
angle quotation mark, right         »    &#187; --> » &raquo; --> »
fraction 1/4                        ¼    &#188; --> ¼ &frac14; --> ¼
fraction 1/2                        ½    &#189; --> ½ &frac12; --> ½
fraction 3/4                        ¾    &#190; --> ¾ &frac34; --> ¾
inverted question mark              ¿    &#191; --> ¿ &iquest; --> ¿
capital a, grave accent             À    &#192; --> À &Agrave; --> À
capital a, acute accent             Á    &#193; --> Á &Aacute; --> Á
capital a, circumflex accent        Â    &#194; --> Â &Acirc; --> Â
capital a, tilde                    Ã    &#195; --> Ã &Atilde; --> Ã
capital a, dieresis or umlaut mark  Ä    &#196; --> Ä &Auml; --> Ä
capital a, ring                     Å    &#197; --> Å &Aring; --> Å
capital ae diphthong (ligature)     Æ    &#198; --> Æ &AElig; --> Æ
capital c, cedilla                  Ç    &#199; --> Ç &Ccedil; --> Ç
capital e, grave accent             È    &#200; --> È &Egrave; --> È
capital e, acute accent             É    &#201; --> É &Eacute; --> É
capital e, circumflex accent        Ê    &#202; --> Ê &Ecirc; --> Ê
capital e, dieresis or umlaut mark  Ë    &#203; --> Ë &Euml; --> Ë
capital i, grave accent             Ì    &#204; --> Ì &Igrave; --> Ì
capital i, acute accent             Í    &#205; --> Í &Iacute; --> Í
capital i, circumflex accent        Î    &#206; --> Î &Icirc; --> Î
capital i, dieresis or umlaut mark  Ï    &#207; --> Ï &Iuml; --> Ï
capital eth, icelandic              Ð    &#208; --> Ð &ETH; --> Ð
                                                      &Dstrok; --> Đ
capital n, tilde                    Ñ    &#209; --> Ñ &Ntilde; --> Ñ
capital o, grave accent             Ò    &#210; --> Ò &Ograve; --> Ò
capital o, acute accent             Ó    &#211; --> Ó &Oacute; --> Ó
capital o, circumflex accent        Ô    &#212; --> Ô &Ocirc; --> Ô
capital o, tilde                    Õ    &#213; --> Õ &Otilde; --> Õ
capital o, dieresis or umlaut mark  Ö    &#214; --> Ö &Ouml; --> Ö
multiplication sign                 ×    &#215; --> × &times; --> ×
capital o, slash                    Ø    &#216; --> Ø &Oslash; --> Ø
capital u, grave accent             Ù    &#217; --> Ù &Ugrave; --> Ù
capital u, acute accent             Ú    &#218; --> Ú &Uacute; --> Ú
capital u, circumflex accent        Û    &#219; --> Û &Ucirc; --> Û
capital u, dieresis or umlaut mark  Ü    &#220; --> Ü &Uuml; --> Ü
capital y, acute accent             Ý    &#221; --> Ý &Yacute; --> Ý
capital thorn, icelandic            Þ    &#222; --> Þ &THORN; --> Þ
small sharp s, german (sz ligature) ß    &#223; --> ß &szlig; --> ß
small a, grave accent               à    &#224; --> à &agrave; --> à
small a, acute accent               á    &#225; --> á &aacute; --> á
small a, circumflex accent          â    &#226; --> â &acirc; --> â
small a, tilde                      ã    &#227; --> ã &atilde; --> ã
small a, dieresis or umlaut mark    ä    &#228; --> ä &auml; --> ä
small a, ring                       å    &#229; --> å &aring; --> å
small ae diphthong (ligature)       æ    &#230; --> æ &aelig; --> æ
small c, cedilla                    ç    &#231; --> ç &ccedil; --> ç
small e, grave accent               è    &#232; --> è &egrave; --> è
small e, acute accent               é    &#233; --> é &eacute; --> é
small e, circumflex accent          ê    &#234; --> ê &ecirc; --> ê
small e, dieresis or umlaut mark    ë    &#235; --> ë &euml; --> ë
small i, grave accent               ì    &#236; --> ì &igrave; --> ì
small i, acute accent               í    &#237; --> í &iacute; --> í
small i, circumflex accent          î    &#238; --> î &icirc; --> î
small i, dieresis or umlaut mark    ï    &#239; --> ï &iuml; --> ï
small eth, icelandic                ð    &#240; --> ð &eth; --> ð
small n, tilde                      ñ    &#241; --> ñ &ntilde; --> ñ
small o, grave accent               ò    &#242; --> ò &ograve; --> ò
small o, acute accent               ó    &#243; --> ó &oacute; --> ó
small o, circumflex accent          ô    &#244; --> ô &ocirc; --> ô
small o, tilde                      õ    &#245; --> õ &otilde; --> õ
small o, dieresis or umlaut mark    ö    &#246; --> ö &ouml; --> ö
division sign                       ÷    &#247; --> ÷ &divide; --> ÷
small o, slash                      ø    &#248; --> ø &oslash; --> ø
small u, grave accent               ù    &#249; --> ù &ugrave; --> ù
small u, acute accent               ú    &#250; --> ú &uacute; --> ú
small u, circumflex accent          û    &#251; --> û &ucirc; --> û
small u, dieresis or umlaut mark    ü    &#252; --> ü &uuml; --> ü
small y, acute accent               ý    &#253; --> ý &yacute; --> ý
small thorn, icelandic              þ    &#254; --> þ &thorn; --> þ
small y, dieresis or umlaut mark    ÿ    &#255; --> ÿ &yuml; --> ÿ

&brkbar; and &Dstrok; seem to be unique to HTF.
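
For readers who want to check the mapping between character codes and entity names programmatically, the following is a minimal sketch in Python. It assumes the standard library's `html.entities.codepoint2name` table (which carries the HTML 4 names, so it will not know HTF-specific names such as `&brkbar;`); the helper names `entity_row` and `latin1_table` are hypothetical and exist only for this illustration.

```python
import html
from html.entities import codepoint2name


def entity_row(codepoint):
    """Return (numeric reference, named reference, character) for one code point.

    The named reference is None when the standard table has no name
    for the code point.
    """
    numeric = f"&#{codepoint};"
    name = codepoint2name.get(codepoint)
    named = f"&{name};" if name else None
    return numeric, named, chr(codepoint)


def latin1_table():
    """Build rows for the upper half of ISO Latin-1 (codes 160 to 255)."""
    return [entity_row(cp) for cp in range(160, 256)]


if __name__ == "__main__":
    for numeric, named, char in latin1_table():
        print(f"{numeric:<8} {named or '':<10} {char}")

    # Decoding goes the other way: both forms resolve to the same character.
    print(html.unescape("&#233;") == html.unescape("&eacute;"))
```

Running the module prints one line per code point, which can be compared against the table above.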
The standards stuff:

The HTML 2.0 Standard includes a section on Character Entity Sets and an overview of the HTML Coded Character Set. (The entity names are derived from ISO 8879.) Or have a look at the Latin-1 Character Entities as listed in a draft for the HTML 3.0 specification.

Appendix II of CERN's HTML+ Discussion Document contains a table (in PostScript format) of the proposed character entities for HTML+ and their corresponding character codes for Unicode and the Adobe Latin-1 & Symbol character sets.

Please note that there is nothing wrong with using ISO Latin-1 characters above 127 directly: HTTP/1.0 uses 8-bit ISO Latin-1 as its default encoding. (Thanks to Roman Czyborra for pointing this out!)
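
As a small illustration of the point about characters above 127, the sketch below shows that each such character occupies exactly one byte in ISO Latin-1, and how the same text can be written with numeric references when only 7-bit output is possible. It uses Python's built-in `latin-1` codec and the `xmlcharrefreplace` error handler; the function names are hypothetical.

```python
def to_latin1_bytes(text):
    """Encode text as ISO Latin-1; every character becomes a single byte."""
    return text.encode("latin-1")


def to_ascii_with_references(text):
    """Replace every non-ASCII character with a numeric reference like &#233;."""
    return text.encode("ascii", "xmlcharrefreplace").decode("ascii")


if __name__ == "__main__":
    sample = "caf\u00e9 \u00a35"  # "café £5"
    raw = to_latin1_bytes(sample)
    print(len(sample), len(raw))             # same length: one byte per character
    print(to_ascii_with_references(sample))  # 7-bit safe form of the same text
```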
Other information:
server/ddx/sun/Compose.list.