HTML4.01规定的所有文本实体

文本实体是可以直接书写在HTML中用以显示特殊字符的字符串,例如&表示&。

访问http://www.w3.org/TR/html4/strict.dtd可以在页面中找到3个后缀名为.ent的文件:HTMLlat1.ent,HTMLsymbol.ent和HTMLspecial.ent,分别访问三个文件即可得到相关的文本实体及对应编码(共252个),现整理如下(通过https://www.w3school.com.cn/tags/html_ref_entities.html和[https://www.w3school.com.cn/tags/html_ref_symbols.html也可直观看到各文本实体的效果):

  1. http://www.w3.org/TR/html4/HTMLlat1.ent文件
      nbsp “ ” – no-break space = non-breaking space,U+00A0 ISOnum
    ¡ iexcl “¡” – inverted exclamation mark, U+00A1 ISOnum
    ¢ cent “¢” – cent sign, U+00A2 ISOnum
    £ pound “£” – pound sign, U+00A3 ISOnum
    ¤ curren “¤” – currency sign, U+00A4 ISOnum
    ¥ yen “¥” – yen sign = yuan sign, U+00A5 ISOnum
    ¦ brvbar “¦” – broken bar = broken vertical bar,U+00A6 ISOnum
    § sect “§” – section sign, U+00A7 ISOnum
    ¨ uml “¨” – diaeresis = spacing diaeresis,U+00A8 ISOdia
    © copy “©” – copyright sign, U+00A9 ISOnum
    ª ordf “ª” – feminine ordinal indicator, U+00AA ISOnum
    « laquo “«” – left-pointing double angle quotation mark= left pointing guillemet, U+00AB ISOnum
    ¬ not “¬” – not sign, U+00AC ISOnum
    ­ shy “­” – soft hyphen = discretionary hyphen,U+00AD ISOnum
    ® reg “®” – registered sign = registered trade mark sign,U+00AE ISOnum
    ¯ macr “¯” – macron = spacing macron = overline= APL overbar, U+00AF ISOdia
    ° deg “°” – degree sign, U+00B0 ISOnum
    ± plusmn “±” – plus-minus sign = plus-or-minus sign,U+00B1 ISOnum
    ² sup2 “²” – superscript two = superscript digit two= squared, U+00B2 ISOnum
    ³ sup3 “³” – superscript three = superscript digit three= cubed, U+00B3 ISOnum
    ´ acute “´” – acute accent = spacing acute,U+00B4 ISOdia
    µ micro “µ” – micro sign, U+00B5 ISOnum
    ¶ para “¶” – pilcrow sign = paragraph sign,U+00B6 ISOnum
    · middot “·” – middle dot = Georgian comma= Greek middle dot, U+00B7 ISOnum
    ¸ cedil “¸” – cedilla = spacing cedilla, U+00B8 ISOdia
    ¹ sup1 “¹” – superscript one = superscript digit one,U+00B9 ISOnum
    º ordm “º” – masculine ordinal indicator,U+00BA ISOnum
    » raquo “»” – right-pointing double angle quotation mark= right pointing guillemet, U+00BB ISOnum
    ¼ frac14 “¼” – vulgar fraction one quarter= fraction one quarter, U+00BC ISOnum
    ½ frac12 “½” – vulgar fraction one half= fraction one half, U+00BD ISOnum
    ¾ frac34 “¾” – vulgar fraction three quarters= fraction three quarters, U+00BE ISOnum
    ¿ iquest “¿” – inverted question mark= turned question mark, U+00BF ISOnum
    À Agrave “À” – latin capital letter A with grave= latin capital letter A grave,U+00C0 ISOlat1
    Á Aacute “Á” – latin capital letter A with acute,U+00C1 ISOlat1
     Acirc “” – latin capital letter A with circumflex,U+00C2 ISOlat1
    à Atilde “Ô – latin capital letter A with tilde,U+00C3 ISOlat1
    Ä Auml “Ä” – latin capital letter A with diaeresis,U+00C4 ISOlat1
    Å Aring “Å” – latin capital letter A with ring above= latin capital letter A ring,U+00C5 ISOlat1
    Æ AElig “Æ” – latin capital letter AE= latin capital ligature AE,U+00C6 ISOlat1
    Ç Ccedil “Ç” – latin capital letter C with cedilla,U+00C7 ISOlat1
    È Egrave “È” – latin capital letter E with grave,U+00C8 ISOlat1
    É Eacute “É” – latin capital letter E with acute,U+00C9 ISOlat1
    Ê Ecirc “Ê” – latin capital letter E with circumflex,U+00CA ISOlat1
    Ë Euml “Ë” – latin capital letter E with diaeresis,U+00CB ISOlat1
    Ì Igrave “Ì” – latin capital letter I with grave,U+00CC ISOlat1
    Í Iacute “Í” – latin capital letter I with acute,U+00CD ISOlat1
    Î Icirc “Δ – latin capital letter I with circumflex,U+00CE ISOlat1
    Ï Iuml “Ï” – latin capital letter I with diaeresis,U+00CF ISOlat1
    Ð ETH “Д – latin capital letter ETH, U+00D0 ISOlat1
    Ñ Ntilde “Ñ” – latin capital letter N with tilde,U+00D1 ISOlat1
    Ò Ograve “Ò” – latin capital letter O with grave,U+00D2 ISOlat1
    Ó Oacute “Ó” – latin capital letter O with acute,U+00D3 ISOlat1
    Ô Ocirc “Ô” – latin capital letter O with circumflex,U+00D4 ISOlat1
    Õ Otilde “Õ” – latin capital letter O with tilde,U+00D5 ISOlat1
    Ö Ouml “Ö” – latin capital letter O with diaeresis,U+00D6 ISOlat1
    × times “×” – multiplication sign, U+00D7 ISOnum
    Ø Oslash “Ø” – latin capital letter O with stroke= latin capital letter O slash,U+00D8 ISOlat1
    Ù Ugrave “Ù” – latin capital letter U with grave,U+00D9 ISOlat1
    Ú Uacute “Ú” – latin capital letter U with acute,U+00DA ISOlat1
    Û Ucirc “Û” – latin capital letter U with circumflex,U+00DB ISOlat1
    Ü Uuml “Ü” – latin capital letter U with diaeresis,U+00DC ISOlat1
    Ý Yacute “Ý” – latin capital letter Y with acute,U+00DD ISOlat1
    Þ THORN “Þ” – latin capital letter THORN,U+00DE ISOlat1
    ß szlig “ß” – latin small letter sharp s = ess-zed,U+00DF ISOlat1
    à agrave “à” – latin small letter a with grave= latin small letter a grave,U+00E0 ISOlat1
    á aacute “á” – latin small letter a with acute,U+00E1 ISOlat1
    â acirc “â” – latin small letter a with circumflex,U+00E2 ISOlat1
    ã atilde “ã” – latin small letter a with tilde,U+00E3 ISOlat1
    ä auml “ä” – latin small letter a with diaeresis,U+00E4 ISOlat1
    å aring “å” – latin small letter a with ring above= latin small letter a ring,U+00E5 ISOlat1
    æ aelig “æ” – latin small letter ae= latin small ligature ae, U+00E6 ISOlat1
    ç ccedil “ç” – latin small letter c with cedilla,U+00E7 ISOlat1
    è egrave “è” – latin small letter e with grave,U+00E8 ISOlat1
    é eacute “é” – latin small letter e with acute,U+00E9 ISOlat1
    ê ecirc “ê” – latin small letter e with circumflex,U+00EA ISOlat1
    ë euml “ë” – latin small letter e with diaeresis,U+00EB ISOlat1
    ì igrave “ì” – latin small letter i with grave,U+00EC ISOlat1
    í iacute “í” – latin small letter i with acute,U+00ED ISOlat1
    î icirc “î” – latin small letter i with circumflex,U+00EE ISOlat1
    ï iuml “ï” – latin small letter i with diaeresis,U+00EF ISOlat1
    ð eth “ð” – latin small letter eth, U+00F0 ISOlat1
    ñ ntilde “ñ” – latin small letter n with tilde,U+00F1 ISOlat1
    ò ograve “ò” – latin small letter o with grave,U+00F2 ISOlat1
    ó oacute “ó” – latin small letter o with acute,U+00F3 ISOlat1
    ô ocirc “ô” – latin small letter o with circumflex,U+00F4 ISOlat1
    õ otilde “õ” – latin small letter o with tilde,U+00F5 ISOlat1
    ö ouml “ö” – latin small letter o with diaeresis,U+00F6 ISOlat1
    ÷ divide “÷” – division sign, U+00F7 ISOnum
    ø oslash “ø” – latin small letter o with stroke,= latin small letter o slash,U+00F8 ISOlat1
    ù ugrave “ù” – latin small letter u with grave,U+00F9 ISOlat1
    ú uacute “ú” – latin small letter u with acute,U+00FA ISOlat1
    û ucirc “û” – latin small letter u with circumflex,U+00FB ISOlat1
    ü uuml “ü” – latin small letter u with diaeresis,U+00FC ISOlat1
    ý yacute “ý” – latin small letter y with acute,U+00FD ISOlat1
    þ thorn “þ” – latin small letter thorn,U+00FE ISOlat1
    ÿ yuml “ÿ” – latin small letter y with diaeresis,U+00FF ISOlat1

  2. http://www.w3.org/TR/html4/HTMLsymbol.ent文件

  • Mathematical, Greek and Symbolic characters for HTML

  • Latin Extended-B
    ƒ fnof “ƒ” – latin small f with hook = function= florin, U+0192 ISOtech

  • Greek
    Α Alpha “Α” – greek capital letter alpha, U+0391
    Β Beta “Β” – greek capital letter beta, U+0392
    Γ Gamma “Γ” – greek capital letter gamma,U+0393 ISOgrk3
    Δ Delta “Δ” – greek capital letter delta,U+0394 ISOgrk3
    Ε Epsilon “Ε” – greek capital letter epsilon, U+0395
    Ζ Zeta “Ζ” – greek capital letter zeta, U+0396
    Η Eta “Η” – greek capital letter eta, U+0397
    Θ Theta “Θ” – greek capital letter theta,U+0398 ISOgrk3
    Ι Iota “Ι” – greek capital letter iota, U+0399
    Κ Kappa “Κ” – greek capital letter kappa, U+039A
    Λ Lambda “Λ” – greek capital letter lambda,U+039B ISOgrk3
    Μ Mu “Μ” – greek capital letter mu, U+039C
    Ν Nu “Ν” – greek capital letter nu, U+039D
    Ξ Xi “Ξ” – greek capital letter xi, U+039E ISOgrk3
    Ο Omicron “Ο” – greek capital letter omicron, U+039F
    Π Pi “Π” – greek capital letter pi, U+03A0 ISOgrk3
    Ρ Rho “Ρ” – greek capital letter rho, U+03A1

  • there is no Sigmaf, and no U+03A2 character either
    Σ Sigma “Σ” – greek capital letter sigma,U+03A3 ISOgrk3
    Τ Tau “Τ” – greek capital letter tau, U+03A4
    Υ Upsilon “Υ” – greek capital letter upsilon,U+03A5 ISOgrk3
    Φ Phi “Φ” – greek capital letter phi,U+03A6 ISOgrk3
    Χ Chi “Χ” – greek capital letter chi, U+03A7
    Ψ Psi “Ψ” – greek capital letter psi,U+03A8 ISOgrk3
    Ω Omega “Ω” – greek capital letter omega,U+03A9 ISOgrk3
    α alpha “α” – greek small letter alpha,U+03B1 ISOgrk3
    β beta “β” – greek small letter beta, U+03B2 ISOgrk3
    γ gamma “γ” – greek small letter gamma,U+03B3 ISOgrk3
    δ delta “δ” – greek small letter delta,U+03B4 ISOgrk3
    ε epsilon “ε” – greek small letter epsilon,U+03B5 ISOgrk3
    ζ zeta “ζ” – greek small letter zeta, U+03B6 ISOgrk3
    η eta “η” – greek small letter eta, U+03B7 ISOgrk3
    θ theta “θ” – greek small letter theta,U+03B8 ISOgrk3
    ι iota “ι” – greek small letter iota, U+03B9 ISOgrk3
    κ kappa “κ” – greek small letter kappa,U+03BA ISOgrk3
    λ lambda “λ” – greek small letter lambda,U+03BB ISOgrk3
    μ mu “μ” – greek small letter mu, U+03BC ISOgrk3
    ν nu “ν” – greek small letter nu, U+03BD ISOgrk3
    ξ xi “ξ” – greek small letter xi, U+03BE ISOgrk3
    ο omicron “ο” – greek small letter omicron, U+03BF NEW
    π pi “π” – greek small letter pi, U+03C0 ISOgrk3
    ρ rho “ρ” – greek small letter rho, U+03C1 ISOgrk3
    ς sigmaf “ς” – greek small letter final sigma,U+03C2 ISOgrk3
    σ sigma “σ” – greek small letter sigma,U+03C3 ISOgrk3
    τ tau “τ” – greek small letter tau, U+03C4 ISOgrk3
    υ upsilon “υ” – greek small letter upsilon,U+03C5 ISOgrk3
    φ phi “φ” – greek small letter phi, U+03C6 ISOgrk3
    χ chi “χ” – greek small letter chi, U+03C7 ISOgrk3
    ψ psi “ψ” – greek small letter psi, U+03C8 ISOgrk3
    ω omega “ω” – greek small letter omega,U+03C9 ISOgrk3
    ϑ thetasym “ϑ” – greek small letter theta symbol,U+03D1 NEW
    ϒ upsih “ϒ” – greek upsilon with hook symbol,U+03D2 NEW
    ϖ piv “ϖ” – greek pi symbol, U+03D6 ISOgrk3

  • GeneralPunctuation
    • bull “•” – bullet = black small circle,U+2022 ISOpub

  • bullet is NOT the same as bullet operator, U+2219
    … hellip “…” – horizontal ellipsis = three dot leader,U+2026 ISOpub
    ′ prime “′” – prime = minutes = feet, U+2032 ISOtech
    ″ Prime “″” – double prime = seconds = inches,U+2033 ISOtech
    ‾ oline “‾” – overline = spacing overscore,U+203E NEW
    ⁄ frasl “⁄” – fraction slash, U+2044 NEW

  • Letterlke Symbols
    ℘ weierp “℘” – script capital P = power set= Weierstrass p, U+2118 ISOamso
    ℑ image “ℑ” – blackletter capital I = imaginary part,U+2111 ISOamso
    ℜ real “ℜ” – blackletter capital R = real part symbol,U+211C ISOamso
    ™ trade “™” – trade mark sign, U+2122 ISOnum
    ℵ alefsym “ℵ” – alef symbol = first transfinite cardinal,U+2135 NEW

  • Arrows
    ← larr “←” – leftwards arrow, U+2190 ISOnum
    ↑ uarr “↑” – upwards arrow, U+2191 ISOnum
    → rarr “→” – rightwards arrow, U+2192 ISOnum
    ↓ darr “↓” – downwards arrow, U+2193 ISOnum
    ↔ harr “↔” – left right arrow, U+2194 ISOamsa
    ↵ crarr “↵” – downwards arrow with corner leftwards= carriage return, U+21B5 NEW
    ⇐ lArr “⇐” – leftwards double arrow, U+21D0 ISOtech
    ⇑ uArr “⇑” – upwards double arrow, U+21D1 ISOamsa
    ⇒ rArr “⇒” – rightwards double arrow,U+21D2 ISOtech
    ⇓ dArr “⇓” – downwards double arrow, U+21D3 ISOamsa
    ⇔ hArr “⇔” – left right double arrow,U+21D4 ISOamsa

  • Mathemaical Operators
    ∀ forall “∀” – for all, U+2200 ISOtech
    ∂ part “∂” – partial differential, U+2202 ISOtech
    ∃ exist “∃” – there exists, U+2203 ISOtech
    ∅ empty “∅” – empty set = null set = diameter,U+2205 ISOamso
    ∇ nabla “∇” – nabla = backward difference,U+2207 ISOtech
    ∈ isin “∈” – element of, U+2208 ISOtech
    ∉ notin “∉” – not an element of, U+2209 ISOtech
    ∋ ni “∋” – contains as member, U+220B ISOtech
    ∏ prod “∏” – n-ary product = product sign,U+220F ISOamsb
    ∑ sum “∑” – n-ary sumation, U+2211 ISOamsb
    − minus “−” – minus sign, U+2212 ISOtech
    ∗ lowast “∗” – asterisk operator, U+2217 ISOtech
    √ radic “√” – square root = radical sign,U+221A ISOtech
    ∝ prop “∝” – proportional to, U+221D ISOtech
    ∞ infin “∞” – infinity, U+221E ISOtech
    ∠ ang “∠” – angle, U+2220 ISOamso
    ∧ and “∧” – logical and = wedge, U+2227 ISOtech
    ∨ or “∨” – logical or = vee, U+2228 ISOtech
    ∩ cap “∩” – intersection = cap, U+2229 ISOtech
    ∪ cup “∪” – union = cup, U+222A ISOtech
    ∫ int “∫” – integral, U+222B ISOtech
    ∴ there4 “∴” – therefore, U+2234 ISOtech
    ∼ sim “∼” – tilde operator = varies with = similar to,U+223C ISOtech
    ≅ cong “≅” – approximately equal to, U+2245 ISOtech
    ≈ asymp “≈” – almost equal to = asymptotic to,U+2248 ISOamsr
    ≠ ne “≠” – not equal to, U+2260 ISOtech
    ≡ equiv “≡” – identical to, U+2261 ISOtech
    ≤ le “≤” – less-than or equal to, U+2264 ISOtech
    ≥ ge “≥” – greater-than or equal to,U+2265 ISOtech
    ⊂ sub “⊂” – subset of, U+2282 ISOtech
    ⊃ sup “⊃” – superset of, U+2283 ISOtech
    ⊄ nsub “⊄” – not a subset of, U+2284 ISOamsn
    ⊆ sube “⊆” – subset of or equal to, U+2286 ISOtech
    ⊇ supe “⊇” – superset of or equal to,U+2287 ISOtech
    ⊕ oplus “⊕” – circled plus = direct sum,U+2295 ISOamsb
    ⊗ otimes “⊗” – circled times = vector product,U+2297 ISOamsb
    ⊥ perp “⊥” – up tack = orthogonal to = perpendicular,U+22A5 ISOtech
    ⋅ sdot “⋅” – dot operator, U+22C5 ISOamsb

  • Miscellneous Technical
    ⌈ lceil “⌈” – left ceiling = apl upstile,U+2308 ISOamsc
    ⌉ rceil “⌉” – right ceiling, U+2309 ISOamsc
    ⌊ lfloor “⌊” – left floor = apl downstile,U+230A ISOamsc
    ⌋ rfloor “⌋” – right floor, U+230B ISOamsc
    〈 lang “〈” – left-pointing angle bracket = bra,U+2329 ISOtech
    〉 rang “〉” – right-pointing angle bracket = ket,U+232A ISOtech

  • Geometrc Shapes
    ◊ loz “◊” – lozenge, U+25CA ISOpub

  • Miscellneous Symbols
    ♠ spades “♠” – black spade suit, U+2660 ISOpub

  • black hre seems to mean filled as opposed to hollow
    ♣ clubs “♣” – black club suit = shamrock,U+2663 ISOpub
    ♥ hearts “♥” – black heart suit = valentine,U+2665 ISOpub
    ♦ diams “♦” – black diamond suit, U+2666 ISOpub

  1. http://www.w3.org/TR/html4/HTMLspecial.ent文件
  • Special characters for HTML

  • C0 Controls and Basic Latin
    " quot “"” – quotation mark = APL quote,U+0022 ISOnum
    & amp “&” – ampersand, U+0026 ISOnum
    < lt “&#60;” – less-than sign, U+003C ISOnum
    > gt “&#62;” – greater-than sign, U+003E ISOnum

  • Latin Extended-A
    Œ OElig “&#338;” – latin capital ligature OE,U+0152 ISOlat2
    œ oelig “&#339;” – latin small ligature oe, U+0153 ISOlat2

  • ligature is a misnomer, this is a separate character in some languages
    Š Scaron “&#352;” – latin capital letter S with caron,U+0160 ISOlat2
    š scaron “&#353;” – latin small letter s with caron,U+0161 ISOlat2
    Ÿ Yuml “&#376;” – latin capital letter Y with diaeresis,U+0178 ISOlat2

  • Spacing Modifier Letters
    ˆ circ “&#710;” – modifier letter circumflex accent,U+02C6 ISOpub
    ˜ tilde “&#732;” – small tilde, U+02DC ISOdia

  • General Punctuation
      ensp “&#8194;” – en space, U+2002 ISOpub
      emsp “&#8195;” – em space, U+2003 ISOpub
      thinsp “&#8201;” – thin space, U+2009 ISOpub
    ‌ zwnj “&#8204;” – zero width non-joiner,U+200C NEW RFC 2070
    ‍ zwj “&#8205;” – zero width joiner, U+200D NEW RFC 2070
    ‎ lrm “&#8206;” – left-to-right mark, U+200E NEW RFC 2070
    ‏ rlm “&#8207;” – right-to-left mark, U+200F NEW RFC 2070
    – ndash “&#8211;” – en dash, U+2013 ISOpub
    — mdash “&#8212;” – em dash, U+2014 ISOpub
    ‘ lsquo “&#8216;” – left single quotation mark,U+2018 ISOnum
    ’ rsquo “&#8217;” – right single quotation mark,U+2019 ISOnum
    ‚ sbquo “&#8218;” – single low-9 quotation mark, U+201A NEW
    “ ldquo “&#8220;” – left double quotation mark,U+201C ISOnum
    ” rdquo “&#8221;” – right double quotation mark,U+201D ISOnum
    „ bdquo “&#8222;” – double low-9 quotation mark, U+201E NEW
    † dagger “&#8224;” – dagger, U+2020 ISOpub
    ‡ Dagger “&#8225;” – double dagger, U+2021 ISOpub
    ‰ permil “&#8240;” – per mille sign, U+2030 ISOtech
    ‹ lsaquo “&#8249;” – single left-pointing angle quotation mark,U+2039 ISO proposed

  • lsaquo is proposed but not yet ISO standardized
    › rsaquo “&#8250;” – single right-pointing angle quotation mark,U+203A ISO proposed

  • rsaquo is proposed but not yet ISO standardized
    € euro “&#8364;” – euro sign, U+20AC NEW

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值