[ Avaa Bypassed ]




Upload:

Command:

hmhc3928@18.119.213.78: ~ $

<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN"
  "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">


<html xmlns="http://www.w3.org/1999/xhtml">
  <head>
    <meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
    
    <title>19.2. sgmllib — Simple SGML parser &mdash; Python 2.7.5 documentation</title>
    
    <link rel="stylesheet" href="../_static/default.css" type="text/css" />
    <link rel="stylesheet" href="../_static/pygments.css" type="text/css" />
    
    <script type="text/javascript">
      var DOCUMENTATION_OPTIONS = {
        URL_ROOT:    '../',
        VERSION:     '2.7.5',
        COLLAPSE_INDEX: false,
        FILE_SUFFIX: '.html',
        HAS_SOURCE:  true
      };
    </script>
    <script type="text/javascript" src="../_static/jquery.js"></script>
    <script type="text/javascript" src="../_static/underscore.js"></script>
    <script type="text/javascript" src="../_static/doctools.js"></script>
    <script type="text/javascript" src="../_static/sidebar.js"></script>
    <link rel="search" type="application/opensearchdescription+xml"
          title="Search within Python 2.7.5 documentation"
          href="../_static/opensearch.xml"/>
    <link rel="author" title="About these documents" href="../about.html" />
    <link rel="copyright" title="Copyright" href="../copyright.html" />
    <link rel="top" title="Python 2.7.5 documentation" href="../index.html" />
    <link rel="up" title="19. Structured Markup Processing Tools" href="markup.html" />
    <link rel="next" title="19.3. htmllib — A parser for HTML documents" href="htmllib.html" />
    <link rel="prev" title="19.1. HTMLParser — Simple HTML and XHTML parser" href="htmlparser.html" />
    <link rel="shortcut icon" type="image/png" href="../_static/py.png" />
    <script type="text/javascript" src="../_static/copybutton.js"></script>
    
 

  </head>
  <body>
    <div class="related">
      <h3>Navigation</h3>
      <ul>
        <li class="right" style="margin-right: 10px">
          <a href="../genindex.html" title="General Index"
             accesskey="I">index</a></li>
        <li class="right" >
          <a href="../py-modindex.html" title="Python Module Index"
             >modules</a> |</li>
        <li class="right" >
          <a href="htmllib.html" title="19.3. htmllib — A parser for HTML documents"
             accesskey="N">next</a> |</li>
        <li class="right" >
          <a href="htmlparser.html" title="19.1. HTMLParser — Simple HTML and XHTML parser"
             accesskey="P">previous</a> |</li>
        <li><img src="../_static/py.png" alt=""
                 style="vertical-align: middle; margin-top: -1px"/></li>
        <li><a href="http://www.python.org/">Python</a> &raquo;</li>
        <li>
          <a href="../index.html">Python 2.7.5 documentation</a> &raquo;
        </li>

          <li><a href="index.html" >The Python Standard Library</a> &raquo;</li>
          <li><a href="markup.html" accesskey="U">19. Structured Markup Processing Tools</a> &raquo;</li> 
      </ul>
    </div>  

    <div class="document">
      <div class="documentwrapper">
        <div class="bodywrapper">
          <div class="body">
            
  <div class="section" id="module-sgmllib">
<span id="sgmllib-simple-sgml-parser"></span><h1>19.2. <a class="reference internal" href="#module-sgmllib" title="sgmllib: Only as much of an SGML parser as needed to parse HTML. (deprecated)"><tt class="xref py py-mod docutils literal"><span class="pre">sgmllib</span></tt></a> &#8212; Simple SGML parser<a class="headerlink" href="#module-sgmllib" title="Permalink to this headline">¶</a></h1>
<p class="deprecated">
<span class="versionmodified">Deprecated since version 2.6: </span>The <a class="reference internal" href="#module-sgmllib" title="sgmllib: Only as much of an SGML parser as needed to parse HTML. (deprecated)"><tt class="xref py py-mod docutils literal"><span class="pre">sgmllib</span></tt></a> module has been removed in Python 3.</p>
<p id="index-0">This module defines a class <a class="reference internal" href="#sgmllib.SGMLParser" title="sgmllib.SGMLParser"><tt class="xref py py-class docutils literal"><span class="pre">SGMLParser</span></tt></a> which serves as the basis for
parsing text files formatted in SGML (Standard Generalized Mark-up Language).
In fact, it does not provide a full SGML parser &#8212; it only parses SGML insofar
as it is used by HTML, and the module only exists as a base for the
<a class="reference internal" href="htmllib.html#module-htmllib" title="htmllib: A parser for HTML documents. (deprecated)"><tt class="xref py py-mod docutils literal"><span class="pre">htmllib</span></tt></a> module.  Another HTML parser which supports XHTML and offers a
somewhat different interface is available in the <a class="reference internal" href="htmlparser.html#module-HTMLParser" title="HTMLParser: A simple parser that can handle HTML and XHTML."><tt class="xref py py-mod docutils literal"><span class="pre">HTMLParser</span></tt></a> module.</p>
<dl class="class">
<dt id="sgmllib.SGMLParser">
<em class="property">class </em><tt class="descclassname">sgmllib.</tt><tt class="descname">SGMLParser</tt><a class="headerlink" href="#sgmllib.SGMLParser" title="Permalink to this definition">¶</a></dt>
<dd><p>The <a class="reference internal" href="#sgmllib.SGMLParser" title="sgmllib.SGMLParser"><tt class="xref py py-class docutils literal"><span class="pre">SGMLParser</span></tt></a> class is instantiated without arguments. The parser is
hardcoded to recognize the following constructs:</p>
<ul class="simple">
<li>Opening and closing tags of the form <tt class="docutils literal"><span class="pre">&lt;tag</span> <span class="pre">attr=&quot;value&quot;</span> <span class="pre">...&gt;</span></tt> and
<tt class="docutils literal"><span class="pre">&lt;/tag&gt;</span></tt>, respectively.</li>
<li>Numeric character references of the form <tt class="docutils literal"><span class="pre">&amp;#name;</span></tt>.</li>
<li>Entity references of the form <tt class="docutils literal"><span class="pre">&amp;name;</span></tt>.</li>
<li>SGML comments of the form <tt class="docutils literal"><span class="pre">&lt;!--text--&gt;</span></tt>.  Note that spaces, tabs, and
newlines are allowed between the trailing <tt class="docutils literal"><span class="pre">&gt;</span></tt> and the immediately preceding
<tt class="docutils literal"><span class="pre">--</span></tt>.</li>
</ul>
</dd></dl>

<p>A single exception is defined as well:</p>
<dl class="exception">
<dt id="sgmllib.SGMLParseError">
<em class="property">exception </em><tt class="descclassname">sgmllib.</tt><tt class="descname">SGMLParseError</tt><a class="headerlink" href="#sgmllib.SGMLParseError" title="Permalink to this definition">¶</a></dt>
<dd><p>Exception raised by the <a class="reference internal" href="#sgmllib.SGMLParser" title="sgmllib.SGMLParser"><tt class="xref py py-class docutils literal"><span class="pre">SGMLParser</span></tt></a> class when it encounters an error
while parsing.</p>
<p class="versionadded">
<span class="versionmodified">New in version 2.1.</span></p>
</dd></dl>

<p><a class="reference internal" href="#sgmllib.SGMLParser" title="sgmllib.SGMLParser"><tt class="xref py py-class docutils literal"><span class="pre">SGMLParser</span></tt></a> instances have the following methods:</p>
<dl class="method">
<dt id="sgmllib.SGMLParser.reset">
<tt class="descclassname">SGMLParser.</tt><tt class="descname">reset</tt><big>(</big><big>)</big><a class="headerlink" href="#sgmllib.SGMLParser.reset" title="Permalink to this definition">¶</a></dt>
<dd><p>Reset the instance.  Loses all unprocessed data.  This is called implicitly at
instantiation time.</p>
</dd></dl>

<dl class="method">
<dt id="sgmllib.SGMLParser.setnomoretags">
<tt class="descclassname">SGMLParser.</tt><tt class="descname">setnomoretags</tt><big>(</big><big>)</big><a class="headerlink" href="#sgmllib.SGMLParser.setnomoretags" title="Permalink to this definition">¶</a></dt>
<dd><p>Stop processing tags.  Treat all following input as literal input (CDATA).
(This is only provided so the HTML tag <tt class="docutils literal"><span class="pre">&lt;PLAINTEXT&gt;</span></tt> can be implemented.)</p>
</dd></dl>

<dl class="method">
<dt id="sgmllib.SGMLParser.setliteral">
<tt class="descclassname">SGMLParser.</tt><tt class="descname">setliteral</tt><big>(</big><big>)</big><a class="headerlink" href="#sgmllib.SGMLParser.setliteral" title="Permalink to this definition">¶</a></dt>
<dd><p>Enter literal mode (CDATA mode).</p>
</dd></dl>

<dl class="method">
<dt id="sgmllib.SGMLParser.feed">
<tt class="descclassname">SGMLParser.</tt><tt class="descname">feed</tt><big>(</big><em>data</em><big>)</big><a class="headerlink" href="#sgmllib.SGMLParser.feed" title="Permalink to this definition">¶</a></dt>
<dd><p>Feed some text to the parser.  It is processed insofar as it consists of
complete elements; incomplete data is buffered until more data is fed or
<a class="reference internal" href="#sgmllib.SGMLParser.close" title="sgmllib.SGMLParser.close"><tt class="xref py py-meth docutils literal"><span class="pre">close()</span></tt></a> is called.</p>
</dd></dl>

<dl class="method">
<dt id="sgmllib.SGMLParser.close">
<tt class="descclassname">SGMLParser.</tt><tt class="descname">close</tt><big>(</big><big>)</big><a class="headerlink" href="#sgmllib.SGMLParser.close" title="Permalink to this definition">¶</a></dt>
<dd><p>Force processing of all buffered data as if it were followed by an end-of-file
mark.  This method may be redefined by a derived class to define additional
processing at the end of the input, but the redefined version should always call
<a class="reference internal" href="#sgmllib.SGMLParser.close" title="sgmllib.SGMLParser.close"><tt class="xref py py-meth docutils literal"><span class="pre">close()</span></tt></a>.</p>
</dd></dl>

<dl class="method">
<dt id="sgmllib.SGMLParser.get_starttag_text">
<tt class="descclassname">SGMLParser.</tt><tt class="descname">get_starttag_text</tt><big>(</big><big>)</big><a class="headerlink" href="#sgmllib.SGMLParser.get_starttag_text" title="Permalink to this definition">¶</a></dt>
<dd><p>Return the text of the most recently opened start tag.  This should not normally
be needed for structured processing, but may be useful in dealing with HTML &#8220;as
deployed&#8221; or for re-generating input with minimal changes (whitespace between
attributes can be preserved, etc.).</p>
</dd></dl>

<dl class="method">
<dt id="sgmllib.SGMLParser.handle_starttag">
<tt class="descclassname">SGMLParser.</tt><tt class="descname">handle_starttag</tt><big>(</big><em>tag</em>, <em>method</em>, <em>attributes</em><big>)</big><a class="headerlink" href="#sgmllib.SGMLParser.handle_starttag" title="Permalink to this definition">¶</a></dt>
<dd><p>This method is called to handle start tags for which either a <tt class="xref py py-meth docutils literal"><span class="pre">start_tag()</span></tt>
or <tt class="xref py py-meth docutils literal"><span class="pre">do_tag()</span></tt> method has been defined.  The <em>tag</em> argument is the name of
the tag converted to lower case, and the <em>method</em> argument is the bound method
which should be used to support semantic interpretation of the start tag. The
<em>attributes</em> argument is a list of <tt class="docutils literal"><span class="pre">(name,</span> <span class="pre">value)</span></tt> pairs containing the
attributes found inside the tag&#8217;s <tt class="docutils literal"><span class="pre">&lt;&gt;</span></tt> brackets.</p>
<p>The <em>name</em> has been translated to lower case. Double quotes and backslashes in
the <em>value</em> have been interpreted, as well as known character references and
known entity references terminated by a semicolon (normally, entity references
can be terminated by any non-alphanumerical character, but this would break the
very common case of <tt class="docutils literal"><span class="pre">&lt;A</span> <span class="pre">HREF=&quot;url?spam=1&amp;eggs=2&quot;&gt;</span></tt> when <tt class="docutils literal"><span class="pre">eggs</span></tt> is a valid
entity name).</p>
<p>For instance, for the tag <tt class="docutils literal"><span class="pre">&lt;A</span> <span class="pre">HREF=&quot;http://www.cwi.nl/&quot;&gt;</span></tt>, this method would
be called as <tt class="docutils literal"><span class="pre">unknown_starttag('a',</span> <span class="pre">[('href',</span> <span class="pre">'http://www.cwi.nl/')])</span></tt>.  The
base implementation simply calls <em>method</em> with <em>attributes</em> as the only
argument.</p>
<p class="versionadded">
<span class="versionmodified">New in version 2.5: </span>Handling of entity and character references within attribute values.</p>
</dd></dl>

<dl class="method">
<dt id="sgmllib.SGMLParser.handle_endtag">
<tt class="descclassname">SGMLParser.</tt><tt class="descname">handle_endtag</tt><big>(</big><em>tag</em>, <em>method</em><big>)</big><a class="headerlink" href="#sgmllib.SGMLParser.handle_endtag" title="Permalink to this definition">¶</a></dt>
<dd><p>This method is called to handle endtags for which an <tt class="xref py py-meth docutils literal"><span class="pre">end_tag()</span></tt> method has
been defined.  The <em>tag</em> argument is the name of the tag converted to lower
case, and the <em>method</em> argument is the bound method which should be used to
support semantic interpretation of the end tag.  If no <tt class="xref py py-meth docutils literal"><span class="pre">end_tag()</span></tt> method is
defined for the closing element, this handler is not called.  The base
implementation simply calls <em>method</em>.</p>
</dd></dl>

<dl class="method">
<dt id="sgmllib.SGMLParser.handle_data">
<tt class="descclassname">SGMLParser.</tt><tt class="descname">handle_data</tt><big>(</big><em>data</em><big>)</big><a class="headerlink" href="#sgmllib.SGMLParser.handle_data" title="Permalink to this definition">¶</a></dt>
<dd><p>This method is called to process arbitrary data.  It is intended to be
overridden by a derived class; the base class implementation does nothing.</p>
</dd></dl>

<dl class="method">
<dt id="sgmllib.SGMLParser.handle_charref">
<tt class="descclassname">SGMLParser.</tt><tt class="descname">handle_charref</tt><big>(</big><em>ref</em><big>)</big><a class="headerlink" href="#sgmllib.SGMLParser.handle_charref" title="Permalink to this definition">¶</a></dt>
<dd><p>This method is called to process a character reference of the form <tt class="docutils literal"><span class="pre">&amp;#ref;</span></tt>.
The base implementation uses <a class="reference internal" href="#sgmllib.SGMLParser.convert_charref" title="sgmllib.SGMLParser.convert_charref"><tt class="xref py py-meth docutils literal"><span class="pre">convert_charref()</span></tt></a> to convert the reference to
a string.  If that method returns a string, it is passed to <a class="reference internal" href="#sgmllib.SGMLParser.handle_data" title="sgmllib.SGMLParser.handle_data"><tt class="xref py py-meth docutils literal"><span class="pre">handle_data()</span></tt></a>,
otherwise <tt class="docutils literal"><span class="pre">unknown_charref(ref)</span></tt> is called to handle the error.</p>
<p class="versionchanged">
<span class="versionmodified">Changed in version 2.5: </span>Use <a class="reference internal" href="#sgmllib.SGMLParser.convert_charref" title="sgmllib.SGMLParser.convert_charref"><tt class="xref py py-meth docutils literal"><span class="pre">convert_charref()</span></tt></a> instead of hard-coding the conversion.</p>
</dd></dl>

<dl class="method">
<dt id="sgmllib.SGMLParser.convert_charref">
<tt class="descclassname">SGMLParser.</tt><tt class="descname">convert_charref</tt><big>(</big><em>ref</em><big>)</big><a class="headerlink" href="#sgmllib.SGMLParser.convert_charref" title="Permalink to this definition">¶</a></dt>
<dd><p>Convert a character reference to a string, or <tt class="docutils literal"><span class="pre">None</span></tt>.  <em>ref</em> is the reference
passed in as a string.  In the base implementation, <em>ref</em> must be a decimal
number in the range 0-255.  It converts the code point found using the
<a class="reference internal" href="#sgmllib.SGMLParser.convert_codepoint" title="sgmllib.SGMLParser.convert_codepoint"><tt class="xref py py-meth docutils literal"><span class="pre">convert_codepoint()</span></tt></a> method. If <em>ref</em> is invalid or out of range, this
method returns <tt class="docutils literal"><span class="pre">None</span></tt>.  This method is called by the default
<a class="reference internal" href="#sgmllib.SGMLParser.handle_charref" title="sgmllib.SGMLParser.handle_charref"><tt class="xref py py-meth docutils literal"><span class="pre">handle_charref()</span></tt></a> implementation and by the attribute value parser.</p>
<p class="versionadded">
<span class="versionmodified">New in version 2.5.</span></p>
</dd></dl>

<dl class="method">
<dt id="sgmllib.SGMLParser.convert_codepoint">
<tt class="descclassname">SGMLParser.</tt><tt class="descname">convert_codepoint</tt><big>(</big><em>codepoint</em><big>)</big><a class="headerlink" href="#sgmllib.SGMLParser.convert_codepoint" title="Permalink to this definition">¶</a></dt>
<dd><p>Convert a codepoint to a <a class="reference internal" href="functions.html#str" title="str"><tt class="xref py py-class docutils literal"><span class="pre">str</span></tt></a> value.  Encodings can be handled here if
appropriate, though the rest of <a class="reference internal" href="#module-sgmllib" title="sgmllib: Only as much of an SGML parser as needed to parse HTML. (deprecated)"><tt class="xref py py-mod docutils literal"><span class="pre">sgmllib</span></tt></a> is oblivious on this matter.</p>
<p class="versionadded">
<span class="versionmodified">New in version 2.5.</span></p>
</dd></dl>

<dl class="method">
<dt id="sgmllib.SGMLParser.handle_entityref">
<tt class="descclassname">SGMLParser.</tt><tt class="descname">handle_entityref</tt><big>(</big><em>ref</em><big>)</big><a class="headerlink" href="#sgmllib.SGMLParser.handle_entityref" title="Permalink to this definition">¶</a></dt>
<dd><p>This method is called to process a general entity reference of the form
<tt class="docutils literal"><span class="pre">&amp;ref;</span></tt> where <em>ref</em> is an general entity reference.  It converts <em>ref</em> by
passing it to <a class="reference internal" href="#sgmllib.SGMLParser.convert_entityref" title="sgmllib.SGMLParser.convert_entityref"><tt class="xref py py-meth docutils literal"><span class="pre">convert_entityref()</span></tt></a>.  If a translation is returned, it calls
the method <a class="reference internal" href="#sgmllib.SGMLParser.handle_data" title="sgmllib.SGMLParser.handle_data"><tt class="xref py py-meth docutils literal"><span class="pre">handle_data()</span></tt></a> with the translation; otherwise, it calls the
method <tt class="docutils literal"><span class="pre">unknown_entityref(ref)</span></tt>. The default <tt class="xref py py-attr docutils literal"><span class="pre">entitydefs</span></tt> defines
translations for <tt class="docutils literal"><span class="pre">&amp;amp;</span></tt>, <tt class="docutils literal"><span class="pre">&amp;apos</span></tt>, <tt class="docutils literal"><span class="pre">&amp;gt;</span></tt>, <tt class="docutils literal"><span class="pre">&amp;lt;</span></tt>, and <tt class="docutils literal"><span class="pre">&amp;quot;</span></tt>.</p>
<p class="versionchanged">
<span class="versionmodified">Changed in version 2.5: </span>Use <a class="reference internal" href="#sgmllib.SGMLParser.convert_entityref" title="sgmllib.SGMLParser.convert_entityref"><tt class="xref py py-meth docutils literal"><span class="pre">convert_entityref()</span></tt></a> instead of hard-coding the conversion.</p>
</dd></dl>

<dl class="method">
<dt id="sgmllib.SGMLParser.convert_entityref">
<tt class="descclassname">SGMLParser.</tt><tt class="descname">convert_entityref</tt><big>(</big><em>ref</em><big>)</big><a class="headerlink" href="#sgmllib.SGMLParser.convert_entityref" title="Permalink to this definition">¶</a></dt>
<dd><p>Convert a named entity reference to a <a class="reference internal" href="functions.html#str" title="str"><tt class="xref py py-class docutils literal"><span class="pre">str</span></tt></a> value, or <tt class="docutils literal"><span class="pre">None</span></tt>.  The
resulting value will not be parsed.  <em>ref</em> will be only the name of the entity.
The default implementation looks for <em>ref</em> in the instance (or class) variable
<tt class="xref py py-attr docutils literal"><span class="pre">entitydefs</span></tt> which should be a mapping from entity names to corresponding
translations.  If no translation is available for <em>ref</em>, this method returns
<tt class="docutils literal"><span class="pre">None</span></tt>.  This method is called by the default <a class="reference internal" href="#sgmllib.SGMLParser.handle_entityref" title="sgmllib.SGMLParser.handle_entityref"><tt class="xref py py-meth docutils literal"><span class="pre">handle_entityref()</span></tt></a>
implementation and by the attribute value parser.</p>
<p class="versionadded">
<span class="versionmodified">New in version 2.5.</span></p>
</dd></dl>

<dl class="method">
<dt id="sgmllib.SGMLParser.handle_comment">
<tt class="descclassname">SGMLParser.</tt><tt class="descname">handle_comment</tt><big>(</big><em>comment</em><big>)</big><a class="headerlink" href="#sgmllib.SGMLParser.handle_comment" title="Permalink to this definition">¶</a></dt>
<dd><p>This method is called when a comment is encountered.  The <em>comment</em> argument is
a string containing the text between the <tt class="docutils literal"><span class="pre">&lt;!--</span></tt> and <tt class="docutils literal"><span class="pre">--&gt;</span></tt> delimiters, but
not the delimiters themselves.  For example, the comment <tt class="docutils literal"><span class="pre">&lt;!--text--&gt;</span></tt> will
cause this method to be called with the argument <tt class="docutils literal"><span class="pre">'text'</span></tt>.  The default method
does nothing.</p>
</dd></dl>

<dl class="method">
<dt id="sgmllib.SGMLParser.handle_decl">
<tt class="descclassname">SGMLParser.</tt><tt class="descname">handle_decl</tt><big>(</big><em>data</em><big>)</big><a class="headerlink" href="#sgmllib.SGMLParser.handle_decl" title="Permalink to this definition">¶</a></dt>
<dd><p>Method called when an SGML declaration is read by the parser.  In practice, the
<tt class="docutils literal"><span class="pre">DOCTYPE</span></tt> declaration is the only thing observed in HTML, but the parser does
not discriminate among different (or broken) declarations.  Internal subsets in
a <tt class="docutils literal"><span class="pre">DOCTYPE</span></tt> declaration are not supported.  The <em>data</em> parameter will be the
entire contents of the declaration inside the <tt class="docutils literal"><span class="pre">&lt;!</span></tt>...<tt class="docutils literal"><span class="pre">&gt;</span></tt> markup.  The
default implementation does nothing.</p>
</dd></dl>

<dl class="method">
<dt id="sgmllib.SGMLParser.report_unbalanced">
<tt class="descclassname">SGMLParser.</tt><tt class="descname">report_unbalanced</tt><big>(</big><em>tag</em><big>)</big><a class="headerlink" href="#sgmllib.SGMLParser.report_unbalanced" title="Permalink to this definition">¶</a></dt>
<dd><p>This method is called when an end tag is found which does not correspond to any
open element.</p>
</dd></dl>

<dl class="method">
<dt id="sgmllib.SGMLParser.unknown_starttag">
<tt class="descclassname">SGMLParser.</tt><tt class="descname">unknown_starttag</tt><big>(</big><em>tag</em>, <em>attributes</em><big>)</big><a class="headerlink" href="#sgmllib.SGMLParser.unknown_starttag" title="Permalink to this definition">¶</a></dt>
<dd><p>This method is called to process an unknown start tag.  It is intended to be
overridden by a derived class; the base class implementation does nothing.</p>
</dd></dl>

<dl class="method">
<dt id="sgmllib.SGMLParser.unknown_endtag">
<tt class="descclassname">SGMLParser.</tt><tt class="descname">unknown_endtag</tt><big>(</big><em>tag</em><big>)</big><a class="headerlink" href="#sgmllib.SGMLParser.unknown_endtag" title="Permalink to this definition">¶</a></dt>
<dd><p>This method is called to process an unknown end tag.  It is intended to be
overridden by a derived class; the base class implementation does nothing.</p>
</dd></dl>

<dl class="method">
<dt id="sgmllib.SGMLParser.unknown_charref">
<tt class="descclassname">SGMLParser.</tt><tt class="descname">unknown_charref</tt><big>(</big><em>ref</em><big>)</big><a class="headerlink" href="#sgmllib.SGMLParser.unknown_charref" title="Permalink to this definition">¶</a></dt>
<dd><p>This method is called to process unresolvable numeric character references.
Refer to <a class="reference internal" href="#sgmllib.SGMLParser.handle_charref" title="sgmllib.SGMLParser.handle_charref"><tt class="xref py py-meth docutils literal"><span class="pre">handle_charref()</span></tt></a> to determine what is handled by default.  It is
intended to be overridden by a derived class; the base class implementation does
nothing.</p>
</dd></dl>

<dl class="method">
<dt id="sgmllib.SGMLParser.unknown_entityref">
<tt class="descclassname">SGMLParser.</tt><tt class="descname">unknown_entityref</tt><big>(</big><em>ref</em><big>)</big><a class="headerlink" href="#sgmllib.SGMLParser.unknown_entityref" title="Permalink to this definition">¶</a></dt>
<dd><p>This method is called to process an unknown entity reference.  It is intended to
be overridden by a derived class; the base class implementation does nothing.</p>
</dd></dl>

<p>Apart from overriding or extending the methods listed above, derived classes may
also define methods of the following form to define processing of specific tags.
Tag names in the input stream are case independent; the <em>tag</em> occurring in
method names must be in lower case:</p>
<dl class="method">
<dt>
<tt class="descclassname">SGMLParser.</tt><tt class="descname">start_tag</tt><big>(</big><em>attributes</em><big>)</big></dt>
<dd><p>This method is called to process an opening tag <em>tag</em>.  It has preference over
<tt class="xref py py-meth docutils literal"><span class="pre">do_tag()</span></tt>.  The <em>attributes</em> argument has the same meaning as described for
<tt class="xref py py-meth docutils literal"><span class="pre">handle_starttag()</span></tt> above.</p>
</dd></dl>

<dl class="method">
<dt>
<tt class="descclassname">SGMLParser.</tt><tt class="descname">do_tag</tt><big>(</big><em>attributes</em><big>)</big></dt>
<dd><p>This method is called to process an opening tag <em>tag</em>  for which no
<tt class="xref py py-meth docutils literal"><span class="pre">start_tag()</span></tt> method is defined.   The <em>attributes</em> argument has the same
meaning as described for <tt class="xref py py-meth docutils literal"><span class="pre">handle_starttag()</span></tt> above.</p>
</dd></dl>

<dl class="method">
<dt>
<tt class="descclassname">SGMLParser.</tt><tt class="descname">end_tag</tt><big>(</big><big>)</big></dt>
<dd><p>This method is called to process a closing tag <em>tag</em>.</p>
</dd></dl>

<p>Note that the parser maintains a stack of open elements for which no end tag has
been found yet.  Only tags processed by <tt class="xref py py-meth docutils literal"><span class="pre">start_tag()</span></tt> are pushed on this
stack.  Definition of an <tt class="xref py py-meth docutils literal"><span class="pre">end_tag()</span></tt> method is optional for these tags.  For
tags processed by <tt class="xref py py-meth docutils literal"><span class="pre">do_tag()</span></tt> or by <tt class="xref py py-meth docutils literal"><span class="pre">unknown_tag()</span></tt>, no <tt class="xref py py-meth docutils literal"><span class="pre">end_tag()</span></tt>
method must be defined; if defined, it will not be used.  If both
<tt class="xref py py-meth docutils literal"><span class="pre">start_tag()</span></tt> and <tt class="xref py py-meth docutils literal"><span class="pre">do_tag()</span></tt> methods exist for a tag, the
<tt class="xref py py-meth docutils literal"><span class="pre">start_tag()</span></tt> method takes precedence.</p>
</div>


          </div>
        </div>
      </div>
      <div class="sphinxsidebar">
        <div class="sphinxsidebarwrapper">
  <h4>Previous topic</h4>
  <p class="topless"><a href="htmlparser.html"
                        title="previous chapter">19.1. <tt class="docutils literal"><span class="pre">HTMLParser</span></tt> &#8212; Simple HTML and XHTML parser</a></p>
  <h4>Next topic</h4>
  <p class="topless"><a href="htmllib.html"
                        title="next chapter">19.3. <tt class="docutils literal"><span class="pre">htmllib</span></tt> &#8212; A parser for HTML documents</a></p>
<h3>This Page</h3>
<ul class="this-page-menu">
  <li><a href="../bugs.html">Report a Bug</a></li>
  <li><a href="../_sources/library/sgmllib.txt"
         rel="nofollow">Show Source</a></li>
</ul>

<div id="searchbox" style="display: none">
  <h3>Quick search</h3>
    <form class="search" action="../search.html" method="get">
      <input type="text" name="q" />
      <input type="submit" value="Go" />
      <input type="hidden" name="check_keywords" value="yes" />
      <input type="hidden" name="area" value="default" />
    </form>
    <p class="searchtip" style="font-size: 90%">
    Enter search terms or a module, class or function name.
    </p>
</div>
<script type="text/javascript">$('#searchbox').show(0);</script>
        </div>
      </div>
      <div class="clearer"></div>
    </div>
    <div class="related">
      <h3>Navigation</h3>
      <ul>
        <li class="right" style="margin-right: 10px">
          <a href="../genindex.html" title="General Index"
             >index</a></li>
        <li class="right" >
          <a href="../py-modindex.html" title="Python Module Index"
             >modules</a> |</li>
        <li class="right" >
          <a href="htmllib.html" title="19.3. htmllib — A parser for HTML documents"
             >next</a> |</li>
        <li class="right" >
          <a href="htmlparser.html" title="19.1. HTMLParser — Simple HTML and XHTML parser"
             >previous</a> |</li>
        <li><img src="../_static/py.png" alt=""
                 style="vertical-align: middle; margin-top: -1px"/></li>
        <li><a href="http://www.python.org/">Python</a> &raquo;</li>
        <li>
          <a href="../index.html">Python 2.7.5 documentation</a> &raquo;
        </li>

          <li><a href="index.html" >The Python Standard Library</a> &raquo;</li>
          <li><a href="markup.html" >19. Structured Markup Processing Tools</a> &raquo;</li> 
      </ul>
    </div>
    <div class="footer">
    &copy; <a href="../copyright.html">Copyright</a> 1990-2019, Python Software Foundation.
    <br />
    The Python Software Foundation is a non-profit corporation.
    <a href="http://www.python.org/psf/donations/">Please donate.</a>
    <br />
    Last updated on Jul 03, 2019.
    <a href="../bugs.html">Found a bug</a>?
    <br />
    Created using <a href="http://sphinx.pocoo.org/">Sphinx</a> 1.1.3.
    </div>

  </body>
</html>

Filemanager

Name Type Size Permission Actions
2to3.html File 49.27 KB 0644
__builtin__.html File 10.26 KB 0644
__future__.html File 13.79 KB 0644
__main__.html File 7.05 KB 0644
_winreg.html File 59.21 KB 0644
abc.html File 23.9 KB 0644
aepack.html File 13.16 KB 0644
aetools.html File 14.91 KB 0644
aetypes.html File 18.88 KB 0644
aifc.html File 22.4 KB 0644
al.html File 17.34 KB 0644
allos.html File 33.72 KB 0644
anydbm.html File 16.33 KB 0644
archiving.html File 9.26 KB 0644
argparse.html File 237.62 KB 0644
array.html File 29.29 KB 0644
ast.html File 34.98 KB 0644
asynchat.html File 31.43 KB 0644
asyncore.html File 36.51 KB 0644
atexit.html File 16.8 KB 0644
audioop.html File 31.36 KB 0644
autogil.html File 8.19 KB 0644
base64.html File 19.67 KB 0644
basehttpserver.html File 34.04 KB 0644
bastion.html File 11.04 KB 0644
bdb.html File 36.68 KB 0644
binascii.html File 20.67 KB 0644
binhex.html File 10.58 KB 0644
bisect.html File 23.24 KB 0644
bsddb.html File 26.43 KB 0644
bz2.html File 26.08 KB 0644
calendar.html File 37.79 KB 0644
carbon.html File 48.94 KB 0644
cd.html File 27.96 KB 0644
cgi.html File 49.92 KB 0644
cgihttpserver.html File 13.1 KB 0644
cgitb.html File 11.41 KB 0644
chunk.html File 14.66 KB 0644
cmath.html File 25.63 KB 0644
cmd.html File 26.09 KB 0644
code.html File 24.58 KB 0644
codecs.html File 100.64 KB 0644
codeop.html File 14.84 KB 0644
collections.html File 133.96 KB 0644
colorpicker.html File 7.52 KB 0644
colorsys.html File 11.04 KB 0644
commands.html File 14.36 KB 0644
compileall.html File 16.83 KB 0644
compiler.html File 67.75 KB 0644
configparser.html File 62.13 KB 0644
constants.html File 12.83 KB 0644
contextlib.html File 19.39 KB 0644
cookie.html File 39.07 KB 0644
cookielib.html File 83.82 KB 0644
copy.html File 12.19 KB 0644
copy_reg.html File 13.76 KB 0644
crypt.html File 10.04 KB 0644
crypto.html File 7.59 KB 0644
csv.html File 67.37 KB 0644
ctypes.html File 238.78 KB 0644
curses.ascii.html File 22.29 KB 0644
curses.html File 146.63 KB 0644
curses.panel.html File 14.39 KB 0644
custominterp.html File 7.62 KB 0644
datatypes.html File 16.84 KB 0644
datetime.html File 226.59 KB 0644
dbhash.html File 15.48 KB 0644
dbm.html File 12.07 KB 0644
debug.html File 10.15 KB 0644
decimal.html File 194.44 KB 0644
development.html File 14.17 KB 0644
difflib.html File 84.83 KB 0644
dircache.html File 11.41 KB 0644
dis.html File 69.95 KB 0644
distutils.html File 8.05 KB 0644
dl.html File 16.33 KB 0644
doctest.html File 165.54 KB 0644
docxmlrpcserver.html File 16.43 KB 0644
dumbdbm.html File 14.02 KB 0644
dummy_thread.html File 9.43 KB 0644
dummy_threading.html File 8.37 KB 0644
easydialogs.html File 30.55 KB 0644
email-examples.html File 45.65 KB 0644
email.charset.html File 26.8 KB 0644
email.encoders.html File 11.86 KB 0644
email.errors.html File 15.77 KB 0644
email.generator.html File 20.77 KB 0644
email.header.html File 26.92 KB 0644
email.html File 44.24 KB 0644
email.iterators.html File 11.52 KB 0644
email.message.html File 63.16 KB 0644
email.mime.html File 27.93 KB 0644
email.parser.html File 30.45 KB 0644
email.util.html File 24.46 KB 0644
errno.html File 37.99 KB 0644
exceptions.html File 56.13 KB 0644
fcntl.html File 22.67 KB 0644
filecmp.html File 22.3 KB 0644
fileformats.html File 9.14 KB 0644
fileinput.html File 24.28 KB 0644
filesys.html File 10.2 KB 0644
fl.html File 49.92 KB 0644
fm.html File 11.91 KB 0644
fnmatch.html File 14.58 KB 0644
formatter.html File 34.06 KB 0644
fpectl.html File 16.01 KB 0644
fpformat.html File 10.59 KB 0644
fractions.html File 22.61 KB 0644
framework.html File 33.34 KB 0644
frameworks.html File 7.14 KB 0644
ftplib.html File 43.99 KB 0644
functions.html File 183.14 KB 0644
functools.html File 27.17 KB 0644
future_builtins.html File 13.04 KB 0644
gc.html File 25.75 KB 0644
gdbm.html File 15.96 KB 0644
gensuitemodule.html File 11.51 KB 0644
getopt.html File 23.66 KB 0644
getpass.html File 10.65 KB 0644
gettext.html File 78.76 KB 0644
gl.html File 22.09 KB 0644
glob.html File 13.26 KB 0644
grp.html File 10.49 KB 0644
gzip.html File 18.99 KB 0644
hashlib.html File 18.2 KB 0644
heapq.html File 31.61 KB 0644
hmac.html File 10.46 KB 0644
hotshot.html File 18.65 KB 0644
htmllib.html File 25.32 KB 0644
htmlparser.html File 39.11 KB 0644
httplib.html File 62.95 KB 0644
i18n.html File 9.52 KB 0644
ic.html File 17.17 KB 0644
idle.html File 20.9 KB 0644
imageop.html File 14.76 KB 0644
imaplib.html File 51.99 KB 0644
imgfile.html File 11.71 KB 0644
imghdr.html File 11.3 KB 0644
imp.html File 34.34 KB 0644
importlib.html File 8.26 KB 0644
imputil.html File 31.81 KB 0644
index.html File 72.78 KB 0644
inspect.html File 50.71 KB 0644
internet.html File 24.87 KB 0644
intro.html File 8.93 KB 0644
io.html File 98.13 KB 0644
ipc.html File 13.41 KB 0644
itertools.html File 115.91 KB 0644
jpeg.html File 12.74 KB 0644
json.html File 67.04 KB 0644
keyword.html File 7.68 KB 0644
language.html File 11.03 KB 0644
linecache.html File 10.59 KB 0644
locale.html File 55.14 KB 0644
logging.config.html File 63.36 KB 0644
logging.handlers.html File 69.64 KB 0644
logging.html File 95.64 KB 0644
mac.html File 21.79 KB 0644
macos.html File 14.76 KB 0644
macosa.html File 12.96 KB 0644
macostools.html File 15.52 KB 0644
macpath.html File 7.76 KB 0644
mailbox.html File 156.75 KB 0644
mailcap.html File 13.21 KB 0644
markup.html File 18.77 KB 0644
marshal.html File 17.98 KB 0644
math.html File 39.24 KB 0644
md5.html File 13.97 KB 0644
mhlib.html File 21.54 KB 0644
mimetools.html File 19.25 KB 0644
mimetypes.html File 28.39 KB 0644
mimewriter.html File 15.02 KB 0644
mimify.html File 13.36 KB 0644
miniaeframe.html File 12.2 KB 0644
misc.html File 6.87 KB 0644
mm.html File 9.03 KB 0644
mmap.html File 28.36 KB 0644
modulefinder.html File 15.31 KB 0644
modules.html File 8.46 KB 0644
msilib.html File 52.43 KB 0644
msvcrt.html File 19.37 KB 0644
multifile.html File 24.3 KB 0644
multiprocessing.html File 365.71 KB 0644
mutex.html File 11.23 KB 0644
netdata.html File 16.98 KB 0644
netrc.html File 12.3 KB 0644
new.html File 12.12 KB 0644
nis.html File 10.64 KB 0644
nntplib.html File 41.92 KB 0644
numbers.html File 37.75 KB 0644
numeric.html File 13.55 KB 0644
operator.html File 82 KB 0644
optparse.html File 222.56 KB 0644
os.html File 214.25 KB 0644
os.path.html File 38.34 KB 0644
ossaudiodev.html File 41.5 KB 0644
othergui.html File 9.08 KB 0644
parser.html File 39.36 KB 0644
pdb.html File 33.96 KB 0644
persistence.html File 14.87 KB 0644
pickle.html File 102.27 KB 0644
pickletools.html File 10.63 KB 0644
pipes.html File 18.01 KB 0644
pkgutil.html File 25.11 KB 0644
platform.html File 28.37 KB 0644
plistlib.html File 17.03 KB 0644
popen2.html File 25.43 KB 0644
poplib.html File 22.32 KB 0644
posix.html File 14.41 KB 0644
posixfile.html File 19.76 KB 0644
pprint.html File 29.92 KB 0644
profile.html File 63.56 KB 0644
pty.html File 9.48 KB 0644
pwd.html File 11.43 KB 0644
py_compile.html File 11.12 KB 0644
pyclbr.html File 14.71 KB 0644
pydoc.html File 11.48 KB 0644
pyexpat.html File 71.53 KB 0644
python.html File 12.27 KB 0644
queue.html File 24.22 KB 0644
quopri.html File 11.9 KB 0644
random.html File 37.83 KB 0644
re.html File 134.74 KB 0644
readline.html File 28.24 KB 0644
repr.html File 20.43 KB 0644
resource.html File 26.48 KB 0644
restricted.html File 11.65 KB 0644
rexec.html File 37.41 KB 0644
rfc822.html File 42.22 KB 0644
rlcompleter.html File 13.51 KB 0644
robotparser.html File 12.27 KB 0644
runpy.html File 19.34 KB 0644
sched.html File 18.54 KB 0644
scrolledtext.html File 9.32 KB 0644
select.html File 39.67 KB 0644
sets.html File 36.92 KB 0644
sgi.html File 9.71 KB 0644
sgmllib.html File 30.77 KB 0644
sha.html File 12.09 KB 0644
shelve.html File 27.02 KB 0644
shlex.html File 32.1 KB 0644
shutil.html File 40.22 KB 0644
signal.html File 31.14 KB 0644
simplehttpserver.html File 18.41 KB 0644
simplexmlrpcserver.html File 31.39 KB 0644
site.html File 23.64 KB 0644
smtpd.html File 12.46 KB 0644
smtplib.html File 42.13 KB 0644
sndhdr.html File 10.02 KB 0644
socket.html File 106.34 KB 0644
socketserver.html File 59.83 KB 0644
someos.html File 15.11 KB 0644
spwd.html File 10.33 KB 0644
sqlite3.html File 139.5 KB 0644
ssl.html File 65.62 KB 0644
stat.html File 32.31 KB 0644
statvfs.html File 10.6 KB 0644
stdtypes.html File 260.4 KB 0644
string.html File 106.65 KB 0644
stringio.html File 18.81 KB 0644
stringprep.html File 16.13 KB 0644
strings.html File 14.93 KB 0644
struct.html File 40.88 KB 0644
subprocess.html File 84.91 KB 0644
sun.html File 6.84 KB 0644
sunau.html File 27.1 KB 0644
sunaudio.html File 17.79 KB 0644
symbol.html File 7.66 KB 0644
symtable.html File 22.94 KB 0644
sys.html File 98.7 KB 0644
sysconfig.html File 23.84 KB 0644
syslog.html File 17.92 KB 0644
tabnanny.html File 10.63 KB 0644
tarfile.html File 78.68 KB 0644
telnetlib.html File 25.48 KB 0644
tempfile.html File 29.42 KB 0644
termios.html File 16.01 KB 0644
test.html File 52.62 KB 0644
textwrap.html File 27.25 KB 0644
thread.html File 20.47 KB 0644
threading.html File 76.69 KB 0644
time.html File 56.93 KB 0644
timeit.html File 36.27 KB 0644
tix.html File 46.96 KB 0644
tk.html File 23.64 KB 0644
tkinter.html File 67.67 KB 0644
token.html File 19.62 KB 0644
tokenize.html File 18.45 KB 0644
trace.html File 25.54 KB 0644
traceback.html File 33.44 KB 0644
ttk.html File 101.75 KB 0644
tty.html File 9.06 KB 0644
turtle.html File 211.74 KB 0644
types.html File 27.59 KB 0644
undoc.html File 23.16 KB 0644
unicodedata.html File 18.55 KB 0644
unittest.html File 202.85 KB 0644
unix.html File 10.55 KB 0644
urllib.html File 58.68 KB 0644
urllib2.html File 100.58 KB 0644
urlparse.html File 40.41 KB 0644
user.html File 11.83 KB 0644
userdict.html File 29.73 KB 0644
uu.html File 11.03 KB 0644
uuid.html File 28.19 KB 0644
warnings.html File 46.6 KB 0644
wave.html File 22.22 KB 0644
weakref.html File 36.52 KB 0644
webbrowser.html File 23.07 KB 0644
whichdb.html File 8.85 KB 0644
windows.html File 9.33 KB 0644
winsound.html File 18.75 KB 0644
wsgiref.html File 81.04 KB 0644
xdrlib.html File 29.94 KB 0644
xml.dom.html File 89.04 KB 0644
xml.dom.minidom.html File 40.42 KB 0644
xml.dom.pulldom.html File 12.71 KB 0644
xml.etree.elementtree.html File 93.22 KB 0644
xml.html File 16.49 KB 0644
xml.sax.handler.html File 38.63 KB 0644
xml.sax.html File 20.22 KB 0644
xml.sax.reader.html File 39.09 KB 0644
xml.sax.utils.html File 14.26 KB 0644
xmlrpclib.html File 60.79 KB 0644
zipfile.html File 53.14 KB 0644
zipimport.html File 20.42 KB 0644
zlib.html File 25.46 KB 0644