Module containing the UniversalDetector detector class, which is the primary
class a user of ``chardet`` should use.

:author: Mark Pilgrim (initial port to Python)
:author: Shy Shalom (original C code)
:author: Dan Blanchard (major refactoring for 3.0)
:author: Ian Cordasco

class UniversalDetector(object)
    The ``UniversalDetector`` class underlies the ``chardet.detect`` function
    and coordinates all of the different charset probers.

    To get a ``dict`` containing an encoding and its confidence, you can simply
    run:

    .. code::

            u = UniversalDetector()
            u.feed(some_bytes)
            u.close()
            detected = u.result

    MINIMUM_THRESHOLD = 0.20
    HIGH_BYTE_DETECTOR = re.compile(b'[\x80-\xFF]')
    ESC_DETECTOR = re.compile(b'(\033|~{)')
    WIN_BYTE_DETECTOR = re.compile(b'[\x80-\x9F]')
    ISO_WIN_MAP = {'iso-8859-1': 'Windows-1252',
                   'iso-8859-2': 'Windows-1250',
                   'iso-8859-5': 'Windows-1251',
                   'iso-8859-6': 'Windows-1256',
                   'iso-8859-7': 'Windows-1253',
                   'iso-8859-8': 'Windows-1255',
                   'iso-8859-9': 'Windows-1254',
                   'iso-8859-13': 'Windows-1257'}

UniversalDetector.reset()
        Reset the UniversalDetector and all of its probers back to their
        initial states.  This is called by ``__init__``, so you only need to
        call this directly in between analyses of different documents.

UniversalDetector.feed(byte_str)
        Takes a chunk of a document and feeds it through all of the relevant
        charset probers.

        After calling ``feed``, you can check the value of the ``done``
        attribute to see if you need to continue feeding the
        ``UniversalDetector`` more data, or if it has made a prediction
        (in the ``result`` attribute).

        .. note::
           You should always call ``close`` when you're done feeding in your
           document if ``done`` is not already ``True``.

UniversalDetector.close()
        Stop analyzing the current document and come up with a final
        prediction.

        :returns:  The ``result`` attribute, a ``dict`` with the keys
                   `encoding`, `confidence`, and `language`.
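
The shape of the returned `dict` and one plausible way a final prediction might be chosen can be sketched as follows. This is an illustration under stated assumptions, not chardet's actual code: the 0.20 confidence floor, the strict `>` comparison, the ISO-8859 to Windows remapping table, and the `(name, confidence, language)` tuple format are all assumptions, and `pick_result` is a hypothetical helper.

```python
# Assumed values for illustration; not read from a live chardet install.
MINIMUM_THRESHOLD = 0.20
ISO_WIN_MAP = {
    "iso-8859-1": "Windows-1252",
    "iso-8859-2": "Windows-1250",
    "iso-8859-5": "Windows-1251",
    "iso-8859-6": "Windows-1256",
    "iso-8859-7": "Windows-1253",
    "iso-8859-8": "Windows-1255",
    "iso-8859-9": "Windows-1254",
    "iso-8859-13": "Windows-1257",
}


def pick_result(candidates, has_win_bytes=False):
    """Pick the most confident (name, confidence, language) candidate.

    Returns a dict with the keys `encoding`, `confidence`, and `language`.
    """
    best = None
    best_conf = 0.0
    for name, conf, lang in candidates:
        if conf > best_conf:
            best, best_conf = (name, conf, lang), conf
    # Nothing cleared the floor: report no prediction.
    if best is None or best_conf <= MINIMUM_THRESHOLD:
        return {"encoding": None, "confidence": 0.0, "language": None}
    name, conf, lang = best
    # If Windows-specific bytes were seen, prefer the matching Windows code page.
    if has_win_bytes and name.lower().startswith("iso-8859"):
        name = ISO_WIN_MAP.get(name.lower(), name)
    return {"encoding": name, "confidence": conf, "language": lang}
```

For example, `pick_result([("ISO-8859-1", 0.73, "")], has_win_bytes=True)` reports `Windows-1252`, while the same call without `has_win_bytes` keeps `ISO-8859-1`.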

Filemanager

Name Type Size Permission
__init__.cpython-38.opt-1.pyc File 814 B 0644
__init__.cpython-38.pyc File 814 B 0644
big5freq.cpython-38.opt-1.pyc File 26.51 KB 0644
big5freq.cpython-38.pyc File 26.51 KB 0644
big5prober.cpython-38.opt-1.pyc File 1.07 KB 0644
big5prober.cpython-38.pyc File 1.07 KB 0644
chardistribution.cpython-38.opt-1.pyc File 6.04 KB 0644
chardistribution.cpython-38.pyc File 6.04 KB 0644
charsetgroupprober.cpython-38.opt-1.pyc File 2.16 KB 0644
charsetgroupprober.cpython-38.pyc File 2.16 KB 0644
charsetprober.cpython-38.opt-1.pyc File 3.37 KB 0644
charsetprober.cpython-38.pyc File 3.37 KB 0644
codingstatemachine.cpython-38.opt-1.pyc File 2.81 KB 0644
codingstatemachine.cpython-38.pyc File 2.81 KB 0644
compat.cpython-38.opt-1.pyc File 319 B 0644
compat.cpython-38.pyc File 319 B 0644
cp949prober.cpython-38.opt-1.pyc File 1.08 KB 0644
cp949prober.cpython-38.pyc File 1.08 KB 0644
enums.cpython-38.opt-1.pyc File 2.55 KB 0644
enums.cpython-38.pyc File 2.55 KB 0644
escprober.cpython-38.opt-1.pyc File 2.54 KB 0644
escprober.cpython-38.pyc File 2.54 KB 0644
escsm.cpython-38.opt-1.pyc File 7.26 KB 0644
escsm.cpython-38.pyc File 7.26 KB 0644
eucjpprober.cpython-38.opt-1.pyc File 2.36 KB 0644
eucjpprober.cpython-38.pyc File 2.36 KB 0644
euckrfreq.cpython-38.opt-1.pyc File 11.75 KB 0644
euckrfreq.cpython-38.pyc File 11.75 KB 0644
euckrprober.cpython-38.opt-1.pyc File 1.08 KB 0644
euckrprober.cpython-38.pyc File 1.08 KB 0644
euctwfreq.cpython-38.opt-1.pyc File 26.51 KB 0644
euctwfreq.cpython-38.pyc File 26.51 KB 0644
euctwprober.cpython-38.opt-1.pyc File 1.08 KB 0644
euctwprober.cpython-38.pyc File 1.08 KB 0644
gb2312freq.cpython-38.opt-1.pyc File 18.62 KB 0644
gb2312freq.cpython-38.pyc File 18.62 KB 0644
gb2312prober.cpython-38.opt-1.pyc File 1.09 KB 0644
gb2312prober.cpython-38.pyc File 1.09 KB 0644
hebrewprober.cpython-38.opt-1.pyc File 2.92 KB 0644
hebrewprober.cpython-38.pyc File 2.92 KB 0644
jisfreq.cpython-38.opt-1.pyc File 21.58 KB 0644
jisfreq.cpython-38.pyc File 21.58 KB 0644
jpcntx.cpython-38.opt-1.pyc File 36.69 KB 0644
jpcntx.cpython-38.pyc File 36.69 KB 0644
langbulgarianmodel.cpython-38.opt-1.pyc File 23.04 KB 0644
langbulgarianmodel.cpython-38.pyc File 23.04 KB 0644
langcyrillicmodel.cpython-38.opt-1.pyc File 28.38 KB 0644
langcyrillicmodel.cpython-38.pyc File 28.38 KB 0644
langgreekmodel.cpython-38.opt-1.pyc File 23 KB 0644
langgreekmodel.cpython-38.pyc File 23 KB 0644
langhebrewmodel.cpython-38.opt-1.pyc File 21.66 KB 0644
langhebrewmodel.cpython-38.pyc File 21.66 KB 0644
langhungarianmodel.cpython-38.opt-1.pyc File 23.03 KB 0644
langhungarianmodel.cpython-38.pyc File 23.03 KB 0644
langthaimodel.cpython-38.opt-1.pyc File 21.64 KB 0644
langthaimodel.cpython-38.pyc File 21.64 KB 0644
langturkishmodel.cpython-38.opt-1.pyc File 21.66 KB 0644
langturkishmodel.cpython-38.pyc File 21.66 KB 0644
latin1prober.cpython-38.opt-1.pyc File 3.29 KB 0644
latin1prober.cpython-38.pyc File 3.29 KB 0644
mbcharsetprober.cpython-38.opt-1.pyc File 2.18 KB 0644
mbcharsetprober.cpython-38.pyc File 2.18 KB 0644
mbcsgroupprober.cpython-38.opt-1.pyc File 1.07 KB 0644
mbcsgroupprober.cpython-38.pyc File 1.07 KB 0644
mbcssm.cpython-38.opt-1.pyc File 16.33 KB 0644
mbcssm.cpython-38.pyc File 16.33 KB 0644
sbcharsetprober.cpython-38.opt-1.pyc File 2.91 KB 0644
sbcharsetprober.cpython-38.pyc File 2.91 KB 0644
sbcsgroupprober.cpython-38.opt-1.pyc File 1.56 KB 0644
sbcsgroupprober.cpython-38.pyc File 1.56 KB 0644
sjisprober.cpython-38.opt-1.pyc File 2.39 KB 0644
sjisprober.cpython-38.pyc File 2.39 KB 0644
universaldetector.cpython-38.opt-1.pyc File 5.66 KB 0644
universaldetector.cpython-38.pyc File 5.66 KB 0644
utf8prober.cpython-38.opt-1.pyc File 1.91 KB 0644
utf8prober.cpython-38.pyc File 1.91 KB 0644
version.cpython-38.opt-1.pyc File 403 B 0644
version.cpython-38.pyc File 403 B 0644