Python基础

Sean Wong

已于 2022-05-24 18:55:35 修改

阅读量4.5k

点赞数 3

分类专栏： Python基础文章标签： python url

于 2021-03-04 22:17:43 首次发布

本文链接：https://blog.csdn.net/m0_48004969/article/details/114378868

版权

Python基础专栏收录该内容

1 篇文章 1 订阅

订阅专栏

Python

pip更改镜像

pip install -i [镜像url] [包名称]
如：
从清华镜像安装scrapy包

pip install -i https://pypi.tuna.tsinghua.edu.cn/simple scrapy

常用源：
（1）阿里云 http://mirrors.aliyun.com/pypi/simple/
（2）豆瓣http://pypi.douban.com/simple/
（3）清华大学 https://pypi.tuna.tsinghua.edu.cn/simple/
（4）中国科学技术大学 http://pypi.mirrors.ustc.edu.cn/simple/
（5）华中科技大学http://pypi.hustunique.com/

常用包：
一、Web框架:

Tornado，访问：http://www.tornadoweb.org/en/stable/

Flask，访问：http://flask.pocoo.org/

Web.py，访问：http://webpy.org/

django：https://www.djangoproject.com/

cherrypy：http://cherrypy.org/

jinjs：http://docs.jinkan.org/docs/jinja2/

二、GUI 图形界面

Tkinter :https://wiki.python.org/moin/TkInter/

wxPython:https://www.wxpython.org/

PyGTK:http://www.pygtk.org/

PyQt:https://sourceforge.net/projects/pyqt/

PySide:http://wiki.qt.io/Category:LanguageBindings::PySide

三、科学计算

numpy：http://www.numpy.org/

SciPy：https://www.scipy.org/

pandas： http://pandas.pydata.org/

blaze： http://blaze.readthedocs.io/en/latest/index.html

四、密码学

cryptography：https://pypi.python.org/pypi/cryptography/

hashids：http://www.oschina.net/p/hashids

Paramiko：http://www.paramiko.org/

Passlib：https://pythonhosted.org/passlib/

PyCrypto：https://pypi.python.org/pypi/pycrypto

PyNacl：http://pynacl.readthedocs.io/en/latest/

五、爬虫相关

scrapy：https://scrapy.org/

pyspider： https://github.com/binux/pyspider

portia：https://github.com/scrapinghub/portia

html2text：https://github.com/Alir3z4/html2text

BeautifulSoup：https://www.crummy.com/software/BeautifulSoup/

lxml：http://lxml.de/

selenium：http://docs.seleniumhq.org/

mechanize：https://pypi.python.org/pypi/mechanize

PyQuery：https://pypi.python.org/pypi/pyquery/

creepy：https://pypi.python.org/pypi/creepy

gevent，一个高并发的网络性能库，访问：http://www.gevent.org/

requests，最好用的http工具，访问：http://www.python-requests.org/

六、图像处理

bigmoyan：http://scikit-image.org/

Python Imaging Library(PIL)：

http://www.pythonware.com/products/pil/

pillow： http://pillow.readthedocs.io/en/latest/

七、自然语言处理

nltk： http://www.nltk.org/

snownlp： https://github.com/isnowfy/snownlp

Pattern：https://github.com/clips/pattern

TextBlob：http://textblob.readthedocs.io/en/dev/

Polyglot：https://pypi.python.org/pypi/polyglot

jieba： https://github.com/fxsjy/jieba

八、数据库驱动

mysql-python： https://sourceforge.net/projects/mysql-python/

PyMySQL：https://github.com/PyMySQL/PyMySQL

PyMongo：https://docs.mongodb.com/ecosystem/drivers/python/

pymongo，MongoDB库，访问：https://pypi.python.org/pypi/pymongo/

redis，Redis库，访问：https://pypi.python.org/pypi/redis/

cxOracle，Oracle库，访问：https://pypi.python.org/pypi/cx_Oracle

SQLAlchemy，SQL工具包及对象关系映射（ORM）工具，访问：http://www.sqlalchemy.org/

peewee， SQL工具包及对象关系映射（ORM）工具，访问：https://pypi.python.org/pypi/peewee

torndb，Tornado原装DB，访问：https://github.com/bdarnell/torndb

九、Web

pycurl ———— URL处理工具

smtplib模块 ———— 发送电子邮件

with statement

with statement用于异常处理。

requests模块

requests模块：可以发生http请求，并返回response对象.

什么是urllib

urllib是Python的[URL处理包]，可用于访问网页，并且与网页交互。urllib包含了几个模块，如图所示：
在这里插入图片描述

urllib.request

from urllib.request import urlopen

myURL = urlopen("http://194.53.108.5")
print(myURL.read().decode('utf-8'))

打印网页的html源码。

urllib.parse

from urllib.parse import urlparse 
parsedUrl = urlparse('https://www.educative.io/track/python-for-programmers') 
print(parsedUrl)

打印内容为：URL分为其组成部分，例如所使用的协议方案，网络位置netloc和网页路径path等。

urllib.error

from urllib.request import urlopen, HTTPError, URLError

try:
    myURL = urlopen("http://ww.educative.xyz/")
except HTTPError as e:
    print('HTTP Error code: ', e.code)
except URLError as e:
    print('URL Error: ', e.reason)
else:
    print('No Error.')