-
- re(正则表达式)
- 参考: http://docs.python.org/2/library/re.html
- 参考: http://www.cnblogs.com/huxi/archive/2010/07/04/1771073.html
- 参考: http://blog.csdn.net/jgood/article/details/4277902
-
- gevent/threading(多线程)
- 参考: http://docs.python.org/2/library/threading.html
- 参考: http://www.cnblogs.com/huxi/archive/2010/06/26/1765808.html
-
- docopt/getopt/argparse(命令行参数处理)
- 参考: https://github.com/docopt/docopt
- 参考: http://docs.python.org/2/library/getopt.html
- 参考: http://andylin02.iteye.com/blog/845355
- 参考: http://docs.python.org/2/howto/argparse.html
- 参考: http://www.cnblogs.com/jianboqi/archive/2013/01/10/2854726.html
-
- ConfigParser(配置文件读取)
- 参考: http://docs.python.org/2/library/configparser.html
- 参考: http://blog.chinaunix.net/uid-25890465-id-3312861.html
-
- urllib/urllib2/httplib(网页下载)
- 参考: http://docs.python.org/2/library/urllib2.html
- 参考: http://blog.csdn.net/wklken/article/details/7364328
- 参考: http://www.nowamagic.net/academy/detail/1302872
-
- pyquery/beautifulsoup4/HTMLParser/SGMLParser(HTML解析)
- 参考: http://docs.python.org/2/library/htmlparser.html
- 参考: http://cloudaice.com/yong-pythonde-htmlparserfen-xi-htmlye-mian/
- 参考: http://docs.python.org/2/library/sgmllib.html
- 参考: http://pako.iteye.com/blog/592009
-
- urlparse(URL解析处理)
- 参考: http://docs.python.org/2/library/urlparse.html
- 参考: http://blog.sina.com.cn/s/blog_5ff7f94f0100qr3c.html
-
- logging(日志处理)
- 参考: http://docs.python.org/2/library/logging.html
- 参考: http://kenby.iteye.com/blog/1162698
- 参考: http://my.oschina.net/leejun2005/blog/126713
python 日志模块 logging 详解