python
winnertakeall
这个作者很懒,什么都没留下…
展开
-
标签的制作
kind = ["体育","财经","房产"]kind_id = dict(zip(kind, range(len(kind))))print(kind_id){'体育': 0, '财经': 1, '房产': 2}原创 2018-10-19 09:49:02 · 297 阅读 · 0 评论 -
多线程
单线程的方式import timedef coding(): for x in range(3): print("正在写代码%s"%x) time.sleep(1) def drawing(): for x in range(3): print("正在画图%s"%x) time.sleep(1...原创 2019-02-26 20:54:40 · 110 阅读 · 0 评论 -
csv文件进行操作
import csvheaders = ["username", "age", "height"]#values = [# ("张三", 18, 180),# ("李四", 19, 190),# ("王五", 20, 160)# ]##with open("classroom.csv", &qu原创 2019-02-25 23:28:05 · 366 阅读 · 0 评论 -
case when end
select * from employees select distinct name,age,case when address like '%广州%' then '广州中山大' when address like '%朝阳%' then '朝阳区' end as addrefrom employees原创 2019-02-18 22:04:11 · 313 阅读 · 0 评论 -
装饰器用在爬虫即retrying模块的安装
import requestsfrom retrying import retryheaders={"User-Agent":"Mozilla/5.0 (Macintosh; Intel Mac OS X 10_13_2) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/63.0.3239.84 Safari/537.36"}@retry(st...原创 2019-01-22 23:16:24 · 312 阅读 · 0 评论 -
爬虫实现百度翻译
import requestsimport jsonimport sysquery_string = sys.argv[1]headers = {"User-Agent": "Mozilla/5.0 (iPhone; CPU iPhone OS 11_0 like Mac OS X) AppleWebKit/604.1.38 (KHTML, like Gecko) Version/11....原创 2019-01-20 16:09:39 · 1722 阅读 · 0 评论 -
实现任意贴吧的爬虫,保存网页到本地
# coding=utf-8import requestsclass TiebaSpider: def __init__(self, tieba_name): self.tieba_name = tieba_name self.url_temp = "https://tieba.baidu.com/f?kw="+tieba_name+"&pn=...原创 2019-01-20 13:27:02 · 887 阅读 · 0 评论 -
代参数的url发送请求
import requestsheaders = {"User-Agent": "Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/71.0.3578.98 Safari/537.36"}url = "https://www.baidu.com/s?"p = {"wd":"csdn"}...原创 2019-01-20 09:44:13 · 350 阅读 · 0 评论 -
response.text和response.content
In [1]: import requests In [2]: response = requests.get("http://www.baidu.com") In [3]: response ...原创 2019-01-19 20:58:55 · 735 阅读 · 0 评论 -
pyhon基础知识
查看python的版本pc@pc-HP-ProDesk-680-G3-PCI-MT:~$ pip3 --versionpip 9.0.1 from /usr/lib/python3/dist-packages (python 3.6)原创 2019-01-19 20:17:21 · 118 阅读 · 0 评论 -
str bytes如何转换
str 使用encode方法转换为bytes(爬虫的得到的响应以二进制的方式传送)In [9]: a = "你好" In [10]: type(a) ...原创 2019-01-19 15:24:22 · 829 阅读 · 0 评论 -
json的str类型和python类型的转换
parse_url.py# coding=utf-8import requestsfrom retrying import retryheaders={"User-Agent":"Mozilla/5.0 (Macintosh; Intel Mac OS X 10_13_2) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/63.0.3239...原创 2019-01-23 23:53:04 · 591 阅读 · 0 评论 -
python下的os
import osos.getcwd() 表示当前的路径'/home/shnu/demo/NLP/第九章'os.sep 表示/'/'c_root = os.getcwd() + os.sep + "source_data" + os.sep'/home/shnu/demo/NLP/第九章/source_data/'os.listdir(c_root) 把当前文件下的所...原创 2019-01-03 21:56:38 · 308 阅读 · 0 评论 -
python常用用法
1.rangeIn [3]: for i in range(4,0,-1): ...: print(i) ...: 43212.sorted,lambdadic = {('a', 10), ('b', 7), ('c', 7), ('d', 5), ('f', 5), ('e', 4), ('g', 3), ('h', 2), ('i', 2)...原创 2018-11-13 09:48:28 · 238 阅读 · 0 评论 -
爬虫之selenium
from selenium import webdriverfrom lxml import etreeimport reimport timefrom selenium.webdriver.support.ui import WebDriverWaitfrom selenium.webdriver.support import expected_conditions as ECfr...原创 2019-03-04 21:24:56 · 144 阅读 · 0 评论