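1. r.json() parses a JSON response body (for example, a string that looks like a dictionary) into a Python object such as a dict. It is a method of the requests Response object.
If the body of r is not valid JSON, the call raises an exception.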
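A minimal sketch of guarding r.json() against a body that is not JSON. The helper name json_or_none is hypothetical and not part of requests; it only assumes that the decode error raised by r.json() is a ValueError subclass.

```python
import requests


def json_or_none(response):
    """Return the decoded JSON body, or None if the body is not valid JSON.

    json_or_none is a hypothetical helper for illustration, not a requests API.
    """
    try:
        return response.json()
    except ValueError:
        # The JSON decode error raised by response.json() is a ValueError subclass.
        return None
```

It could be used as data = json_or_none(requests.get('http://httpbin.org/get')), followed by a check for None before reading any keys.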
verify=False and disable_warnings(): skip certificate verification and silence the resulting InsecureRequestWarning
import requests
import re
requests.packages.urllib3.disable_warnings()
r = requests.get('https://static1.scrape.cuiqingcai.com', verify=False)
pattern = re.compile('<h2.*?>(.*?)</h2>', re.S)
titles = re.findall(pattern, r.text)
print(titles)
['霸王别姬 - Farewell My Concubine', '这个杀手不太冷 - Léon', '肖申克的救赎 - The Shawshank Redemption', '泰坦尼克号 - Titanic', '罗马假日 - Roman Holiday', '唐伯虎点秋香 - Flirting Scholar', '乱世佳人 - Gone with the Wind', '喜剧之王 - The King of Comedy', '楚门的世界 - The Truman Show', '狮子王 - The Lion King']
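The pattern in the example above can be tried without a network call. The sketch below wraps it in a function and runs it on a short HTML fragment; the function name extract_titles and the sample markup are assumptions for illustration, not taken from the real page.

```python
import re

# Same pattern as above: the groups are non-greedy, and re.S lets "." match newlines.
TITLE_PATTERN = re.compile('<h2.*?>(.*?)</h2>', re.S)


def extract_titles(html):
    """Return the inner text of every <h2> element found in html."""
    return TITLE_PATTERN.findall(html)


# Hypothetical HTML fragment standing in for r.text.
sample = (
    '<h2 class="m-b-sm">霸王别姬 - Farewell My Concubine</h2>\n'
    '<h2>泰坦尼克号 - Titanic</h2>'
)
print(extract_titles(sample))
```

A regular expression is enough for a page this regular; for less predictable markup an HTML parser is usually the safer choice.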
import requests
requests.get('http://httpbin.org/cookies/set/number/123456789')
r = requests.get('http://httpbin.org/cookies')
print(r.text)
{
"cookies": {}
}
With a Session, the cookie persists across requests:
import requests
s = requests.Session()
s.get('http://httpbin.org/cookies/set/number/123456789')
r = s.get('http://httpbin.org/cookies')
print(r.text)
{
"cookies": {
"number": "123456789"
}
}
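1. s = requests.Session() creates a requests.Session object.
All later requests made through s belong to the same session, so cookies set by one response are sent with the next request.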
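The difference can also be inspected offline. In this sketch the cookie is put into the Session's jar by hand (an assumption standing in for a server's Set-Cookie response), and the requests are only prepared, never sent.

```python
import requests

url = 'http://httpbin.org/cookies'

# A Session keeps a cookie jar; here the cookie is set by hand instead of by a server.
s = requests.Session()
s.cookies.set('number', '123456789')

# prepare_request() builds the request the Session would send, without sending it.
with_session = s.prepare_request(requests.Request('GET', url))
print(with_session.headers.get('Cookie'))

# A request prepared outside any Session has no cookie jar to draw from.
without_session = requests.Request('GET', url).prepare()
print(without_session.headers.get('Cookie'))
```

The first print should show the cookie as number=123456789, while the second should show None, which mirrors the two httpbin results above.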
timeout
import requests
r = requests.get('http://httpbin.org/get', timeout=1)  # 1 second each for connect and read
r = requests.get('http://httpbin.org/get', timeout=(5, 30))  # (connect, read) timeouts in seconds
r = requests.get('http://httpbin.org/get', timeout=None)  # default: wait indefinitely
print(r.status_code)
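When a timeout is set, the request can fail with a timeout exception, so the call is usually wrapped. A minimal sketch, assuming a hypothetical helper named fetch_status that returns None on timeout:

```python
import requests


def fetch_status(url, timeout=(5, 30)):
    """Return the HTTP status code, or None if the request timed out.

    fetch_status is a hypothetical helper; timeout is (connect, read) in seconds.
    """
    try:
        r = requests.get(url, timeout=timeout)
    except requests.exceptions.Timeout:
        # Timeout is the base class of both ConnectTimeout and ReadTimeout.
        return None
    return r.status_code
```

Returning None keeps the sketch short; real code might retry or log the failure instead.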