多线程抓取豆瓣top250,其实数据量不多,单线程完全够用,初学多线程抓取,就当练练手好了,下次换个数据量大的网页来抓取
import requests
from lxml import etree
import time
from concurrent.futures import ThreadPoolExecutor
def download_one_page(url, headers):
# 拿到页面源代码
resp = requests.get(url=url
多线程抓取豆瓣top250,其实数据量不多,单线程完全够用,初学多线程抓取,就当练练手好了,下次换个数据量大的网页来抓取
import requests
from lxml import etree
import time
from concurrent.futures import ThreadPoolExecutor
def download_one_page(url, headers):
# 拿到页面源代码
resp = requests.get(url=url