Python Web Scraping
Hi-CWJ
Scraping Multiple Websites with selenium and Triggering Crawls from a GUI
selenium scraping code, webcrawl.py:
import re
import time
import json
from selenium import webdriver
from selenium.webdriver.common.by import By
from selenium.webdriver.chrome.options import Options
from selenium.common.exceptions import TimeoutException, StaleElemen…
Original · 2024-01-09 18:53:12 · 672 views · 0 comments
Python: Douban Movie Top 250 with xpath, beautiful soup, and pyquery
xpath:
import requests
import time
import csv
from requests import RequestException
from lxml import etree
def get_one_page(url):
    try:
        headers = {
            'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML…
Original · 2021-12-12 20:50:11 · 707 views · 0 comments
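The preview above is cut off before any parsing happens, so what follows is only a rough sketch of the XPath extraction step, not the author's code. It swaps `lxml` for the standard library's `xml.etree.ElementTree` (which supports a limited XPath subset) so that it runs without third-party packages or network access. The sample markup, its class names, and the `parse_items` helper are all hypothetical.

```python
import xml.etree.ElementTree as ET

# Hypothetical, well-formed sample markup used only for illustration;
# it is NOT the real structure of the Douban Top 250 page.
SAMPLE = """
<ol class="grid_view">
  <li><div class="item"><span class="title">Movie A</span><span class="rating_num">9.7</span></div></li>
  <li><div class="item"><span class="title">Movie B</span><span class="rating_num">9.6</span></div></li>
</ol>
"""


def parse_items(markup):
    """Return (title, rating) tuples found in the markup (hypothetical helper)."""
    root = ET.fromstring(markup.strip())
    rows = []
    # ".//div[@class='item']" selects every <div class="item"> below the root.
    for item in root.findall(".//div[@class='item']"):
        title = item.findtext("span[@class='title']")
        rating = item.findtext("span[@class='rating_num']")
        rows.append((title, float(rating)))
    return rows


if __name__ == "__main__":
    for title, rating in parse_items(SAMPLE):
        print(title, rating)
```

With `lxml`, the same idea would typically go through `etree.HTML(...)` and `.xpath(...)`, which tolerate real-world HTML that is not well-formed XML; the stdlib parser used here does not.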