- Blog posts (6)
Original: Downloading 4K images
import requests import os from lxml import etree if __name__ == '__main__': url = 'https://pic.netbian.com/index.html' headers = { 'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/91.0.
2021-08-05 15:28:21 102
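The preview above is cut off before the download step. As a rough sketch of how such a script might store what it fetches, the hypothetical helper `save_image` below creates a target folder with `os` and writes raw bytes into it. The function name, folder name, and file name are assumptions for illustration and do not come from the original post.

```python
import os


def save_image(directory, filename, data):
    """Write raw image bytes to directory/filename and return the path.

    Hypothetical helper; not taken from the original post.
    """
    # Create the target folder if it does not exist yet.
    os.makedirs(directory, exist_ok=True)
    path = os.path.join(directory, filename)
    # Image data is binary, so the file is opened in 'wb' mode.
    with open(path, 'wb') as fp:
        fp.write(data)
    return path
```

In a real script, `data` would typically be the `.content` of a `requests.get(...)` response for an image URL extracted from the page.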
Original: Douban movie rankings
import requests import json if __name__ == '__main__': url = 'https://movie.douban.com/j/chart/top_list' headers = { 'user-agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/91.0.4472.124 Safar
2021-08-05 15:27:27 148
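The preview above stops before the request is sent. Below is a minimal sketch of how one page of results might be requested from the `top_list` URL shown in the post and saved as JSON. The query parameter names and default values (`type`, `interval_id`, `action`, `start`, `limit`) are assumptions that do not appear in the truncated preview, and the helper names are hypothetical.

```python
import json

URL = 'https://movie.douban.com/j/chart/top_list'  # URL taken from the post


def build_params(start, limit, movie_type='24', interval_id='100:90'):
    """Build the query parameters for one page (names and defaults are assumed)."""
    return {
        'type': movie_type,
        'interval_id': interval_id,
        'action': '',
        'start': str(start),   # index of the first movie to return
        'limit': str(limit),   # number of movies per request
    }


def fetch_page(start, limit, headers):
    """Request one page and return the decoded JSON (performs a network call)."""
    # Imported here so the other helpers work without requests installed.
    import requests
    response = requests.get(URL, params=build_params(start, limit),
                            headers=headers, timeout=10)
    response.raise_for_status()
    return response.json()


def save_json(items, path):
    """Save the data as UTF-8 JSON, keeping Chinese titles readable."""
    with open(path, 'w', encoding='utf-8') as fp:
        json.dump(items, fp, ensure_ascii=False)
```

Passing `ensure_ascii=False` keeps non-ASCII movie titles as readable text in the output file instead of `\u` escapes.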
Original: National Medical Products Administration (NMPA) production license information
import requests import json if __name__ == '__main__': page_url = 'http://scxk.nmpa.gov.cn:81/xk/itownet/portalAction.do?method=getXkzsList' headers = { 'user-agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, li
2021-08-05 15:26:08 105
Original: Scraping Lagou
import time import requests from pymongo import MongoClient from fake_useragent import UserAgent headers = { 'cookie': 'JSESSIONID=ABAAAECAAEBABIIF581B734F530AD0ECB170938F7C05171; WEBTJ-ID=2021073%E4%B8%8B%E5%8D%886:32:15183215-17a6bed0db95ae-0997d60
2021-08-05 15:21:37 339
Original: Scraping articles from Toutiao Yidu (头条易读)
import requests from lxml import etree import pandas as pd import time headers = { 'User-Agent':'Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/86.0.4240.198 Safari/537.36' } user_data = [] def get_data(page): for i in r
2021-08-05 15:19:57 148
Original: Automatic login to 12306 with Selenium
Simulating a 12306 login with Selenium: from selenium import webdriver import time import requests from hashlib import md5 from PIL import Image from selenium.webdriver import ActionChains from selenium.webdriver import ChromeOptions options = ChromeOptions() options.add_experi
2021-07-28 03:36:48 200