爬虫
凤枭香
这个作者很懒,什么都没留下…
展开
-
爬取企查查公司信息
import requestsimport timefrom lxml import etreeimport pandas as pdimport csvbase_url = 'https://www.qcc.com/web/search?key='headers = { 'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:86.0) Gecko/20100101 Firefox/86.0'}data = pd.r原创 2021-09-06 16:16:31 · 1602 阅读 · 2 评论 -
爬取企查查公司URL
import timeimport pandas as pdfrom selenium import webdriverfrom selenium.webdriver import ActionChainsa = []def login(driver): driver.delete_all_cookies() url = "https://www.qcc.com/weblogin?back=%2F" #https://www.qcc.com/weblogin?back=%2F原创 2021-09-06 16:15:12 · 498 阅读 · 0 评论 -
营业执照数据生成
import pandas as pdfrom PIL import Imagefrom PIL import ImageFilterfrom PIL import ImageEnhanceimport cv2from PIL import ImageDraw, ImageFontdata=pd.read_csv('E:\qichacha\data\qichacha.csv',encoding='gbk')fnt = ImageFont.truetype(r'E:\qichacha\gen_d原创 2021-09-06 17:33:03 · 1853 阅读 · 1 评论