爬虫
python 爬虫
HJZ11
记录学习之路,欢迎交流
展开
-
爬虫05-lxml-模块使用方法
原创 2020-05-21 17:04:53 · 140 阅读 · 0 评论 -
爬虫-BeautifulSoup-蛋壳公寓租房
import re,requestsfrom bs4 import BeautifulSoupdef get_page_info(page=1): url="https://www.danke.com/room/sh?page="+str(page) headers={ "User-Agent":"Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrom原创 2020-05-21 16:46:54 · 355 阅读 · 0 评论 -
爬虫04-BeautifulSoup模块使用方法
原创 2020-05-21 16:18:03 · 116 阅读 · 0 评论 -
爬虫-requsets-re-前程无忧-数据分析师
import requests,reurl="https://search.51job.com/list/020000,000000,0000,00,9,99,%25E6%2595%25B0%25E6%258D%25AE%25E5%2588%2586%25E6%259E%2590%25E5%25B8%2588,2,1.html?lang=c&postchannel=0000&workyear=99&cotype=99°reefrom=99&jobterm=9原创 2020-05-21 15:35:06 · 508 阅读 · 0 评论 -
爬虫03-re模块使用方法
原创 2020-05-21 15:15:04 · 386 阅读 · 0 评论 -
爬虫02-模拟登陆
import requests# data={# "ecmsfrom": "",# "enews": "login",# "tobind": "0",# "useraccount": "自己的账号",# "password": "自己的密码",# "lifetime": "2592000",# }# url="https://www.woyaogexing.com/e/member/doaction.php"headers={ "U原创 2020-05-21 14:11:16 · 479 阅读 · 0 评论 -
爬虫01-requests模块
import requests# 网址url="http://www.antvv.com"url="https://www.woyaogexing.com"r=requests.get(url)# print(r) #<Response [200]> 返回值对象,返回了一个http状态码print(type(r))# text 获取返回值的源代码,如果误判了编码会乱码# print(r.text)# status_code 返回的状态码,200代表请求成功,得到返回值p原创 2020-05-21 14:06:11 · 449 阅读 · 0 评论 -
爬虫-前程无忧-python职位
# -*- coding: utf-8 -*-import requestsimport reurl = "https://search.51job.com/list/170200,000000,0000,00,9,99,python,2,1.html?lang=c&stype=&postchannel=0000&workyear=99&cotype=99°reefrom=99&jobterm=99&companysize=99&原创 2020-05-21 01:27:58 · 1428 阅读 · 5 评论 -
爬虫-智联招聘
import requests,jsonfrom lxml import etreeurl="https://fe-api.zhaopin.com/c/i/sou?start=90&pageSize=90&cityId=538&salary=0,0&workExperience=-1&education=-1&companyType=-1&am...原创 2019-12-01 21:31:29 · 490 阅读 · 0 评论 -
爬虫-QQ音乐动态爬取
import requests,random,timeimport jsondef get_hot_song_list(): url="https://u.y.qq.com/cgi-bin/musicu.fcg?-=getUCGI16992261760781546&g_tk=5381&loginUin=3004439232&hostUin=0&for...原创 2019-12-01 21:26:18 · 961 阅读 · 0 评论