python爬去知乎动态内容_Python-爬虫-爬取知乎的标题和当页显示的文字

最新推荐文章于 2021-08-18 23:44:39 发布

weixin_39548438

最新推荐文章于 2021-08-18 23:44:39 发布

阅读量129

点赞数

文章标签： python爬去知乎动态内容

# coding:utf-8

import requests

from bs4 import BeautifulSoup

quesNumStr = str(input("请输入搜索关键字："))

url = ‘https://www.zhihu.com/search?type=content&q=‘+quesNumStr

headers = {

‘User-Agent‘: ‘Mozilla/5.0 (Macintosh; Intel Mac OS X 10_12_0) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/55.0.2883.95 Safari/537.36‘ # your user-Agent here

}

data = requests.get(url, headers=headers)

soup = BeautifulSoup(data.text, ‘lxml‘)

liList = soup.select(‘li‘)

print(len(liList))

for li in liList:

try:

temp1 = li.select(‘a[class="js-title-link"]‘)

if temp1:

print(‘The title is :‘)

print(temp1[0].get_text())

temp2 = li.select(‘div[class="summary hidden-expanded"]‘)

if temp2:

print(‘The content is:‘)

print(temp2[0].text)

except:

pass

原文地址：http://www.cnblogs.com/fredkeke/p/7003923.html

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

weixin_39548438

关注关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
python爬去知乎动态内容_Python-爬虫-爬取知乎的标题和当页显示的文字

# coding:utf-8import requestsfrom bs4 import BeautifulSoupquesNumStr = str(input("请输入搜索关键字："))url = ‘https://www.zhihu.com/search?type=content&q=‘+quesNumStrheaders = {‘User-Agent‘: ‘Mozilla/5.0 (Maci...
复制链接

扫一扫