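PyQuery is a Python library that closely follows the jQuery API. It is built on lxml, so it parses web pages quickly, and it is powerful, flexible, and easy to use.
The example below takes a single line from a classical Chinese poem and returns the full poem, its author, and its title.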
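import re
from urllib.parse import quote

from pyquery import PyQuery as pq


def searchPoem(sentence):
    # Build the search URL; percent-encode the query because it is non-ASCII.
    my_url = "https://so.gushiwen.org/search.aspx?value=" + quote(sentence)
    d = pq(url=my_url)
    # The first <textarea> is expected to hold the poem, author, and title in one string.
    poemList = d("textarea")
    if not poemList:
        return None
    poemRes = poemList[0].text or ""
    # Split the string into (full poem, author, title).
    poem = re.match(r"(.*)——(.*)《(.*)》", poemRes, re.M)
    if poem is not None:
        return poem.groups()
    return None


sentence = "柴门闻犬吠"
poem = searchPoem(sentence)
print(poem)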
Output:
('日暮苍山远,天寒白屋贫。柴门闻犬吠,风雪夜归人。', '唐代·刘长卿', '逢雪宿芙蓉山主人')
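
The parsing step can be tried on its own, without a network request. The sketch below applies the same pattern as `searchPoem` to a sample string; the sample is an assumption, reconstructed from the output above rather than copied from the site, and `split_poem` is a hypothetical helper name. It also swaps `re.M` for `re.S`: `.` does not match a newline by default, so a poem spread over several lines would not match without `re.S`.

```python
import re

# Same pattern as in searchPoem; re.S lets "." cross line breaks.
PATTERN = re.compile(r"(.*)——(.*)《(.*)》", re.S)


def split_poem(text):
    """Return (poem, author, title), or None if the text does not match."""
    m = PATTERN.match(text)
    return m.groups() if m else None


# Assumed textarea content, reconstructed from the output shown above.
sample = "日暮苍山远,天寒白屋贫。柴门闻犬吠,风雪夜归人。——唐代·刘长卿《逢雪宿芙蓉山主人》"
print(split_poem(sample))

# A poem split across two lines still parses because of re.S.
print(split_poem("first line\nsecond line——author《title》"))
```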