1. Introduction to pyquery
1.1 Installation
Install command:
pip install pyquery
For more detail, see the official pyquery documentation.
1.2 Usage
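>>> from pyquery import PyQuery as pq
>>> # Sample document for this session (an assumption; any HTML containing <p id="hello" class="hello"> works)
>>> d = pq("<div><p id='hello' class='hello'>Hello world !</p></div>")
>>> d("#hello")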
[<p#hello.hello>]
>>> p = d("#hello")
>>> print(p.html())
Hello world !
>>> p.html("you know <a href='http://python.org/'>Python</a> rocks")
[<p#hello.hello>]
>>> print(p.html())
you know <a href="http://python.org/">Python</a> rocks
>>> print(p.text())
you know Python rocks
2. Translating words with Youdao
import re
import time
from pyquery import PyQuery as pq
headers = {
    "User-Agent": "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_14_6) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/84.0.4147.135 Safari/537.36"
}
words_list = ['work', 'can', 'be', 'kind']
new_words_list = []
for item in words_list:
    url = 'http://www.youdao.com/w/' + item + '/#keyfrom=dict2.top'
    # Fetch the translation page and parse it
    data = pq(url=url, headers=headers, encoding='utf-8')
    time.sleep(3)  # pause between requests to avoid triggering anti-scraping measures
    # CSS selector: #phrsListTab > div.trans-container > ul
    explains = data("#phrsListTab > div.trans-container > ul")  # a pyquery.pyquery.PyQuery object
    # Convert the result to a string, then use a regular expression to strip tags, newlines and spaces
    explains = str(explains)
    explains = re.sub("<.*?>|\n| ", "", explains)
    print(explains)
    new_item = (item, explains)
    new_words_list.append(new_item)
print(new_words_list)
Output: