本次操作的网站是一个用于给姓名打分的网站,我们要拿到的数据是网站对名字的打分,首先来试用一下这个网站;
这里我们需要输入一些,个人信息。
点击开始测试后我们会得到这样的评分,这就是我们需要的数据。
来分析一下这个页面
需要的数据在xingming.asp中,来看看这个:
这里就是我们需要提交的表单信息有这几项:
我这里自己制作了一份数据 :
这里提交数据是要进行encode编码的:
headers中设置好:
headers = {
"Content-Type": "application/x-www-form-urlencoded",
"User-Agent": "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/126.0.0.0 Safari/537.36"
}
接下来是完整代码:
import csv
from urllib.parse import urlencode
import requests
from lxml import etree
root_url = "https://life.httpcn.com/xingming.asp"
def get_score(data_list: list):
data = {
"isbz": 1,
"xing": data_list[0].encode("gbk"),
"ming": data_list[1].encode("gbk"),
"sex": data_list[9],
"data_type": 0,
"year": data_list[4],
"month": data_list[5],
"day": data_list[6],
"hour": data_list[7],
"minute": data_list[8],
"pid": data_list[2].encode("gbk"),
"cid": data_list[3].encode("gbk"),
"wxxy": 0,
"xishen": "金".encode("gbk"),
"yongshen": "金".encode("gbk"),
"check_agree": "agree",
"act": "submit"
}
headers = {
"Content-Type": "application/x-www-form-urlencoded",
"User-Agent": "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/126.0.0.0 Safari/537.36"
}
# 因为headers里面限制了该网页提交的数据需要urlencode编码,所以在提交data时,需要进行编码才可以
response = requests.get(root_url, data=urlencode(data), headers=headers)
html = response.content.decode("gbk")
tree = etree.HTML(html)
wuge_score = tree.xpath("/html/body/div[5]/div[1]/div[3]/div[4]/div[14]/font[1]/b/text()")[0]
bazi_score = tree.xpath("/html/body/div[5]/div[1]/div[3]/div[4]/div[14]/font[2]/b/text()")[0]
return "%s%s" % (data_list[0], data_list[1]), wuge_score, bazi_score
# 批量调用此函数来给名字打分
with open("名字集合.csv") as fin, open("名字评分.txt", "w", encoding="utf8") as fout:
name_information = csv.reader(fin)
next(name_information)
for data in name_information:
for i in range(len(data)):
if data[i] == "":
data[i] = 0
if i >= 4:
data[i] = int(data[i])
xingming, wuge_score, bazi_score = get_score(data)
fout.write("%s \t%s\t%s\n" % (xingming, wuge_score, bazi_score))
输出结果: