python采集keep运动数据

最新推荐文章于 2025-03-05 23:44:44 发布

小白学大数据

最新推荐文章于 2025-03-05 23:44:44 发布

阅读量4.8k

点赞数 3

文章标签： python

本文链接：https://blog.csdn.net/Z_suger7/article/details/116714582

版权

我国2009年确立“全民健身日”；2012年，“广泛开展全民健身运动”写入十八大报告；到2014年上升为国家战略，全民健身的分量越来越重。推动全民健身和全民健康深度融合，通过全民健身实现全民健康。
全民健身运动在我国越来越普及，每天清晨和傍晚时分，城市的广场上、公园里，到处是运动健身的人们，跑步、打球、跳舞、游泳、打太极拳等运动项目十分丰富，人们在运动中获得越来越多的幸福感。
随之诞生出了很多运动app，例如keep,咪咕，微信等，大家可以通过app随时记录自己的运动数据，特别是对于很多的减肥人群来说，通过运动数据可以更好的让自己坚持下去。我身边就有很多的朋友使用kee使用keep来记录锻炼信息，想了解都有哪些群体使用keep这款健身软件,今天我们就使用python抓取些关于keep用户的跑步数据。
#! -- encoding:utf-8 --
import requests
import random
import requests.adapters
# 要访问的目标页面
targetUrlList = [
“https://https://www.gotokeep.com”,
“https://https://www.gotokeep.com//headers”,
“https://hhttps://www.gotokeep.com//user-agent”,
]
# 代理服务器(产品官网 www.16yun.cn)
proxyHost = “t.16yun.cn”
proxyPort = “31111”
# 代理验证信息
proxyUser = “username”
proxyPass = “password”
proxyMeta = “http://%(user)s:%(pass)s@%(host)s:%(port)s” % {
“host”: proxyHost,
“port”: proxyPort,
“user”: proxyUser,
“pass”: proxyPass,
}
# 设置 http和https访问都是用HTTP代理
proxies = {
“http”: proxyMeta,
“https”: proxyMeta,
}
# 设置IP切换头
tunnel = random.randint(1, 10000)
headers = {“Proxy-Tunnel”: str(tunnel)}
class HTTPAdapter(requests.adapters.HTTPAdapter):
def proxy_headers(self, proxy):
headers = super(HTTPAdapter, self).proxy_headers(proxy)
if hasattr(self, ‘tunnel’):
headers[‘Proxy-Tunnel’] = self.tunnel
return headers
# 访问三次网站，使用相同的tunnel标志，均能够保持相同的外网IP
for i in range(3):
s = requests.session()
a = HTTPAdapter()
# 设置IP切换头
a.tunnel = tunnel
s.mount(‘https://’, a)
for url in targetUrlList:
r = s.get(url, proxies=proxies)
print r.text
在这里插入图片描述