requests处理iframe①

最新推荐文章于 2024-09-02 09:12:50 发布

什么，你说啥⊙ω⊙

最新推荐文章于 2024-09-02 09:12:50 发布

阅读量1.4k

点赞数 1

分类专栏：笔记文章标签： python

本文链接：https://blog.csdn.net/qq_59970317/article/details/121412592

版权

笔记专栏收录该内容

13 篇文章 1 订阅

订阅专栏

由于测试一半，网站被封，先记录一下

import requests,re
from concurrent.futures import ThreadPoolExecutor


headers = {
            "User-Agent": "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/95.0.4638.69 Safari/537.36",
            "cookie": "Hm_lvt_b2b84ee87a8af3f8ad100a3da41c8ad0=1636809421,1636895260,1638451748,1638537010; Hm_lpvt_b2b84ee87a8af3f8ad100a3da41c8ad0=1638537010; Hm_lvt_b5093235610ff1922206ff61ddb2a910=1636807512,1636895261,1638451756,1638537011; Hm_lpvt_b5093235610ff1922206ff61ddb2a910=1638537011; prec=94769%3A2322363%2C94800%3A2322547%2C60215%3A2153100%2C60173%3A2151552%2C60204%3A2152665%2C94789%3A2322520%2C94768%3A2322357%2C94753%3A2321496%2C94654%3A2321560",
            "Referer": "http://www.meiju8.cc/movie/94769.html"
        }

# 通过网页找到iframe链接
# 通过请求iframe链接来获取m3u8文件
frame = "https://jx.fqzy.cc/jx.php?url=FQ:a1924f01f35f3e7eaa6198e9dc8bca692bc3e9516ca75ef7607cddc72ee18f92709290948e8be64bcf752598bd3aba674f03b6d2583f72ad7a"

res = requests.get(frame,headers=headers)

# 提取m3u8文件的下载链接
compile = re.compile('"url": "(?P<url>.*?)",',re.S)
m3u8_url = compile.search(res.text).group("url")

# 下载m3u8文件
res_m3u8 = requests.get(m3u8_url)

# 写入m3u8文件
with open("meiju_m3u8.txt","wb") as f:
    f.write(res_m3u8.content)

部分内容如下，大概有1647行左右

#EXTM3U
#EXT-X-VERSION:3
#EXT-X-TARGETDURATION:11
#EXT-X-MEDIA-SEQUENCE:0
#EXTINF:8.125,
https://lf3-static.bytednsdoc.com/obj/tos-cn-o-0000/e99e5ea6084e4cc0adb9b251c86441af
#EXTINF:8.417,
https://lf6-static.bytednsdoc.com/obj/tos-cn-o-0000/caf770f47a284d9193b44c8ec0740ae3
#EXTINF:4.833,
https://lf9-static.bytednsdoc.com/obj/tos-cn-o-0000/e6520e84b61246a490e0c0426ff268c6
#EXTINF:10.417,
https://lf1-cdn-tos.bytegoofy.com/obj/tos-cn-o-0000/dff661b13c9a48eb8f14361a74c16cc4
#EXTINF:10.417,
https://lf3-cdn-tos.bytegoofy.com/obj/tos-cn-o-0000/f7f44bc9286c42348d9a7fad835fb052
#EXTINF:5.292,
https://lf6-cdn-tos.bytegoofy.com/obj/tos-cn-o-0000/8ff95b1924614fa7a49317e9bbc94f0e
#EXTINF:10.417,
https://lf-cdn-tos.bytescm.com/obj/tos-cn-o-0000/48e90dbdabd143009f16f58a827f6502
#EXTINF:10.417,
https://lf1-cdn-tos.bytescm.com/obj/tos-cn-o-0000/5cbd32d7287a491e9a340146302319a1

读取m3u8文件，提取ts文件的下载链接

通过请求下载链接来写入数据

# 下载视频使用
video_urls = []
# 合并视频使用
ts_name = []

# 读取全部的链接，添加进列表
with open("meiju_m3u8.txt","r") as f:
    for line in f:
        if line.startswith("#"):
            continue
        line = line.strip()
        video_urls.append(line)

# 下载ts视频
def crawl(url):
    video = requests.get(url)
    name = url.split("/")[-1]+".ts"

    ts_name.apppend(name)

    with open(name, "wb") as f:
        f.write(video.content)

# 创建线程池
with ThreadPoolExecutor(max_workers=10) as pool:
    download = pool.map(crawl,video_urls)

合并视频

ts_name = " ".join(ts_name)
os.system(f"copy {ts_name} > video.mp4")

print("over!!!")

网站恢复了，

iframe获取方式补充

requests处理iframe②

什么，你说啥⊙ω⊙

关注

1
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
复制链接

分享到 QQ

分享到新浪微博

扫一扫

专栏目录