Homework06

最新推荐文章于 2024-05-10 16:59:45 发布

枫梓-

最新推荐文章于 2024-05-10 16:59:45 发布

阅读量139

点赞数

本文链接：https://blog.csdn.net/Single____/article/details/99998810

版权

邮箱

import requests


headers = {
'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/76.0.3809.100 Safari/537.36'
}

url = 'https://movie.douban.com/subject/10430826/discussion/44288625/'
response = requests.get(url,headers=headers)
html = response.text
print(html)

import re
str_ = html
regex = re.compile("[a-z0-9\.\-+_]+@[a-z0-9\.\-+_]+\.[a-z]+")
res = regex.findall(str_)
for r in res:
    print (r)


    
with open("D:/10999.txt","wb") as f: 
    for i in res: 
        test2 = i.encode('UTF-8') 
        f.write(test2+b'\n') 
        f.close

在这里插入图片描述

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

枫梓-

关注关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
Homework06

import requestsheaders = {'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/76.0.3809.100 Safari/537.36'}url = 'https://movie.douban.com/sub...
复制链接

扫一扫