Threaded Crawler

ayangyangyang25

Category: Python

Article link: https://blog.csdn.net/ayangyangyang25/article/details/97920031

A plain (single-threaded) crawler for Qiushibaike jokes

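import re
import urllib.error
import urllib.request

# Send a browser-like User-Agent with every request made through urlopen().
headers = ("User-Agent",'Mozilla/5.0 (Windows NT 6.1; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/69.0.3497.81 Safari/537.36')
opener = urllib.request.build_opener()
opener.addheaders = [headers]
urllib.request.install_opener(opener)

# Each joke sits in a <span> inside <div class="content">; re.S lets "." match newlines.
pat = '<div class="content">.*?<span>(.*?)</span>.*?</div>'

for i in range(1, 2):  # page 1 only; widen the range to fetch more pages
    url = 'https://www.qiushibaike.com/text/page/' + str(i) + '/'
    try:
        pagedata = urllib.request.urlopen(url).read().decode("utf-8", "ignore")
    except urllib.error.URLError as e:
        # Report the failure and move on to the next page.
        print("Failed to fetch", url, ":", e)
        continue
    datas = re.compile(pat, re.S).findall(pagedata)
    for data in datas:
        print(data)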
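To see what the pattern captures without touching the network, the extraction step can be tried on a small inline snippet. This is only a sketch: `SAMPLE_HTML` is hypothetical markup written to match the shape the pattern expects, not a copy of the real page, and `extract_jokes` is an illustrative helper name. The `<br/>` handling is an added assumption about how line breaks might appear inside a joke.

```python
import re

# Hypothetical sample markup, shaped like the structure the pattern expects.
SAMPLE_HTML = """
<div class="content">
<span>
First joke
</span>
</div>
<div class="content">
<span>
Second joke<br/>with a line break
</span>
</div>
"""

# Same pattern as the crawler; re.S lets "." match across newlines.
PATTERN = re.compile(r'<div class="content">.*?<span>(.*?)</span>.*?</div>', re.S)


def extract_jokes(html):
    """Return the text of every captured <span>, stripped of surrounding whitespace."""
    jokes = []
    for raw in PATTERN.findall(html):
        # Assumption: line breaks inside a joke appear as <br> or <br/> tags.
        text = re.sub(r'<br\s*/?>', '\n', raw)
        jokes.append(text.strip())
    return jokes


if __name__ == "__main__":
    for joke in extract_jokes(SAMPLE_HTML):
        print(joke)
        print("-" * 20)
```

Separating the parsing into its own function also makes it easy to reuse once the downloading is moved into threads.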

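import threading

# Each thread subclasses threading.Thread and puts its work in run().
class A(threading.Thread):
    def __init__(self):
        threading.Thread.__init__(self)

    def run(self):
        for i in range(0, 10):
            print("I am thread A")

class B(threading.Thread):
    def __init__(self):
        threading.Thread.__init__(self)

    def run(self):
        for i in range(0, 10):
            print("I am thread B")

# start() runs each run() method in its own thread, so the two outputs may interleave.
t1 = A()
t1.start()
t2 = B()
t2.start()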
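The two pieces above can be combined into a threaded crawler. The following is a minimal sketch of one possible way to do it, not the author's implementation: the names `CrawlerThread`, `crawl`, and `fetch_page`, the shared `queue.Queue` of page numbers, the placeholder `USER_AGENT` string, and the 10-second timeout are all assumptions made for illustration. The download function is passed in as a parameter so the threading logic can be exercised with a fake fetcher.

```python
import queue
import re
import threading
import urllib.request

PATTERN = re.compile(r'<div class="content">.*?<span>(.*?)</span>.*?</div>', re.S)
USER_AGENT = "Mozilla/5.0"  # placeholder; substitute a full browser string if needed


def fetch_page(page):
    """Download one listing page, using the same URL scheme as the plain crawler."""
    url = "https://www.qiushibaike.com/text/page/" + str(page) + "/"
    req = urllib.request.Request(url, headers={"User-Agent": USER_AGENT})
    with urllib.request.urlopen(req, timeout=10) as resp:
        return resp.read().decode("utf-8", "ignore")


class CrawlerThread(threading.Thread):
    """Worker that pulls page numbers from a shared queue until it is empty."""

    def __init__(self, pages, results, lock, fetch=fetch_page):
        super().__init__()
        self.pages = pages
        self.results = results
        self.lock = lock
        self.fetch = fetch

    def run(self):
        while True:
            try:
                page = self.pages.get_nowait()
            except queue.Empty:
                return  # no pages left, so this worker is done
            try:
                html = self.fetch(page)
                jokes = [j.strip() for j in PATTERN.findall(html)]
            except Exception as exc:
                # A failed page is recorded as empty instead of killing the thread.
                jokes = []
                print("page", page, "failed:", exc)
            with self.lock:
                self.results[page] = jokes


def crawl(page_numbers, workers=3, fetch=fetch_page):
    """Fetch all pages with a fixed number of threads; return {page: [jokes]}."""
    pages = queue.Queue()
    for p in page_numbers:
        pages.put(p)
    results = {}
    lock = threading.Lock()
    threads = [CrawlerThread(pages, results, lock, fetch) for _ in range(workers)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()  # wait until every worker has finished
    return results
```

Calling `crawl(range(1, 4))` would issue real requests, so it only works if the site still serves those URLs. Because the workers mostly wait on network I/O, several pages can be in flight at once even under the global interpreter lock.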
