Web Scraping in Practice: Scraping All Course Names and Prices from CSDN Academy

Latest recommended article published on 2024-07-12 16:42:46

weixin_45197326


Article tags: xpath json

Article link: https://blog.csdn.net/weixin_45197326/article/details/104385216


import requests
from lxml import etree
import csv
import pandas


class CSDNspider:
    # Scrape all course names and prices from CSDN Academy

    def __init__(self):
        self.url = 'https://edu.csdn.net/courses/o280_s355'

    def fenqu(self):
        # Fetch the listing page and select the two kinds of course blocks
        response = requests.get(self.url).content
        neirong = etree.HTML(response)
        each1 = neirong.xpath('//div[@class="course_item acsdnd_item"]')
        each2 = neirong.xpath('//div[@class="course_item"]')
        return each1, each2

    def name(self, each1, each2):  # Build the list of course names
        text = []
        for e in each1:
            # [0] raises IndexError when nothing matches; a guard such as
            # "... if len(matches) > 0 else None" would avoid that
            name = e.xpath('.//dt/div[@class="titleInfor ellipsis-2"]/text()')[0]
            # print(type(name))
            text.append(name)
        for e in each2:
            # The published listing is cut off in the middle of this call.
            # Assumption: the remaining lines mirror the first loop.
            name1 = e.xpath('.//span[@class="title ellipsis-2"]/@title')[0]
            text.append(name1)
        return text
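
The listing imports csv and pandas, but the part that collects prices and saves the results is not visible in the published text. As a rough sketch only, here is one way the collected values might be written out, assuming the names and prices end up in two lists of equal length. The helpers `first_or_none` and `write_courses`, the column headers, and the sample values are all hypothetical and do not come from the original code.

```python
import csv
import io


def first_or_none(matches):
    """Return the first item of an XPath result list, or None if it is empty.

    Hypothetical helper: it plays the role of the commented-out
    "if len(...) > 0 else None" guard in the original listing.
    """
    return matches[0] if matches else None


def write_courses(names, prices, fileobj):
    """Write a header row, then one (name, price) row per course.

    Assumes names and prices are parallel lists; zip() stops at the
    shorter one, so unmatched trailing items are silently dropped.
    """
    writer = csv.writer(fileobj)
    writer.writerow(["name", "price"])  # hypothetical column headers
    for name, price in zip(names, prices):
        writer.writerow([name, price])


if __name__ == "__main__":
    # Made-up sample data, used only to show the call shape.
    sample_names = ["Course A", "Course B"]
    sample_prices = ["99.00", "199.00"]

    buffer = io.StringIO()
    write_courses(sample_names, sample_prices, buffer)
    print(buffer.getvalue())
```

When writing to a real file instead of an in-memory buffer, the csv module documentation recommends opening it with `newline=""`, for example `open("courses.csv", "w", newline="", encoding="utf-8")`, so that no extra blank lines appear on Windows.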
