Shandong University Project Training: Personal Record (11)

1994695

Last modified: 2024-06-24 07:54:58

Category: Project Training
Tags: Artificial Intelligence

First published: 2024-06-24 07:54:07

Article link: https://blog.csdn.net/weixin_74097070/article/details/139911597

Project team: DrugLLM development team

Date: 2024/5/26

Progress this week

Wrote a web crawler to scrape paper data.

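import requests
from bs4 import BeautifulSoup


def fetch_paper_titles(url):
    """Return the paper titles found at url, or an empty list on failure."""
    try:
        # Send the HTTP request; the timeout keeps the call from hanging indefinitely
        response = requests.get(url, timeout=10)
    except requests.RequestException as e:
        print("Request error:", e)
        return []

    # Make sure the request succeeded
    if response.status_code != 200:
        print("Request failed, status code:", response.status_code)
        return []

    # Parse the HTML response with BeautifulSoup
    soup = BeautifulSoup(response.text, 'html.parser')

    # Assumes paper titles are contained in <div> elements with class 'paper-title'
    title_elements = soup.find_all('div', class_='paper-title')

    # Extract the text of every matching element
    return [title.get_text(strip=True) for title in title_elements]


url = "https://paperswithcode.com/"

# Call the function and print the results
titles = fetch_paper_titles(url)
for title in titles:
    print(title)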
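
The extraction step can also be checked offline, without sending any request. The following is only a minimal sketch under assumptions: it swaps BeautifulSoup for the standard library's html.parser so it runs with no third-party packages, the names TitleParser, parse_paper_titles, and SAMPLE_HTML are hypothetical, and the 'paper-title' class is the same assumption the script above makes, not something confirmed about the real page.

```python
from html.parser import HTMLParser


class TitleParser(HTMLParser):
    """Collects the text of every <div> whose class list contains 'paper-title'."""

    def __init__(self):
        super().__init__()
        self.titles = []
        self._depth = 0    # <div> nesting depth inside a matching element
        self._chunks = []  # text fragments of the element being read

    def handle_starttag(self, tag, attrs):
        if tag != "div":
            return
        if self._depth > 0:
            # A nested <div> inside a title element: track it so we close correctly
            self._depth += 1
        else:
            classes = (dict(attrs).get("class") or "").split()
            if "paper-title" in classes:  # assumed class name
                self._depth = 1
                self._chunks = []

    def handle_endtag(self, tag):
        if tag == "div" and self._depth > 0:
            self._depth -= 1
            if self._depth == 0:
                title = "".join(self._chunks).strip()
                if title:
                    self.titles.append(title)

    def handle_data(self, data):
        if self._depth > 0:
            self._chunks.append(data)


def parse_paper_titles(html):
    """Return the titles found in an HTML string (no network access)."""
    parser = TitleParser()
    parser.feed(html)
    parser.close()
    return parser.titles


# Hypothetical sample page used only to exercise the parser
SAMPLE_HTML = """
<html><body>
  <div class="paper-title"><a href="/paper/a">First Paper</a></div>
  <div class="other">Not a title</div>
  <div class="paper-title featured">  Second Paper  </div>
</body></html>
"""

sample_titles = parse_paper_titles(SAMPLE_HTML)
print(sample_titles)
```

Keeping the parsing in its own function makes it possible to test the selector against a saved copy of the page before running the crawler against the live site.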