【自学 PyTorch 】第二课 —— 【代码实战记录】PyTorch 数据集读取完整代码

最新推荐文章于 2024-04-04 22:12:27 发布

nemo_0410

最新推荐文章于 2024-04-04 22:12:27 发布

阅读量333

点赞数 2

分类专栏：深度学习/PyTorch 文章标签： pytorch python 机器学习深度学习神经网络

本文链接：https://blog.csdn.net/weixin_42306148/article/details/119235698

版权

深度学习/PyTorch 专栏收录该内容

71 篇文章 29 订阅

订阅专栏

本文介绍了如何使用PyTorch加载自定义数据集，通过定义一个继承自`Dataset`的类，实现数据的读取。在`__init__`方法中，设置数据路径和标签目录；`__getitem__`方法用于根据索引获取图像和标签；`__len__`返回数据列表长度。实例化类并打印数据集长度，展示了一个简单的数据加载流程。

摘要由CSDN通过智能技术生成

写一个系列代码实战，争取每天都更。

倒逼自己赶紧提升写 Python 代码的手感。

一、代码

在这里插入图片描述

import os

from PIL import Image
from torch.utils.data import Dataset


class Nemo(Dataset):
    def __init__(self, root_dir, label_dir):
        self.root_dir = root_dir
        self.label_dir = label_dir
        self.path = os.path.join(self.root_dir, self.label_dir)
        self.img_path_list = os.listdir(self.path)

    def __getitem__(self, idx):
        image_name = self.img_path_list[idx]
        image_item_path = os.path.join(self.root_dir, self.label_dir, image_name)
        img = Image.open(image_item_path)
        label = self.label_dir
        return img, label

    def __len__(self):
        return len(self.img_path_list)


root_dir = "D:\\Python_In_One\\Project\\XiaoTuDui\\data\\train"
ants_label_dir = "ants_label"
ants_dataset = Nemo(root_dir, ants_label_dir)

print(len(ants_dataset))