之前在计算自己的数据集Mean std 遇到Found 0 files in subfolders of 报错,以为是代码有问题,修改了很多次都没解决,代码如下:
from __future__ import print_function, division
import os
from PIL import Image
import torch
import torch.utils.data
import torchvision
from skimage import io
import torchvision.datasets as Datasat
import random
import numpy as np
from sklearn.preprocessing import Binarizer
def getStat(train_data):
'''
Compute mean and variance for training data
:param train_data: 自定义类Dataset(或ImageFolder即可)
:return: (mean, std)
'''
print('Compute mean and variance for training data.')
print(len(train_data))
train_loader = torch.utils.data.DataLoader(
train_data, batch_size=1, shuffle=False, num_workers=0,
pin_memory=True)
mean = torch.zeros(3)
std = torch.zeros(3)
for X, _ in train_loader:
for d in range(3):
mean[d] += X[:, d, :, :].mean()
std[d] += X[:, d, :, :].std()
mean.div_(len(train_data))
std.div_(len(train_data))
return list(mean.numpy()), list(std.numpy())
if __name__ == '__main__':
dataset = Datasat.ImageFolder(root="C:\\Users\\15182\\Desktop\\test", transform=torchvision.transforms.ToTensor())
print(dataset)
print(getStat(dataset))
实际上不是代码的问题,而是路径有一点小问题,pytorch认为当面对大型数据集时,图像应该在不同类别的子文件夹中进行排序。
即 上面root路径不动,在test里面新建个文件夹,例如 1,把图片都放1文件夹中,即可正常读取