pytorch 自带数据集
- torchvision提供对图片数据处理相关的api和数据
- 数据位置torchvision.datasets
- torchtext提供对文本数据处理相关的API和数据
- 数据位置torch.text.datasets
torchvision.datasets.MNIST(root=’/files/’,train=True,download=True,transform=)
- root 参数表是数据存放位置
- train是否是训练集(True训练集,False测试集)
- download是否下载
- transform实现对图片的处理函数
from torchvision.datasets import MNIST
mnist=MNIST(root='./data',train=True,download=True)
print(mnist)
Downloading http://yann.lecun.com/exdb/mnist/train-images-idx3-ubyte.gz
Downloading http://yann.lecun.com/exdb/mnist/train-labels-idx1-ubyte.gz
Downloading http://yann.lecun.com/exdb/mnist/t10k-images-idx3-ubyte.gz
Downloading http://yann.lecun.com/exdb/mnist/t10k-labels-idx1-ubyte.gz
Processing...
Done!
Dataset MNIST
Number of datapoints: 60000
Split: train
Root Location: ./data
Transforms (if any): None
Target Transforms (if any): None