(一)在使用google云端硬盘时,如果需要读取现有的已经下载的数据集
将文件放置于硬盘文件夹中,根据以下代码读取(如果是csv文件直接data = pd.read_csv('drive/MyDrive/Colab Notebooks/flights.csv'))即可
data_dir='./drive/MyDrive/Colab Notebooks'
Trainname=os.path.join(data_dir,'house_sales.ftr')
Testname=os.path.join(data_dir,'test.csv')
from google.colab import drive
drive.mount('/content/drive')
data=pd.read_feather('drive/MyDrive/Colab Notebooks/house_sales.ftr')
data.head()
(二)直接爬取网上的数据集则按照以下代码
zip_path = tf.keras.utils.get_file(
origin='https://storage.googleapis.com/tensorflow/tf-keras-datasets/jena_climate_2009_2016.csv.zip',
fname='jena_climate_2009_2016.csv.zip',
extract=True)
csv_path, _ = os.path.splitext(zip_path) #csv代表文本文件
df = pd.read_csv(csv_path)
df.head()
(三)读取python中