python数据导入

最新推荐文章于 2024-04-17 18:01:58 发布

Akaziki_

最新推荐文章于 2024-04-17 18:01:58 发布

阅读量1.2k

点赞数

分类专栏： python 文章标签： python 自然语言处理数据库

本文链接：https://blog.csdn.net/weixin_50146597/article/details/120310680

版权

一、导入csv地址

import pandas as pd
import numpy as np

# Import clean data 
path = 'https://cf-courses-data.s3.us.cloud-object-storage.appdomain.cloud/IBMDeveloperSkillsNetwork-DA0101EN-SkillsNetwork/labs/Data%20files/module_5_auto.csv'
df = pd.read_csv(path)
df.to_csv('module_5_auto.csv')
df=df._get_numeric_data()
df.head()

output:
在这里插入图片描述
OR 导入xslx:

df_can = pd.read_excel(
    'https://cf-courses-data.s3.us.cloud-object-storage.appdomain.cloud/IBMDeveloperSkillsNetwork-DV0101EN-SkillsNetwork/Data%20Files/Canada.xlsx',
    sheet_name='Canada by Citizenship',
    skiprows=range(20),
    skipfooter=2)

print('Data downloaded and read into a dataframe!')

在这里插入图片描述

# clean up the dataset to remove unnecessary columns (eg. REG) 
df_can.drop(['AREA','REG','DEV','Type','Coverage'], axis=1, inplace=True)

# let's rename the columns so that they make sense
df_can.rename(columns={'OdName':'Country', 'AreaName':'Continent','RegName':'Region'}, inplace=True)

# for sake of consistency, let's also make all column labels of type string
df_can.columns = list(map(str, df_can.columns))

# add total column
df_can['Total'] = df_can.sum(axis=1)

# years that we will be using in this lesson - useful for plotting later on
years = list(map(str, range(1980, 2014)))
print ('data dimensions:', df_can.shape)

最低0.47元/天解锁文章

Akaziki_

关注

0
点赞
踩
2

收藏

觉得还不错? 一键收藏
0
评论
python数据导入

一、导入csv地址import pandas as pdimport numpy as np# Import clean data path = 'https://cf-courses-data.s3.us.cloud-object-storage.appdomain.cloud/IBMDeveloperSkillsNetwork-DA0101EN-SkillsNetwork/labs/Data%20files/module_5_auto.csv'df = pd.read_csv(path)d
复制链接

扫一扫