Pandas Basics
Please credit the source when reposting 😃!
Manual anti-scraping notice: Pandas Basics (DataFrame creation modes: CSV, XLSX, Dict-List, List-Tuple, List-Dict)
https://blog.csdn.net/Lyun911/article/details/112688678
Note: the examples below are run in Jupyter Notebook.
3 Ways to Create a DataFrame (5 methods)
3.1 CSV to DataFrame
weather_data.csv
weather_data2.csv
import pandas as pd
# Read a CSV file that has a header row:
df = pd.read_csv('weather_data.csv')
# Read a CSV file that has no header row:
# df = pd.read_csv('weather_data2.csv')  # Wrong: the first data row would be treated as the header
df = pd.read_csv('weather_data2.csv', header=None)  # Correct: header=None keeps every row as data
df.columns = ['day', 'temperature', 'windspeed', 'event']  # Assign the column names manually
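The two-step pattern above (read with header=None, then assign df.columns) can also be written as a single call using the names parameter of pd.read_csv. The following is a minimal sketch, assuming an in-memory string stands in for weather_data2.csv; the rows and the variable names csv_text, df_named and df_wrong are made up for illustration and are not the contents of the real file.

```python
import io
import pandas as pd

# Hypothetical stand-in for a header-less CSV file; the values are illustrative only.
csv_text = "1/1/2021,10,9,Rainy\n1/2/2021,13,8,Snowy\n1/3/2021,16,7,Sunny\n"

# names= supplies the column labels; header=None tells pandas that the first row is data.
df_named = pd.read_csv(
    io.StringIO(csv_text),
    header=None,
    names=['day', 'temperature', 'windspeed', 'event'],
)

# Without header=None, the first data row is consumed as the column labels.
df_wrong = pd.read_csv(io.StringIO(csv_text))

print(df_named.shape)  # all three rows are kept as data
print(df_wrong.shape)  # one row fewer, because it became the header
```

Comparing the two shapes makes the "wrong" versus "correct" reading visible: the default header inference silently drops one data row.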
3.2 XLSX to DataFrame
weather_data.xlsx
If the openpyxl package is not installed, install it first:
# !pip install -i https://pypi.tuna.tsinghua.edu.cn/simple openpyxl
df = pd.read_excel('weather_data.xlsx', sheet_name='Sheet1')  # Read the sheet named 'Sheet1' from the Excel file
df
3.3 Dict-List to DataFrame
# Dict-List to DataFrame: build a DataFrame from a dict of lists
weather_data = {
    'day': ['1/1/2021', '1/2/2021', '1/3/2021', '1/4/2021', '1/5/2021'],
    'temp': [10, 13, 16, 19, 22],
    'windspeed': [9, 8, 7, 6, 5]
}
df = pd.DataFrame(weather_data)
# Add a new column from a list with one value per row
event = ['Rainy', 'Snowy', 'Sunny', 'Sunny', 'Sunny']
df['event'] = event
df
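In the dict-of-lists form, each key becomes a column and each list supplies that column's values, so a list assigned as a new column has to match the number of rows. The sketch below illustrates this under assumed sample data; the names weather, df_dict and length_error are hypothetical and used only for this example.

```python
import pandas as pd

# Illustrative data only: two columns with three rows each.
weather = {
    'day': ['1/1/2021', '1/2/2021', '1/3/2021'],
    'temp': [10, 13, 16],
}
df_dict = pd.DataFrame(weather)

# Assigning a list adds a column when its length equals the row count.
df_dict['event'] = ['Rainy', 'Snowy', 'Sunny']

# A list of the wrong length is rejected with a ValueError.
try:
    df_dict['windspeed'] = [9, 8]
    length_error = None
except ValueError as exc:
    length_error = str(exc)

print(df_dict.columns.tolist())
print(length_error)
```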
3.4 List-Tuple to DataFrame
# List-Tuple to DataFrame: build a DataFrame from a list of tuples
weather_data = [
    ('1/1/2021', 10, 9, 'Rainy'),
    ('1/2/2021', 13, 8, 'Snowy'),
    ('1/3/2021', 16, 7, 'Sunny')
]
# Each tuple is one row; columns supplies the labels in tuple order
df = pd.DataFrame(
    data=weather_data,
    columns=['day', 'temp', 'windspeed', 'event']
)
df
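With a list of tuples, each tuple is one row and the column labels come only from the columns argument. As a small sketch with assumed sample rows (rows, df_default and df_labeled are hypothetical names), omitting columns makes pandas fall back to integer labels:

```python
import pandas as pd

# Illustrative rows only: (day, temp, windspeed).
rows = [
    ('1/1/2021', 10, 9),
    ('1/2/2021', 13, 8),
]

# Without columns=, the labels default to 0, 1, 2, ...
df_default = pd.DataFrame(rows)

# With columns=, the labels are matched to the tuple positions in order.
df_labeled = pd.DataFrame(rows, columns=['day', 'temp', 'windspeed'])

print(list(df_default.columns))
print(list(df_labeled.columns))
```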
3.5 List-Dict to DataFrame
# List-Dict to DataFrame: build a DataFrame from a list of dicts
weather_data = [
    {'day': '1/1/2021', 'temp': 10, 'windspeed': 9, 'event': 'Rainy'},
    {'day': '1/2/2021', 'temp': 13, 'windspeed': 8, 'event': 'Snowy'},
    {'day': '1/3/2021', 'temp': 16, 'windspeed': 7, 'event': 'Sunny'}
]
df = pd.DataFrame(weather_data)
df
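In the list-of-dicts form, each dict is one row and its keys become the column labels, so no columns argument is needed. One behavior worth knowing is that a key missing from some dict is filled with NaN in that row. A minimal sketch with assumed sample records (records, df_records and missing are hypothetical names):

```python
import pandas as pd

# Illustrative records only; the second record has no 'event' key.
records = [
    {'day': '1/1/2021', 'temp': 10, 'event': 'Rainy'},
    {'day': '1/2/2021', 'temp': 13},
]
df_records = pd.DataFrame(records)

# The missing value shows up as NaN, which isna() reports as True.
missing = df_records['event'].isna().tolist()

print(df_records.columns.tolist())
print(missing)
```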