Python之DataFrame基础用法

doraemonjtr

已于 2022-07-05 00:12:54 修改

阅读量1.2k

点赞数

文章标签： python pandas 数据分析

于 2022-07-05 00:11:39 首次发布

本文链接：https://blog.csdn.net/doraemonjtr/article/details/125611147

版权

本文详细介绍了Python中DataFrame的基础操作，包括如何利用字典和数组创建DataFrame，读取DataFrame的行、列和单元格，以及如何写入、插入、删除和修改DataFrame的数据。此外，还讲解了DataFrame的去重和更改列名的方法。

摘要由CSDN通过智能技术生成

引入库

import pandas as pd
import numpy as np

pandas官方文档：https://pandas.pydata.org/pandas-docs/stable

1. 创建DataFrame

data={
   "one":np.random.randn(4),"two":np.linspace(1,4,4),"three":['zhangsan','李四',999,0.1]}
df=pd.DataFrame(data,index=[1,2,3,4])
df

set _index用于将df中的一行或多行设置为索引。

参数drop默认为True，意为将该列设置为索引后从数据中删除，如果设为False，将继续在数据中保留该行。

# df.set_index('one')
df.set_index(['one'],drop=False)

df.index=['a','b','c','d']

df.reset_index(drop=True)

参数drop默认值为False，意为将原来的索引做为数据列保留，如果设为True，原来的索引会直接删除。

data=np.random.randn(6,4)#创建一个6行4列的数组
df=pd.DataFrame(data,columns=list('ABCD'),index=[1,2,'a','b','2006-10-1','第六行'])
df

pd.DataFrame(columns=('id','name','grade','class'))

df[['A','B','D']]
df.loc[:,['A','B','D']]

PS: df[‘A’]和 df[[‘A’]]都能读取第一列数据，但它们返回的数据结构不同：

type(df[‘A’]): pandas.core.series.Series

type(df[[‘A’]]): pandas.core.frame.DataFrame