Pandas中DataFrame的属性及方法大全_dataframe agg last-CSDN博客

本文链接：https://blog.csdn.net/cauchy7203/article/details/107656005

所有属性

属性名	说明
at	访问行/列标签对的单个值。
attrs	此对象的全局属性字典。
axes	返回一个表示DataFrame轴的列表。
columns	DataFrame的列标签。
dtypes	返回DataFrame中的dtype。
empty	指示DataFrame是否为空。
iat	通过整数位置访问行/列对的单个值。
iloc	基于位置的纯基于整数位置的索引。
index	DataFrame的索引（行标签）。
loc	通过标签或布尔数组访问一组行和列。
ndim	返回一个表示轴数/数组维数的整数。
shape	返回一个表示DataFrame维数的元组。
size	返回一个int表示此对象中元素的数量。
style	返回一个Styler对象。
values	返回DataFrame的Numpy表示形式。

所有方法

方法名	说明
abs()	返回具有每个元素的绝对数值的Series / DataFrame。
add(other[, axis, level, fill_value])	获取数据帧和其他元素的加法（二进制运算符add）。
add_prefix(prefix)	带字符串前缀的前缀标签。
add_suffix(suffix)	带字符串后缀的后缀标签。
agg([func, axis])	使用指定轴上的一项或多项操作进行汇总。
aggregate([func, axis])	使用指定轴上的一项或多项操作进行汇总。
align(other[, join, axis, level, copy, …])	使用指定的join方法将两个对象在其轴上对齐。
all([axis, bool_only, skipna, level])	返回是否所有元素都为True（可能在某个轴上）。
any([axis, bool_only, skipna, level])	返回是否有任何元素为True（可能在某个轴上）。
append(other[, ignore_index, …])	将其他行附加到调用方的末尾，返回一个新对象。
apply(func[, axis, raw, result_type, args])	沿DataFrame的轴应用功能。
applymap(func)	将函数应用于元素的数据框。
asfreq(freq[, method, how, normalize, …])	将TimeSeries转换为指定的频率。
asof(where[, subset])	返回在此之前没有任何NaN的最后一行。
assign(**kwargs)	将新列分配给DataFrame。
astype(dtype[, copy, errors])	将熊猫对象转换为指定的dtype dtype。
at_time(time[, asof, axis])	选择一天中特定时间的值（例如，上午9:30）。
backfill([axis, inplace, limit, downcast])	DataFrame.fillna（）的同义词，带有method =‘bfill’。
between_time(start_time, end_time[, …])	选择一天中特定时间之间的值（例如9：00-9：30 AM）。
bfill([axis, inplace, limit, downcast])	DataFrame.fillna（）的同义词，带有method =‘bfill’。
bool()	返回单个元素Series或DataFrame的布尔值。
boxplot([column, by, ax, fontsize, rot, …])	从DataFrame列制作箱形图。
clip([lower, upper, axis, inplace])	修剪输入阈值处的值。
combine(other, func[, fill_value, overwrite])	与另一个DataFrame进行按列合并。
combine_first(other)	在其他位置的相同位置更新具有值的空元素。
compare(other[, align_axis, keep_shape, …])	与另一个DataFrame进行比较并显示差异。
convert_dtypes([infer_objects, …])	使用支持pd.NA的dtypes将列转换为最佳的dtypes。
copy([deep])	复制该对象的索引和数据。
corr([method, min_periods])	计算列的成对相关，不包括NA /空值。
corrwith(other[, axis, drop, method])	计算成对相关。
count([axis, level, numeric_only])	为每一列或每一行计算非NA单元。
cov([min_periods, ddof])	计算列的成对协方差，不包括NA /空值。
cummax([axis, skipna])	返回DataFrame或Series轴上的累积最大值。
cummin([axis, skipna])	返回DataFrame或Series轴上的累积最小值。
cumprod([axis, skipna])	通过DataFrame或Series轴返回累积乘积。
cumsum([axis, skipna])	返回DataFrame或Series轴上的累积总和。
describe([percentiles, include, exclude, …])	生成描述性统计信息。
diff([periods, axis])	元素的第一个离散差。
div(other[, axis, level, fill_value])	获取数据帧和其他元素的浮点除法（二进制运算符truediv）。
divide(other[, axis, level, fill_value])	获取数据帧和其他元素的浮点除法（二进制运算符truediv）。
dot(other)	计算DataFrame与其他框架之间的矩阵乘法。
drop([labels, axis, index, columns, level, …])	从行或列中删除指定的标签。
drop_duplicates([subset, keep, inplace, …])	返回删除重复行的DataFrame。
droplevel(level[, axis])	返回已删除请求的索引/列级别的DataFrame。
dropna([axis, how, thresh, subset, inplace])	删除缺失的值。
duplicated([subset, keep])	返回表示重复行的布尔系列。
eq(other[, axis, level])	等于等于数据帧和其他按元素计算（二进制运算符eq）。
equals(other)	测试两个对象是否包含相同的元素。
eval(expr[, inplace])	评估描述DataFrame列上的操作的字符串。
ewm([com, span, halflife, alpha, …])	提供指数加权（EW）函数。
expanding([min_periods, center, axis])	提供扩展的转换。
explode(column[, ignore_index])	将类似列表的每个元素转换为一行，复制索引值。
ffill([axis, inplace, limit, downcast])	DataFrame.fillna（）的同义词，方法=‘ffill’。
fillna([value, method, axis, inplace, …])	使用指定的方法填充NA / NaN值。
filter([items, like, regex, axis])	根据指定的索引标签对数据框的行或列进行子集设置。
first(offset)	根据日期偏移量选择时间序列数据的初始时段。
first_valid_index()	返回第一个非NA /空值的索引。
floordiv(other[, axis, level, fill_value])	获取数据帧的整数除法和其他逐元素的方法（二进制运算符florordiv）。
from_dict(data[, orient, dtype, columns])	从类似数组或字典的字典构造DataFrame。
from_records(data[, index, exclude, …])	将结构化或记录ndarray转换为DataFrame。
ge(other[, axis, level])	获取大于或等于数据帧和其他逐元素的值（二进制运算符ge）。
get(key[, default])	从对象获取给定键的项目（例如：DataFrame列）。
groupby([by, axis, level, as_index, sort, …])	使用映射器或按一系列列对DataFrame进行分组。
gt(other[, axis, level])	获取大于dataframe和其他逐个元素的值（二进制运算符>）。
head([n])	返回前n行
hist([column, by, grid, xlabelsize, xrot, …])	制作DataFrame的直方图。
idxmax([axis, skipna])	返回在请求轴上第一次出现最大值的索引。
idxmin([axis, skipna])	在请求的轴上第一次出现最小值的返回索引。
infer_objects()	尝试为对象列推断更好的dtype。
info([verbose, buf, max_cols, memory_usage, …])	打印DataFrame的简要摘要。
insert(loc, column, value[, allow_duplicates])	将列插入DataFrame中的指定位置。
interpolate([method, axis, limit, inplace, …])	请注意，具有MultiIndex的DataFrame / Series仅支持method =‘linear’。
isin(values)	DataFrame中的每个元素是否包含在值中。
isna()	检测缺失值。
isnull()	检测缺失值。
items()	遍历（列名，系列）对。
iteritems()	遍历（列名，系列）对。
iterrows()	将DataFrame行作为（索引，系列）对进行迭代。
itertuples([index, name])	以namedtuple的形式遍历DataFrame行。
join(other[, on, how, lsuffix, rsuffix, sort])	连接另一个DataFrame的列。
keys()	获取“信息轴”（有关更多信息，请参见索引）。
kurt([axis, skipna, level, numeric_only])	在请求的轴上返回无偏峰度。
kurtosis([axis, skipna, level, numeric_only])	在请求的轴上返回无偏峰度。
last(offset)	根据日期偏移量选择时间序列数据的最后时段。
last_valid_index()	返回上一个非NA /空值的索引。
le(other[, axis, level])	小于或等于dataframe和其他逐元素（二进制运算符）。
lookup(row_labels, col_labels)	DataFrame基于标签的“花式索引”功能。
lt(other[, axis, level])	获取小于dataframe和其他元素级数（二进制运算符lt）。
mad([axis, skipna, level])	返回所请求轴的值的平均绝对偏差。
mask(cond[, other, inplace, axis, level, …])	替换条件为True的值。
max([axis, skipna, level, numeric_only])	返回所请求轴的最大值。
mean([axis, skipna, level, numeric_only])	返回值的平均值
median([axis, skipna, level, numeric_only])	返回所请求轴的值的中值。
melt([id_vars, value_vars, var_name, …])	取消将DataFrame从宽格式转为长格式，可以选择保留标识符。
memory_usage([index, deep])	返回每列的内存使用情况（以字节为单位）。
merge(right[, how, on, left_on, right_on, …])	用数据库样式的联接合并DataFrame或命名的Series对象。
min([axis, skipna, level, numeric_only])	返回所请求轴的最小值。
mod(other[, axis, level, fill_value])	获取数据帧的Modulo以及其他按元素的方式（二进制运算符mod）。
mode([axis, numeric_only, dropna])	获取沿选定轴的每个元素的模式。
mul(other[, axis, level, fill_value])	获取数据帧和其他逐元素的乘法（二进制运算符mul）。
multiply(other[, axis, level, fill_value])	获取数据帧和其他逐元素的乘法（二进制运算符mul）。
ne(other[, axis, level])	获取不等于dataframe的值以及其他逐元素的值（二进制运算符ne）。
nlargest(n, columns[, keep])	按降序返回按列排序的前n行。
notna()	检测现有（非缺失）值。
notnull()	检测现有（非缺失）值。
nsmallest(n, columns[, keep])	按升序返回列排序的前n行。
nunique([axis, dropna])	计算请求轴上的不同观察值。
pad([axis, inplace, limit, downcast])	DataFrame.fillna（）的同义词，方法=‘ffill’。
pct_change([periods, fill_method, limit, freq])	当前元素与先前元素之间的百分比变化。
pipe(func, args, *kwargs)	应用func（self，* args，** kwargs）。
pivot([index, columns, values])	返回按给定的索引/列值组织的重整型DataFrame。
pivot_table([values, index, columns, …])	创建电子表格样式的数据透视表作为DataFrame。
plot	pandas.plotting._core.PlotAccessor的别名
pop(item)	返回项目并从框架中放下。
pow(other[, axis, level, fill_value])	获取数据帧和其他元素的指数幂（二进制运算符pow）。
prod([axis, skipna, level, numeric_only, …])	返回所请求轴的值的乘积。
product([axis, skipna, level, numeric_only, …])	返回所请求轴的值的乘积。
quantile([q, axis, numeric_only, interpolation])	在请求的轴上以给定的分位数返回值。
query(expr[, inplace])	使用布尔表达式查询DataFrame的列。
radd(other[, axis, level, fill_value])	获取数据帧和其他元素的加法（二进制运算符radd）。
rank([axis, method, numeric_only, …])	沿轴计算数值数据等级（1到n）。
rdiv(other[, axis, level, fill_value])	获取数据帧和其他元素的浮点除法（二进制运算符rtruediv）。
reindex(**kwargs)	使用可选的填充逻辑使Series / DataFrame与新索引一致。
reindex_like(other[, method, copy, limit, …])	返回具有匹配索引的对象作为其他对象。
rename(**kwargs)	更改轴标签。
rename_axis(**kwargs)	设置索引或列的轴名称。
reorder_levels(order[, axis])	使用输入顺序重新排列索引级别。
replace([to_replace, value, inplace, limit, …])	将to_replace中给定的值替换为value。
resample(rule[, axis, closed, label, …])	重新采样时间序列数据。
reset_index([level, drop, inplace, …])	重置索引或索引的级别。
rfloordiv(other[, axis, level, fill_value])	获取数据帧的整数除法和其他逐元素的方法（二进制运算符rfloordiv）。
rmod(other[, axis, level, fill_value])	获取数据帧的Modulo以及其他按元素的方式（二进制运算符rmod）。
rmul(other[, axis, level, fill_value])	获取数据帧和其他逐元素的乘法（二进制运算符rmul）。
rolling(window[, min_periods, center, …])	提供滚动窗口计算。
round([decimals])	将DataFrame舍入到小数位数可变。
rpow(other[, axis, level, fill_value])	获取数据帧和其他元素的指数幂（二进制运算符rpow）。
rsub(other[, axis, level, fill_value])	获取数据帧的减法和其他逐元素的方法（二进制运算符rsub）。
rtruediv(other[, axis, level, fill_value])	获取数据帧和其他元素的浮点除法（二进制运算符rtruediv）。
sample([n, frac, replace, weights, …])	从对象轴返回随机的项目样本。
select_dtypes([include, exclude])	根据列dtypes返回DataFrame列的子集。
sem([axis, skipna, level, ddof, numeric_only])	返回要求轴上的平均值的无偏标准误差。
set_axis(labels[, axis, inplace])	将所需的索引分配给给定轴。
set_index(keys[, drop, append, inplace, …])	使用现有列设置DataFrame索引。
shift([periods, freq, axis, fill_value])	按所需的周期数移动索引，并带有可选的时间频率。
skew([axis, skipna, level, numeric_only])	在请求的轴上返回无偏斜。
slice_shift([periods, axis])	相当于移位而不复制数据。
sort_index([axis, level, ascending, …])	按标签（沿轴）对对象排序。
sort_values(by[, axis, ascending, inplace, …])	沿任一轴的值排序。
sparse	pandas.core.arrays.sparse.accessor.SparseFrameAccessor的别名
squeeze([axis])	将一维轴对象压缩为标量。
stack([level, dropna])	从列到索引堆叠指定级别。
std([axis, skipna, level, ddof, numeric_only])	返回要求轴上的样品标准偏差。
sub(other[, axis, level, fill_value])	获取数据帧的减法和其他逐元素的方法（二进制运算符sub）。
subtract(other[, axis, level, fill_value])	获取数据帧的减法和其他逐元素的方法（二进制运算符sub）。
sum([axis, skipna, level, numeric_only, …])	返回所请求轴的值之和。
swapaxes(axis1, axis2[, copy])	适当地交换轴和交换值轴。
swaplevel([i, j, axis])	在特定轴上的MultiIndex中交换级别i和j。
tail([n])	返回最后一行
take(indices[, axis, is_copy])	沿轴返回给定位置索引中的元素。
to_clipboard([excel, sep])	将对象复制到系统剪贴板。
to_csv([path_or_buf, sep, na_rep, …])	将对象写入逗号分隔值（csv）文件。
to_dict([orient, into])	将DataFrame转换为字典。
to_excel(excel_writer[, sheet_name, na_rep, …])	将对象写入Excel工作表。
to_feather(**kwargs)	将DataFrame写入二进制Feather格式。
to_gbq(destination_table[, project_id, …])	将DataFrame写入Google BigQuery表。
to_hdf(path_or_buf, key[, mode, complevel, …])	使用HDFStore将包含的数据写入HDF5文件。
to_html([buf, columns, col_space, header, …])	将DataFrame呈现为HTML表。
to_json([path_or_buf, orient, date_format, …])	将对象转换为JSON字符串。
to_latex([buf, columns, col_space, header, …])	将对象渲染为LaTeX表格，长表或嵌套表/表格。
to_markdown([buf, mode, index])	以Markdown友好格式打印DataFrame。
to_numpy([dtype, copy, na_value])	将DataFrame转换为NumPy数组。
to_parquet(**kwargs)	将DataFrame写入二进制拼花格式。
to_period([freq, axis, copy])	将DataFrame从DatetimeIndex转换为PeriodIndex。
to_pickle(path[, compression, protocol])	将对象腌制（序列化）到文件。
to_records([index, column_dtypes, index_dtypes])	将DataFrame转换为NumPy记录数组。
to_sql(name, con[, schema, if_exists, …])	将存储在DataFrame中的记录写入SQL数据库。
to_stata(**kwargs)	将DataFrame对象导出为Stata dta格式。
to_string([buf, columns, col_space, header, …])	将DataFrame渲染到控制台友好的表格输出。
to_timestamp([freq, how, axis, copy])	在时段开始时将其强制转换为时间戳的DatetimeIndex。
to_xarray()	从pandas对象返回一个xarray对象。
transform(func[, axis])	自我调用func产生具有转换值的DataFrame。
transpose(*args[, copy])	转置索引和列。
truediv(other[, axis, level, fill_value])	获取数据帧和其他元素的浮点除法（二进制运算符或truediv）。
truncate([before, after, axis, copy])	在某个索引值之前和之后截断Series或DataFrame。
tshift([periods, freq, axis])	（已弃用）使用时标频率（如果有）移动时间索引。
tz_convert(tz[, axis, level, copy])	将可感知tz的轴转换为目标时区。
tz_localize(tz[, axis, level, copy, …])	将Series或DataFrame的tz天真索引本地化为目标时区。
unstack([level, fill_value])	枢转一个级别（必要的层次结构）索引标签。
update(other[, join, overwrite, …])	使用来自另一个DataFrame的非NA值就地进行修改。
value_counts([subset, normalize, sort, …])	返回一个包含DataFrame中唯一行数的Series。
var([axis, skipna, level, ddof, numeric_only])	返回请求轴上的无偏方差。
where(cond[, other, inplace, axis, level, …])	替换条件为False的值。
xs(key[, axis, level, drop_level])	从Series / DataFrame返回横截面。