以下是在pandas中实现数据切片的常用脚本。原理不赘述,具体示例如下:
01. 构造数据集
# 构造数据源
import pandas as pd
df = pd.DataFrame({
"序号":range(1,11),
"品类":["水果","水果","水果","水果","水果","蔬菜","蔬菜","蔬菜","蔬菜","蔬菜"],
"商品":["苹果","西瓜","荔枝","龙眼","菠萝","白菜","土豆","豆芽","番茄","豌豆"],
"销量":range(10,101,10),
"销额":range(100,1001,100)
})
df
02.选取某一列
# 选取某一列
df["商品"]
03.选取某一列
# 选取某一列
df[["商品"]]
04.选取某一列
# 选取某一列
df.商品
05.选取若干列
# 选取若干列
df[["品类","商品"]]
06.判断是否包含某关键词
# 判断某列的单元格是否包含某关键词
df["商品"].str.contains("豆")
07.根据关键词筛选
# 筛选出"商品"字段包含有"豆"字的商品记录
df[df["商品"].str.contains("豆")]
08."且"筛选
# 找出销量