import os, sys
import pandas as pd
import numpy as np
PATH = os.getcwd() + '/'
# 数据可视化notebook里下载好了文件
os.chdir(PATH)
df = pd.read_csv(PATH + 'iris.data', names=['sepal length', 'sepal width', 'petal length', 'petal width', 'class'])
1. Map
- map提供字典进行类别的印射
- map方法传入一个序列或函数,执行对某个列的转变
df['class'] = df['class'].map({
'Iris-setosa': 'SET', 'Iris-virginica': 'VIR', 'Iris-versicolor': 'VER'})
df
![602cb791b35437a24e78949cb7c46357.png](https://img-blog.csdnimg.cn/img_convert/602cb791b35437a24e78949cb7c46357.png)
2. Apply
- apply作用于整个指定列
- 根据花瓣的宽度创建新的列