Use a replace mapping to encode string data that has a natural order.
5.2 Encoding Ordinal Categorical Features
import pandas as pd
# create features
df = pd.DataFrame({"Score": ["Low", "Low", "Medium", "Medium", "High"]})
df
    Score
0     Low
1     Low
2  Medium
3  Medium
4    High
# create mapper
scale_mapper = {
    "Low": 1,
    "Medium": 2,
    "High": 3
}
# replace feature values with scale
df["Score"].replace(scale_mapper)
0    1
1    1
2    2
3    2
4    3
Name: Score, dtype: int64
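
The same encoding can probably be reached in a couple of other ways; the following is only a sketch, not part of the original recipe. It reuses the `scale_mapper` dictionary from above. The names `encoded_map`, `ordered`, and `encoded_codes` are made up for illustration. `Series.map` turns any label missing from the mapper into NaN (whereas `replace` leaves it unchanged), and an ordered `pd.Categorical` states the order explicitly.

```python
import pandas as pd

# Same data and mapper as in the cells above
df = pd.DataFrame({"Score": ["Low", "Low", "Medium", "Medium", "High"]})
scale_mapper = {"Low": 1, "Medium": 2, "High": 3}

# Option 1: Series.map looks each value up in the dict.
# A label that is not a key in the mapper becomes NaN.
encoded_map = df["Score"].map(scale_mapper)

# Option 2: declare the order with an ordered categorical.
# The integer codes are zero-based, so add 1 to match the 1..3 scale.
ordered = pd.Categorical(
    df["Score"], categories=["Low", "Medium", "High"], ordered=True
)
encoded_codes = pd.Series(ordered.codes, name="Score") + 1

print(encoded_map.tolist())
print(encoded_codes.tolist())
```

With `map`, a typo in the data shows up as a missing value rather than slipping through as an unencoded string, which may be easier to catch before modelling.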