原始数据
1.某列所有元素随机赋值
data['duration_time'] = data['duration_time'].map(lambda x: np.random.randint(0, 500))
2.两列字符串拼接
data_["activity_date"] =[ '2020/1/ % i' % i for i in data_["activity_day"]]
结果:
3.按一定概率在列表中选取元素赋值于某列
table = ['湖北', '湖南', '福建','海南','广东','上海','北京','江苏','广西','山西','山东','浙江']
data_ks['item_city'] = data_ks['item_city'].map(lambda x: np.random.choice(table,p=[0.1, 0.05,0.05, 0.1, 0.1, 0.1, 0.03, 0.1,0.1,0.1,0.07,0.1]))