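With the font installed and the data in hand, let's build a word cloud to see what benefits recruiters (HR) promise in their job postings.
Install jieba:
pip install jieba
Install wordcloud:
pip install wordcloud
Here is the code: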
import pandas as pd
data=pd.read_csv("data_df.csv",index_col=0)
data.info()
'''
<class 'pandas.core.frame.DataFrame'>
Index: 450 entries, Java to java高级工程师
Data columns (total 12 columns):
1 450 non-null object
2 450 non-null object
3 450 non-null object
4 450 non-null object
5 450 non-null object
6 450 non-null object
7 450 non-null object
8 450 non-null object
9 399 non-null object
10 450 non-null object
11 435 non-null object
12 415 non-null object
dtypes: object(12)
memory usage: 55.7+ KB
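The printed output shows that the columns labeled 9, 11 and 12 contain missing values (399, 435 and 415 non-null entries out of 450).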
'''
Column 11 will be the data source for the word cloud, so we fill in its missing values first. The code is as follows:
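from sklearn.impute import SimpleImputer
# treatment: column '11' kept as a 2-D frame, since SimpleImputer expects 2-D input
treatment = data[['11']]
# Fill missing values with the mode (the most frequent value)
mode = SimpleImputer(strategy='most_frequent')
mode = mode.fit_transform(treatment)
mode[:20]
# Flatten the (n, 1) result to one dimension before writing it back
data.loc[:, '11'] = mode.ravel()
data.info()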
'''
9 399 non-null object
10 450 non-null object
11 450 non-null object
12 415 non-null object
Column 11 now reports 450 non-null entries, which confirms that its missing values were filled with the mode.
'''
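For a single column, the same mode fill can also be done with pandas alone. The following is a minimal sketch on a toy frame; the name toy and its four rows are made up for illustration and are not taken from data_df.csv.

```python
import pandas as pd

# Toy stand-in for the benefits column (hypothetical rows, not the real data)
toy = pd.DataFrame({"11": ["五险一金|带薪年假", None, "五险一金|带薪年假", "年底双薪"]})

# mode() ignores missing values and returns the most frequent entries;
# take the first one as the fill value
most_common = toy["11"].mode()[0]

# Replace the missing entry with that value
toy["11"] = toy["11"].fillna(most_common)

print(toy["11"].tolist())
```

This avoids the reshape between 1-D and 2-D data that SimpleImputer requires, while SimpleImputer remains the more convenient choice when several columns need the same treatment.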
Next, split each entry on "|" and collect the individual benefits into a list:
temp=[]
for item in data['11']:
    items=item.split("|")
    for each in items:
        temp.append(each)
temp
# Printed contents of temp
["年底双薪","定期体检","绩效奖金","技能培训","股票期权","带薪年假","交通补助","健身房","股票期权","带薪年假","交通补助","健身房","技能培训","节日礼物","带薪年假","岗位晋升","绩效奖金","五险一金","年度旅游","岗位晋升","技能培训","节日礼物","带薪年假","岗位晋升","绩效奖金","带薪年假","管理规范","五险一金","技术大牛","两次年度旅游","福利倍儿好","年终奖丰厚","带薪年假","计算机软件","管理规范","定期体检","技能培训","节日礼物","带薪年假","岗位晋升","年底双薪","定期体检","带薪年假","晋升透明","绩效奖金","带薪年假","定期体检","节日礼物","绩效奖金 ","五险一金 ","带薪年假 ","年度旅游 ","绩效奖金","带薪年假","定期体检","节日礼物","年底双薪","定期体检","绩效奖金","技能培训","带薪年假","计算机软件","管理规范","定期体检","六险一金","扁平化管理","丰厚年终","丰富技术交流","带薪年假","美女多","领导好","帅哥多","绩效奖金","专项奖金","五险一金","带薪年假","绩效奖金","专项奖金","五险一金","带薪年假","绩效奖金","带薪年假","定期体检","节日礼物","六险一金","扁平化管理","丰厚年终","丰富技术交流","年底双薪","节日礼物","技能培训","绩效奖金","技能培训","节日礼物","带薪年假","岗位晋升","技能培训","节日礼物","带薪年假","岗位晋升","绩效奖金","带薪年假","年终分红","定期体检","六险一金","扁平化管理","丰厚年终","丰富技术交流","股票期权","带薪年假","交通补助","健身房","绩效奖金","专项奖金","五险一金","带薪年假","技能培训","节日礼物","带薪年假","岗位晋升","扁平管理","领导好","五险一金","绩效奖金","年底双薪","定期体检","绩效奖金","技能培训","技能培训","年度旅游","岗位晋升","五险一金","绩效奖金","年底双薪","五险一金","带薪年假","节日礼物","技能培训","绩效奖金","岗位晋升","技能培训","年度旅游","岗位晋升","五险一金","技能培训","年度旅游","岗位晋升","五险一金","扁平管理","领导好","五险一金","绩效奖金","扁平管理","弹性工作","大厨定制三餐","就近租房补贴","年底双薪","带薪年假","定期体检","绩效奖金","节日礼物","技能培训","免费班车","带薪年假","带薪年假","计算机软件","管理规范","定期体检","绩效奖金","交通补助","定期体检","通讯津贴","年底双薪","带薪年假","房屋补贴","零食饮料","扁平管理","领导好"
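Some entries in the printed list carry a trailing space (for example "绩效奖金 "), so they would be counted separately from their clean counterparts. A minimal sketch of tallying the benefits with the standard library, using a short hand-picked excerpt rather than the full list, might look like this; the name freq is introduced here for illustration.

```python
from collections import Counter

# A short excerpt in the style of the list above; note the trailing spaces
temp = ["年底双薪", "定期体检", "绩效奖金", "带薪年假", "绩效奖金 ", "带薪年假", "带薪年假 "]

# Strip whitespace so "绩效奖金 " and "绩效奖金" count as the same benefit,
# and skip anything that is empty after stripping
freq = Counter(word.strip() for word in temp if word.strip())

# Show the most frequent benefits first
for word, count in freq.most_common(3):
    print(word, count)
```

A word-to-count mapping of this kind is the form that wordcloud's WordCloud.generate_from_frequencies accepts. For Chinese text, WordCloud also needs its font_path argument to point at a font that contains CJK glyphs.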