问题描述:
将文件格式如下的txt文档:
310102193300000000,A00
310102194100000000,A00
310102194500000000,A00
310102194900000000,A00
……
转换成apriori算法所需要的txt格式,如下
zjhm zdbm
0 110102201010060000 K07,Z01
1 110102201105190000 A49,J06,J40,K02
2 110105199006150000 I51,K82,N61,Z34
3 110108197711300000 D22,K04,N76,N83,S02
4 120104197302060000 L30,M25,M47,M51
```python
import pandas as pd
df = pd.read_csv('data/newData/zjhm_zdbm.csv')
print (df.head())
def ab(df):
return','.join(df.values)
df = df.groupby(['zjhm'])['zdbm'].apply(ab)
df = df.reset_index()
print (df)
df.to_csv('data/newData/patient_jz_new.csv')
df.to_excel('data/newData/patient_jz_new.xlsx')