一、读入数据
cl_full <-read.csv(file = "Data&Results/临床数据下载/clinical_with_os_dcf_全面数据.csv", header=T, row.names=1, check.names=FALSE)
colnames(cl_full)[grepl("radiation", colnames(cl_full))]
"radiation_therapy"
"has_radiations_information"
"additional_radiation_therapy"
"treatments_radiation_treatment_intent_type""treatments_radiation_treatment_id"
"treatments_radiation_treatment_type"
"treatments_radiation_therapeutic_agents" "treatments_radiation_treatment_or_therapy" "treatments_radiation_days_to_treatment_end"
"treatments_radiation_days_to_treatment_start" "treatments_radiation_regimen_or_line_of_therapy" "treatments_radiation_treatment_effect"
"treatments_radiation_initial_disease_status" "treatments_radiation_treatment_anatomic_site" "treatments_radiation_treatment_outcome"
cl_full的列名中,包含radiation的共有15个,其中有两个列表明是否接受过放射治疗。但是二者的信息并不相同,有帖子解释为:
radiation_therapy是最初接受放疗的信息,243个NO,39个YES
treatments_radiation_treatment_or_therapy是随访后接受放疗的信息(更新数据),321个NO,86个YES
因此实际筛选时以treatments_radiation_treatment_or_therapy为主
二、处理数据
cl_full = cl_full[cl_full$treatments_radiation_treatment_or_therapy=='yes',]
共筛选到86个接受过放射的病人