我正在从txt文件抓取并添加到数组。我已经将for循环编写为1.从txt文件中获取一个条目,2.检查结果数组中是否存在该条目,如果不存在,则将其添加到结果数组中。3.将其添加到结果数组中。
data.txt看起来像这样:
0100002159900000000000 Sales Deval Patrick Seattle
0100003159900000000000 Sales Bernie Sanders Chicago
0100004159900000000000 Sales Joseph R. Biden Dallas
0100005159900000000000 Marketing Michael R. Bloomberg Boston
...etc.
结果数组应如下所示:
employees = [
{
"dept": "Sales",
"remote": False,
"management": [
{
"manager": "Smith",
"level": "county",
"people": [
{
"first_name": "Deval",
"middle_name": "",
"last_name": "Patrick",
"city": Seattle,
},
{
"first_name": "Bernie",
"middle_name": "",
"last_name": "Sanders",
"city": Chicago,
},
{
"first_name": "Joseph",
"middle_name": "R",
"last_name": "Biden",
"city": Dallas,
},
]
},
]
},
{
"dept": "Marketing",
"remote": True,
"management": [
{
"manager": "Jones",
"level": "county",
"people": [
{
"first_name": "Michael",
"middle_name": "R",
"last_name": "Bloomberg",
"city": Boston,
}
]
},
...etc.
]
}
]
我在if逻辑中苦苦挣扎,“如果它不在数组中,则添加它”。这是我现在所拥有的:
employees = []
with open('data.txt','r') as f:
for row in data:
# splits row by an instance of two or more characters of white space
# https://stackoverflow.com/questions/48917121/split-on-more-than-one-space
row = re.split(r"\s{2,}", row)
empl_id = row[0]
empl_name = row[2]
empl_dept = row[1]
for item in employees:
current_dept = item['dept']
if empl_dept != current_dept:
# create/append dept object:
dept_obj = {
"dept": "Sales",
"remote": False,
"management": [
{
"manager": "Smith",
"level": "county",
"people": []
}
]
}
employees.append(dept_obj)
else: # if dept_object does exist
#add people object:
employees[current_dept]['management']['people'].append({
"empl_name": empl_name,
"empl_id": empl_id
})
我知道这种逻辑,可能某些语法是错误的。我当前的主要问题是,它正在为employees数组中的每个项目添加一个dept_obj,这显然会越来越大。我敢肯定,有一种更好的方法可以编写此代码,我只是不知道它是什么。任何帮助表示赞赏。