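I don't have MongoDB installed on my laptop, so I'll only handle the data processing; how you insert it is up to you.
Going by your setup, I have a text file named test whose contents look like this: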
http://titie1
key1 value1
key2 value2
key3 value3
key4 value4
http://title2
key1 value1
key2 value2
key3 value3
key4 value4
key5 value5
http://title3
key1 value1
key2 value2
key3 value3
key4 value4
key5 value5
http://title4
key1 value1
key2 value2
key3 value3
key4 value4
key5 value5
key6 value6
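Here's the code:
lst = []
with open('test', 'r') as fh:
    line = fh.readline().strip()
    while line:
        item = {
            'title': '',
            'data': []
        }
        if line.startswith("http://"):
            # A URL line starts a new record; keep the part after "//" as the title.
            item['title'] = line.split("//")[-1]
            line = fh.readline().strip()
            # Collect "key value" lines until the next URL line or the end of the file.
            while line and not line.startswith("http://"):
                _dic = {}
                _k, _v = line.split()
                _dic[_k] = _v
                item['data'].append(_dic)
                line = fh.readline().strip()
            else:
                # The inner loop never breaks, so this runs once per record.
                lst.append(item)
        else:
            # Skip a stray line before the first URL; without this the loop never advances.
            line = fh.readline().strip()
print(lst)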
Output:
[{'title': 'titie1', 'data': [{'key1': 'value1'}, {'key2': 'value2'}, {'key3': 'value3'}, {'key4': 'value4'}]}, {'title': 'title2', 'data': [{'key1': 'value1'}, {'key2': 'value2'}, {'key3': 'value3'}, {'key4': 'value4'}, {'key5': 'value5'}]}, {'title': 'title3', 'data': [{'key1': 'value1'}, {'key2': 'value2'}, {'key3': 'value3'}, {'key4': 'value4'}, {'key5': 'value5'}]}, {'title': 'title4', 'data': [{'key1': 'value1'}, {'key2': 'value2'}, {'key3': 'value3'}, {'key4': 'value4'}, {'key5': 'value5'}, {'key6': 'value6'}]}]
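The same grouping can also be done in a single pass with a for loop, which avoids the nested readline calls. This is only a sketch under assumptions, not the approach above: parse_records is a name made up for illustration, it assumes every data line is exactly one key and one value separated by whitespace, and unlike the code above it skips blank lines instead of stopping at them.

```python
def parse_records(lines):
    """Group "key value" lines under the most recent "http://" line."""
    records = []
    current = None  # record being filled, or None before the first URL line
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # assumption: blank lines are ignored rather than ending the input
        if line.startswith("http://"):
            current = {'title': line.split("//")[-1], 'data': []}
            records.append(current)
        elif current is not None:
            key, value = line.split()  # assumes exactly two whitespace-separated fields
            current['data'].append({key: value})
    return records


# Small in-memory sample; a file object opened on 'test' can be passed the same way.
sample = ["http://title1", "key1 value1", "key2 value2", "", "http://title2", "key1 value1"]
print(parse_records(sample))
```

Because it accepts any iterable of lines, it works on an open file handle as well as on a list, which makes it easy to try without touching the real file.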
Writing it into MongoDB is the remaining step, and I'll leave that to you.
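For that last step, a minimal sketch with pymongo might look like the following. I have not run it against a real server; the connection URI, the database name testdb, and the collection name pages are placeholders, and insert_records and main are hypothetical helpers, not anything from your setup.

```python
def insert_records(collection, records):
    """Insert parsed records into a pymongo collection; return how many were stored."""
    if not records:
        return 0  # insert_many rejects an empty list, so skip the call
    result = collection.insert_many(records)
    return len(result.inserted_ids)


def main():
    # Imported here so insert_records can be reused without pymongo installed.
    from pymongo import MongoClient

    # Sample data in the same shape as lst; swap in the real list.
    records = [
        {'title': 'title2', 'data': [{'key1': 'value1'}, {'key2': 'value2'}]},
    ]
    client = MongoClient("mongodb://localhost:27017")  # placeholder URI
    collection = client["testdb"]["pages"]  # placeholder database and collection names
    print(insert_records(collection, records))
```

Call main() once a MongoDB server is reachable. Each title becomes one document, and pymongo adds an _id field to any document that does not already have one.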