Mongodb bulk_write & UpdateOne

 

Using the bulk_write can speed the mongo up, compared with the one by one update or insert.

 1 import pymongo, time, psutil
 2 from concurrent.futures import ProcessPoolExecutor
 3 from pymongo.operations import UpdateOne
 4 
 5 
 6 tic = time.time()
 7 mongoclient = pymongo.MongoClient(host="mongodb3.xxx.com", port=27017)
 8 MongoDB = mongoclient["xxx"]
 9 collection = MongoDB.resume_test
10 document = {
11     "Duplicate": '',
12     "Skill": '',
13     "SourceURL": '',
14 }
15 document = set(document)
16 
17 def bulk_write(requests, collection, last_one=False):
18     if len(requests) > 10000 or last_one:
19         collection.bulk_write(requests)
20         return []
21     else:
22         return requests
23 
24 
25 requests = []
26 for index, data in enumerate(collection.find({})):
27     if index % 10000 == 0:
28         print(index)
29 
30     id = data['_id']
31     data = set(data)
32     add = {el: '' for el in document.difference(data)}
33     if add:
34         requests.append(UpdateOne({'_id': id}, {'$set': add}))
35         requests = bulk_write(requests, collection)
36 requests = bulk_write(requests, collection, last_one=True)
37 
38 toc = time.time()
39 print(f'finished, time cost: {toc - tic}')

 

转载于:https://www.cnblogs.com/NachoLau/p/11395177.html

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值