How to Process Data in Parallel on Multiple Cores

Latest recommended article published on 2024-01-17 16:00:37

a2428083131

Tags: python, artificial intelligence

Article link: https://blog.csdn.net/a2428083131/article/details/122194898

import warnings

warnings.filterwarnings('ignore')  # Warnings are noisy, so suppress them manually


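import numpy as np
import pandas as pd
from multiprocessing import cpu_count, Pool

# Number of CPU cores on the current machine
cores = cpu_count()
# Use one data chunk per core
partitions = cores

def parallelize(df, func):
    # Split the data into chunks (newer pandas versions may emit a FutureWarning here)
    data_split = np.array_split(df, partitions)
    # Create the process pool (Pool starts worker processes, not threads)
    pool = Pool(cores)
    # Distribute the chunks, process them, then merge the results
    data = pd.concat(pool.map(func, data_split))
    # Close the pool
    pool.close()
    # After close() no new tasks can be submitted; join() waits for all worker processes to exit
    pool.join()
    # Return the processed data
    return data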
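
To show how a function like this might be called, here is a minimal usage sketch. It is not the original author's code: `split_frame`, `add_square`, the `n_workers` parameter, and the column names `x` and `x_squared` are hypothetical names introduced for illustration, and the splitting uses `iloc` slices instead of `np.array_split`. The worker function is defined at module level so it can be pickled and sent to the worker processes, and the call sits under an `if __name__ == "__main__":` guard, which is needed on platforms that start worker processes by spawning a fresh interpreter (for example Windows).

```python
import numpy as np
import pandas as pd
from multiprocessing import Pool, cpu_count


def split_frame(df, n_chunks):
    """Split df row-wise into at most n_chunks pieces of near-equal size."""
    n_chunks = max(1, min(n_chunks, len(df)))
    bounds = np.linspace(0, len(df), n_chunks + 1, dtype=int)
    return [df.iloc[start:stop] for start, stop in zip(bounds[:-1], bounds[1:])]


def add_square(chunk):
    """Hypothetical worker: adds a column with the square of column 'x'."""
    chunk = chunk.copy()  # avoid modifying a slice of the caller's frame
    chunk["x_squared"] = chunk["x"] ** 2
    return chunk


def parallelize(df, func, n_workers=None):
    """Apply func to each chunk in a process pool and merge the results."""
    n_workers = n_workers or cpu_count()
    chunks = split_frame(df, n_workers)
    # pool.map blocks until every chunk is processed; the with-block then
    # shuts the pool down.
    with Pool(n_workers) as pool:
        parts = pool.map(func, chunks)
    return pd.concat(parts)


if __name__ == "__main__":
    df = pd.DataFrame({"x": range(10)})
    result = parallelize(df, add_square, n_workers=2)
    print(result)
```

Because `pool.map` returns results in the same order as the input chunks, the merged frame keeps the original row order. For small frames the cost of starting processes and pickling the chunks can outweigh the gain, so this pattern tends to pay off only when the work per chunk is substantial.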
