Python最快的方式来读取大文本文件（几GB）

最新推荐文章于 2024-05-14 14:10:39 发布

喜欢安静的程序猿

最新推荐文章于 2024-05-14 14:10:39 发布

阅读量9.2k

点赞数 2

本文链接：https://blog.csdn.net/weixin_39363245/article/details/100521043

版权

我有一个大文本文件（约7 GB）。我正在寻找是否存在阅读大文本文件的最快方法。我一直在阅读有关使用多种方法作为读取chunk-by-chunk以加快进程的过程。

例如，effbot建议

# File: readline-example-3.py

file = open("sample.txt")

while 1:
    lines = file.readlines(100000)
    if not lines:
        break
    for line in lines:
        pass # do something**strong text**

为了每秒处理96,900行文本。其他作者建议使用islice（）

from itertools import islice

with open(...) as f:
    while True:
        next_n_lines = list(islice(f, n))
        if not next_n_lines:
            break
        # process next_n_lines

list(islice(f, n))将返回n文件的下一行列表f。在循环中使用它将为您提供大量n行的文件

解决方案

with open(<FILE>) as FileObj:
    for lines in FileObj:
        print lines # or do some other thing with the line...

将在此时读取一行内存，并在完成后关闭文件...

本文首发于Python黑洞网，csdn同步更新

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

喜欢安静的程序猿

关注关注

2
点赞
踩
20

收藏

觉得还不错? 一键收藏
0
评论
Python最快的方式来读取大文本文件（几GB）

我有一个大文本文件（约7 GB）。我正在寻找是否存在阅读大文本文件的最快方法。我一直在阅读有关使用多种方法作为读取chunk-by-chunk以加快进程的过程。例如，effbot建议# File: readline-example-3.pyfile = open("sample.txt")while 1: lines = file.readlines(100000) ...
复制链接

扫一扫