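In my program, I collect all directories and files (walk), write them into a dictionary with the filename as the key and the path as the value, then read a keyword from the interface (tk.Entry) and return all matches as two lists. I display them (tk.Listbox) and open the selected one (win32shell).
I used this one to create two lists with a single comprehension. In the comments, it says "Just running two separate list comprehensions is simpler and probably faster." So I am confused about which one to use. This program will run over ~3TB of data, which I don't have yet, so I can't run it to see which would be faster.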
Here is my minimized code. I removed the interface and defined the keyword and the path through the keywrd and folder variables, respectively.
import os
import sqlite3
audio_ext = [".mp3",".mp4","etc..."]
folder = "C:\\Users\\Lafexlos\\Music"
keywrd = "mo" ##searching keyword which I normally get from user by Entry
conn = sqlite3.connect(":memory:")
data = conn.cursor()
data.execute(" create table if not exists audio(path text,\
filename text UNIQUE) ")
for roots, dirs, files in os.walk(folder):
    for item in os.listdir(roots):
        if "."+item.split(".")[-1].lower() in audio_ext:
            # The line above is not eye-friendly, but it only checks the file's extension
            data.execute(" INSERT OR IGNORE into audio \
                        (path, filename) VALUES (?,?)",(roots,item))
lines = {}
musics = data.execute("select * from audio")
[lines.update({row[1]:row[0]}) for row in musics]
# This is the option 1. Using zip to create two lists
results,paths = zip(*[(k,v) for k,v in lines.items() if keywrd in k])
# This is option 2. Running same list comprehension twice
results = [k for (k,v) in lines.items() if keywrd in k]
paths = [v for (k,v) in lines.items() if keywrd in k]
print ("Results: ", results)
print ("\n\nPaths: ", paths)
As I said above, my question is: which one is faster when working with a large amount of data?
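Without the real data, one rough way to compare the two options might be to time them with timeit on a synthetic dictionary. The sketch below is only an illustration: the generated filenames, the paths, the dictionary size, and the function names option_zip and option_two_comprehensions are made up for the test and are not part of the original program.

```python
import timeit

# Synthetic stand-in for the `lines` dict (filename -> path).
# Names, paths and size are invented purely for this timing sketch.
lines = {}
for i in range(100000):
    name = "track_{}_mo.mp3".format(i) if i % 3 == 0 else "track_{}.mp3".format(i)
    lines[name] = "C:\\Music\\dir{}".format(i % 100)

keywrd = "mo"


def option_zip():
    # Option 1: one comprehension, then zip(*) splits the pairs into two tuples
    results, paths = zip(*[(k, v) for k, v in lines.items() if keywrd in k])
    return results, paths


def option_two_comprehensions():
    # Option 2: the same filter run twice, once per output list
    results = [k for (k, v) in lines.items() if keywrd in k]
    paths = [v for (k, v) in lines.items() if keywrd in k]
    return results, paths


zip_results, zip_paths = option_zip()
two_results, two_paths = option_two_comprehensions()

# Both options should select the same entries in the same order
assert list(zip_results) == two_results
assert list(zip_paths) == two_paths

for func in (option_zip, option_two_comprehensions):
    seconds = timeit.timeit(func, number=10)
    print(func.__name__, round(seconds, 3), "seconds for 10 runs")
```

The timings depend on the machine, the Python version and how many keys match, so they are only a hint for the real data set. One behavioral difference is certain: when nothing matches the keyword, the zip version raises ValueError while unpacking, whereas the two comprehensions simply return two empty lists.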