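Your program has a number of small errors. Let me try to list them:
re.findall returns a list, but you seem to treat it as a single string. Try email = email[0] to use only the first element of the list.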
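Your first SELECT statement has (email). Putting a single item inside parentheses does not make it a tuple. Try (email,) or a one-element list such as [email].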
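The if that follows the for loop needs to run on every iteration of the for loop, so it must be indented one level.
The body of an if cannot be empty. Either uncomment the statement there or change it to pass.
The body of the final for loop needs to be indented one level.
As a courtesy to Stack Overflow readers, please copy-paste an entire standalone program, not just code fragments.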
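The first two points can be checked in isolation. The following is only a minimal sketch, written in Python 3 syntax; the sample line and the one-row table are made up for illustration and are not taken from your input file.

```python
import re
import sqlite3

# Hypothetical sample line in the style of an mbox "From: " header.
line = 'From: stephen.marquard@uct.ac.za'

# re.findall always returns a list, even when there is exactly one match.
matches = re.findall(r'@(\S+[a-zA-Z]+)', line)
print(matches)
domain = matches[0]  # take the first element to get a plain string

# Parentheses alone do not create a tuple; the trailing comma does.
print(type((domain)))   # <class 'str'>
print(type((domain,)))  # <class 'tuple'>

# sqlite3 expects a sequence of parameters, so pass (domain,) or [domain].
conn = sqlite3.connect(':memory:')
cur = conn.cursor()
cur.execute('CREATE TABLE Counts (email TEXT, count INTEGER)')
cur.execute('INSERT INTO Counts (email, count) VALUES (?, 1)', (domain,))
cur.execute('SELECT count FROM Counts WHERE email = ?', [domain])
row = cur.fetchone()
print(row)
conn.close()
```

Passing a bare string as the parameters would make sqlite3 treat each character as a separate binding, which is why the one-element tuple or list matters.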
Here is the program after I fixed the problems:

import sqlite3
import re
conn = sqlite3.connect(':memory:')
cur = conn.cursor()
cur.execute('''
DROP TABLE IF EXISTS Counts''')
cur.execute('''
CREATE TABLE Counts (email TEXT, count INTEGER)''')
fname = raw_input('Enter file name: ')
if ( len(fname) < 1 ) : fname = 'mbox-short.txt'
fh = open(fname)
for line in fh:
    if not line.startswith('From: ') : continue
    line = line.rstrip()
    email = re.findall('@(\S+[a-zA-Z]+)', line)
    email = email[0]
    cur.execute('SELECT count FROM Counts WHERE email = ? ', (email,))
    row = cur.fetchone()
    if row is None:
        cur.execute('''INSERT INTO Counts (email, count)
            VALUES ( ?, 1 )''', ( email, ) )
    else :
        cur.execute('UPDATE Counts SET count=count+1 WHERE email = ?',
            (email, ))
    # This statement commits outstanding changes to disk each
    # time through the loop - the program can be made faster
    # by moving the commit so it runs only after the loop completes
    conn.commit()
# https://www.sqlite.org/lang_select.html
sqlstr = 'SELECT email, count FROM Counts ORDER BY count DESC LIMIT 10'
print "Counts:"
for row in cur.execute(sqlstr) :
    print str(row[0]), row[1]
cur.close()
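
The comment in the program notes that it would run faster with the commit moved after the loop. A rough sketch of that variant follows, in Python 3 syntax. The helper name count_domains and the sample lines are hypothetical, and the empty-match check is an addition of mine. With an in-memory database the commit costs little, so the saving should mainly show up with a file-backed database.

```python
import re
import sqlite3


def count_domains(lines):
    """Count sender domains from 'From: ' lines (hypothetical helper)."""
    conn = sqlite3.connect(':memory:')
    cur = conn.cursor()
    cur.execute('CREATE TABLE Counts (email TEXT, count INTEGER)')
    for line in lines:
        if not line.startswith('From: '):
            continue
        matches = re.findall(r'@(\S+[a-zA-Z]+)', line.rstrip())
        if not matches:
            continue  # skip lines with no domain instead of raising IndexError
        domain = matches[0]
        cur.execute('SELECT count FROM Counts WHERE email = ?', (domain,))
        if cur.fetchone() is None:
            cur.execute('INSERT INTO Counts (email, count) VALUES (?, 1)',
                        (domain,))
        else:
            cur.execute('UPDATE Counts SET count = count + 1 WHERE email = ?',
                        (domain,))
    conn.commit()  # one commit after the loop rather than one per line
    rows = cur.execute(
        'SELECT email, count FROM Counts ORDER BY count DESC LIMIT 10'
    ).fetchall()
    conn.close()
    return rows


# Made-up input standing in for the lines of an mbox file.
sample = [
    'From: a@uct.ac.za\n',
    'Subject: ignored\n',
    'From: b@umich.edu\n',
    'From: c@uct.ac.za\n',
]
for domain, count in count_domains(sample):
    print(domain, count)
```

To run it on a real file, pass an open file object, for example count_domains(open('mbox-short.txt')).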