本问题已经有最佳答案,请猛点这里访问。
我在一个pandas df中有大约1.3m的字符串(表示用户在发送IT帮助台时的需求)。我还想从这些字符串中删除一系列29813个名称,以便只剩下描述问题的单词。这里有一个数据的小例子——它是有效的,但花费的时间太长了。我正在寻找一种更有效的方法来实现这一结果:
输入:
List1 = ["George Lucas has a problem logging in",
"George Clooney is trying to download data into a spreadsheet",
"Bart Graham needs to logon to CRM urgently",
"Lucy Anne George needs to pull management reports"]
List2 = ["Access Team","Microsoft Team","Access Team","Reporting Team"]
df = pd.DataFrame({"Team":List2,"Text":List1})
xwords = pd.Series(["George","Lucas","Clooney","Lucy","Anne","Bart","Graham"])
for word in range(len(xwords)):
df["Text"] = df["Text"].str.replace(xwords[word],"!"