sub()需要传入两个参数,第一个是字符串,用以取代发现的匹配;第二个也是一个字符串,即正则表达式。
注意:两个参数间必须有空格
>>> import re
>>> namesRegex=re.compile(r'Agent\w+')#最开始尝试时,没有加空格,因此失败
>>> namesRegex.sub('CENSORED','Agent Alice gave the secret documents to Agent Bob.')
'Agent Alice gave the secret documents to Agent Bob.'
>>> namesRegex=re.compile(r'Agent \w+')#加入空格后,成功,✌
>>> namesRegex.sub('CENSORED','Agent Alice gave the secret documents to Agent Bob.')
'CENSORED gave the secret documents to CENSORED.'
>>> namesRegex=re.compile(r'Agent,\w+')#逗号也不行
>>> namesRegex.sub('CENSORED','Agent Alice gave the secret documents to Agent Bob.')
'Agent Alice gave the secret documents to Agent Bob.'
加入参数,替换匹配文本的一部分;使用正则表达式Agent (\w)\w
例:想隐去密探姓名,输入如下
>>> agentNamesRegex=re.compile(r'Agent(\w)\w*')#漏加空格,再次跌倒在同一个坑里
>>> agentNamesRegex.sub(r'\1****','Agent Alice told Agent Carol that Agent\
Eve knew Agent Bob was a double agent.')
'Agent Alice told Agent Carol that E**** knew Agent Bob was a double agent.'
>>> agentNamesRegex=re.compile(r'Agent (\w)\w*')#加上空格,成功
>>> agentNamesRegex.sub(r'\1****','Agent Alice told Agent Carol that Agent\
Eve knew Agent Bob was a double agent.')
'A**** told C**** that AgentEve knew B**** was a double agent.'
>>> agentNamesRegex=re.compile(r'Agent (\w)\w+')#测试了一下,将正则表达式的*(0次或多次)换为+(1次或多次),不影响
>>> agentNamesRegex.sub(r'\1****','Agent Alice told Agent Carol that Agent\
Eve knew Agent Bob was a double agent.')
'A**** told C**** that AgentEve knew B**** was a double agent.'