#匹配网址
import re
strtest = """
http://www.interoem.com/messageinfo.asp?id=35
http://3995503.com/class/class09/news_show.asp?id=14
http://lib.wzmc.edu.cn/news/onews.asp?id=769
http://www.zy-ls.com/alfx.asp?newsid=377&id=6
http://www.fincm.com/newslist.asp?id=415
"""
new_strtest = re.findall("http://[\w+.|\d+.].*/",strtest)
print("修正前:",strtest)
print("修正后:")
for new in new_strtest:
print(new)
运行结果:
修正前:
http://www.interoem.com/messageinfo.asp?id=35
http://3995503.com/class/class09/news_show.asp?id=14
http://lib.wzmc.edu.cn/news/onews.asp?id=769
http://www.zy-ls.com/alfx.asp?newsid=377&id=6
http://www.fincm.com/newslist.asp?id=415
修正后:
http://www.interoem.com/
http://3995503.com/class/class09/
http://lib.wzmc.edu.cn/news/
http://www.zy-ls.com/
http://www.fincm.com/
<