相关的 python 库 Diffbot、Readability、Newspaper GeneralNewsExtractor gerapy-auto-extractor(崔庆才): https://mp.weixin.qq.com/s?__biz=Mzg3MjU3NzU1OA==&mid=2247496418&idx=1&sn=5c5733bd2b316818d8a39297435fd576&source=41#wechat_redirect