Python strip_tags with support for keeping specified tags

Latest recommended article published on 2024-05-16 15:50:31

weixin_30765319

Tags: python

Original link: http://www.cnblogs.com/bushe/p/4482114.html

# coding: utf-8

import re


def strip_tags(string, allowed_tags=''):
    """Strip HTML tags, keeping those listed in allowed_tags (e.g. 'b,p')."""
    # Get a list of all allowed tag names, ignoring blanks and whitespace.
    names = [name.strip() for name in allowed_tags.split(',') if name.strip()]
    if not names:
        # If no allowed tags, remove all.
        return re.sub(r'<[^>]*?>', '', string)

    # One case-insensitive pattern for every allowed opening or closing tag.
    # The lookahead stops 'b' from also matching '<br>' or '<body>'.
    alternatives = '|'.join(re.escape(name) for name in names)
    allowed = re.compile(r'</?(?:%s)(?=[\s/>])[^>]*>' % alternatives, re.I)

    def keep_if_allowed(match):
        tag = match.group(0)
        return tag if allowed.match(tag) else ''

    # Visit each tag once and drop the ones that are not allowed.
    return re.sub(r'<[^>]+>', keep_if_allowed, string)
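
Regex-based stripping like the function above can misfire on unusual markup, for example a `>` inside an attribute value. As a point of comparison, here is a minimal sketch of the same idea built on the standard library's `html.parser.HTMLParser` (Python 3). This is not the original author's method: the names `TagStripper` and `strip_tags_parsed` are hypothetical, and the sketch assumes that allowed closing tags may be re-emitted in lowercase and that HTML comments can be dropped.

```python
from html.parser import HTMLParser


class TagStripper(HTMLParser):
    """Hypothetical helper: keeps text and re-emits only the allowed tags."""

    def __init__(self, allowed_tags):
        # convert_charrefs=False so entities such as &amp; pass through as written.
        super().__init__(convert_charrefs=False)
        self.allowed = {t.strip().lower() for t in allowed_tags.split(',') if t.strip()}
        self.parts = []

    def handle_starttag(self, tag, attrs):
        if tag in self.allowed:
            # get_starttag_text() returns the start tag exactly as it appeared.
            self.parts.append(self.get_starttag_text())

    def handle_startendtag(self, tag, attrs):
        # Self-closing tags such as <br/>: emit once, without a synthetic end tag.
        self.handle_starttag(tag, attrs)

    def handle_endtag(self, tag):
        if tag in self.allowed:
            # The parser lowercases tag names, so the end tag is emitted lowercase.
            self.parts.append('</%s>' % tag)

    def handle_data(self, data):
        self.parts.append(data)

    def handle_entityref(self, name):
        self.parts.append('&%s;' % name)

    def handle_charref(self, name):
        self.parts.append('&#%s;' % name)


def strip_tags_parsed(html, allowed_tags=''):
    """Strip tags from html, keeping the comma-separated allowed_tags."""
    parser = TagStripper(allowed_tags)
    parser.feed(html)
    parser.close()
    return ''.join(parser.parts)


if __name__ == '__main__':
    print(strip_tags_parsed('<p>Hi <b>there</b><br/>you</p>', 'b'))
```

Because the parser reports tag names rather than raw strings, allowing `b` here does not accidentally keep `<br>`, and matching is case-insensitive without extra flags.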

Reposted from: https://www.cnblogs.com/bushe/p/4482114.html
