Java过滤HTML标签、属性等正则表达式汇总

Java过滤HTML标签、属性等正则表达式汇总

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
$str =preg_replace( "/\s+/" " " $str );  //过滤多余回车
$str =preg_replace( "/<[ ]+/si" , "<" , $str );  //过滤<__("<"号后面带空格)
   
$str =preg_replace( "/<\!--.*?-->/si" , "" , $str );  //注释
$str =preg_replace( "/<(\!.*?)>/si" , "" , $str );  //过滤DOCTYPE
$str =preg_replace( "/<(\/?html.*?)>/si" , "" , $str );  //过滤html标签
$str =preg_replace( "/<(\/?head.*?)>/si" , "" , $str );  //过滤head标签
$str =preg_replace( "/<(\/?meta.*?)>/si" , "" , $str );  //过滤meta标签
$str =preg_replace( "/<(\/?body.*?)>/si" , "" , $str );  //过滤body标签
$str =preg_replace( "/<(\/?link.*?)>/si" , "" , $str );  //过滤link标签
$str =preg_replace( "/<(\/?form.*?)>/si" , "" , $str );  //过滤form标签
$str =preg_replace( "/cookie/si" , "COOKIE" , $str );  //过滤COOKIE标签
   
$str =preg_replace( "/<(applet.*?)>(.*?)<(\/applet.*?)>/si" , "" , $str );  //过滤applet标签
$str =preg_replace( "/<(\/?applet.*?)>/si" , "" , $str );  //过滤applet标签
   
$str =preg_replace( "/<(style.*?)>(.*?)<(\/style.*?)>/si" , "" , $str );  //过滤style标签
$str =preg_replace( "/<(\/?style.*?)>/si" , "" , $str );  //过滤style标签
   
$str =preg_replace( "/<(title.*?)>(.*?)<(\/title.*?)>/si" , "" , $str );  //过滤title标签
$str =preg_replace( "/<(\/?title.*?)>/si" , "" , $str );  //过滤title标签
   
$str =preg_replace( "/<(object.*?)>(.*?)<(\/object.*?)>/si" , "" , $str );  //过滤object标签
$str =preg_replace( "/<(\/?objec.*?)>/si" , "" , $str );  //过滤object标签
   
$str =preg_replace( "/<(noframes.*?)>(.*?)<(\/noframes.*?)>/si" , "" , $str );  //过滤noframes标签
$str =preg_replace( "/<(\/?noframes.*?)>/si" , "" , $str );  //过滤noframes标签
   
$str =preg_replace( "/<(i?frame.*?)>(.*?)<(\/i?frame.*?)>/si" , "" , $str );  //过滤frame标签
$str =preg_replace( "/<(\/?i?frame.*?)>/si" , "" , $str );  //过滤frame标签
   
$str =preg_replace( "/<(script.*?)>(.*?)<(\/script.*?)>/si" , "" , $str );  //过滤script标签
$str =preg_replace( "/<(\/?script.*?)>/si" , "" , $str );  //过滤script标签
$str =preg_replace( "/javascript/si" , "Javascript" , $str );  //过滤script标签
$str =preg_replace( "/vbscript/si" , "Vbscript" , $str );  //过滤script标签
$str =preg_replace( "/on([a-z]+)\s*=/si" , "On\\1=" , $str );  //过滤script标签
$str =preg_replace( "/&#/si" , "&#" , $str );  //过滤script标签,如javAsCript:alert(

清除空格,换行

1
2
3
4
5
6
7
8
9
10
11
function  DeleteHtml( $str )
{
$str  = trim( $str );
$str  strip_tags ( $str , "" );
$str  ereg_replace ( "\t" , "" , $str );
$str  ereg_replace ( "\r\n" , "" , $str );
$str  ereg_replace ( "\r" , "" , $str );
$str  ereg_replace ( "\n" , "" , $str );
$str  ereg_replace ( " " , " " , $str );
return  trim( $str );
}

过滤HTML属性

1,过滤所有html标签的正则表达式:

1
2
3
4
5
</?[^>]+>
  
//过滤所有html标签的属性的正则表达式:
  
$html  = preg_replace( "/<([a-zA-Z]+)[^>]*>/" , "<\\1>" , $html );

  

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包

打赏作者

花语者

你的鼓励将是我创作的最大动力

¥1 ¥2 ¥4 ¥6 ¥10 ¥20
扫码支付:¥1
获取中
扫码支付

您的余额不足,请更换扫码支付或充值

打赏作者

实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值