asp.net替换html标签的程序

最新推荐文章于 2021-07-13 19:00:20 发布

步恒者

最新推荐文章于 2021-07-13 19:00:20 发布

阅读量1.6k

点赞数

分类专栏：程序生涯文章标签： asp.net html string regex div function

程序生涯专栏收录该内容

64 篇文章 0 订阅

订阅专栏

HTML代码导致显示的问题，但这也是一个程序的BUG，以前写asp时都会写HtmlEncode函数，这次却忽略了，找了一下以前的程序，如下：
function HTMLEncode(fString)
fString=replace(fString,";",";")
fString=replace(fString,"<","<")
fString=replace(fString,">",">")
fString=replace(fString,"/","/")
fString=replace(fString,"--","--")
fString=replace(fString,CHR(9),"	")
fString=replace(fString,CHR(10),"<br>")
fString=replace(fString,CHR(13),"")
fString=replace(fString,CHR(22),"")
fString=replace(fString,CHR(32)," ")
fString=replace(fString,CHR(34),""")'双引号
fString=replace(fString,CHR(39),"'")'单引号
HTMLEncode=fString
end function
但是这种程序在PHP中是不用写的，已经有人写好。
我想asp.net肯定不会落后，找了一下，在System.Web.HttpUtility找到了。
PHP中还有自带的一些正则式的公式，功能比较强，如ereg()等。.net中没有发现有类似函数，不过我觉得自己写写也不是坏事，因为并不难，还可以知其所以然。就如上面那个asp程序，知道他做了些什么。
附带一下asp.net替换html标签的程序：
public static string NoHTML(string Htmlstring)
    {

        //删除脚本

        Htmlstring = Regex.Replace(Htmlstring, @"<script[^>]*?>.*?</script>", "", RegexOptions.IgnoreCase);

        //删除HTML

        Htmlstring = Regex.Replace(Htmlstring, @"<(.[^>]*)>", "", RegexOptions.IgnoreCase);

        Htmlstring = Regex.Replace(Htmlstring, @"([/r/n])[/s]+", "", RegexOptions.IgnoreCase);

        Htmlstring = Regex.Replace(Htmlstring, @"-->", "", RegexOptions.IgnoreCase);

        Htmlstring = Regex.Replace(Htmlstring, @"",
          @"<!--.*/n",
          @"<FONT.*?>",
          @"<SPAN.*?>",
          @"<?xml.*?/>",
          @"</?",
          @"<(///s*)?!?((/w+:)?/w+)(/w+(/s*=?/s*(([""'])([url=file:[""'tbnr]|[^/7])*?/7|/w+)|.{0})|/s)*?(///s]//[""'tbnr]|[^/7])*?/7|/w+)|.{0})|/s)*?(///s[/url]*)?>",
          @"/[p/]",
          @"/[//p/]",
          @"/[div/]",
          @"/[//div/]",
          @"/[br/]",
          @"/[img",
          @"/[a",
          @"/[//a/]",
         };
            string[] aryRep = {

           "",
                                  "[p]",
                                  "[/p]",
                                  "[div]",
                                  "[/div]",
                                  "[br]",
                                  "[img",
                                  "[a",
                                  "[/a]",
           "",
           "",
           "/"",
           "&",
           "<",
           ">",
           " ",
           "/xa1",
           "/xa2",
           "/xa3",
           "/xa9",
           "",
           "/r/n",
           "",
           "",
           "",
           "",
           "",
           "",
                                   "<div style='text-indent:2em'>",
           "</div>",
                                   "<div style='text-indent:2em'>",
           "</div>",
           "<div style='text-indent:2em'> </div>",
           "<img",
                                   "<a",
                                  "</a>"
          };
            string newReg = aryReg[0];
            string strOutput = strHtml;
            for (int i = 0; i < aryReg.Length; i++)
            {
                Regex regex = new Regex(aryReg[i], RegexOptions.IgnoreCase);
                strOutput = regex.Replace(strOutput, aryRep[i]);
            }
            //strOutput.Replace("[p]", "<p>");
            //strOutput.Replace("[/p]>", "</p>");
            //strOutput.Replace("[br]", "<br />");
            //strOutput.Replace("[img", "<img");
            return strOutput;
        }

本文引用地址： http://www.sciencenet.cn/m/user_content.aspx?id=39127

步恒者

关注

0
点赞
踩
6

收藏

觉得还不错? 一键收藏
0
评论
asp.net替换html标签的程序

HTML代码导致显示的问题，但这也是一个程序的BUG，以前写asp时都会写HtmlEncode函数，这次却忽略了，找了一下以前的程序，如下： function HTMLEncode(fString) fString=replace(fString,";",";") fString=replace(fString,"fString=replace(fString,">"
复制链接

扫一扫

专栏目录