正则表达html表格,表中的C＃正则表达式html表(C# regex html table inside a table)

最新推荐文章于 2021-12-13 15:42:00 发布

彭浩翔

最新推荐文章于 2021-12-13 15:42:00 发布

阅读量150

点赞数

文章标签：正则表达html表格

Don't do this.

HTML is not a regular grammar and so a regular expression is not a good tool with which to parse it. What you are asking in your last sentence is for a contextual parser, not a regular expression. Bare regular expression parsing it is too likely fail to parse HTML correctly to be responsible coding.

HtmlAgilityPack is a MsPL-licensed solution I've used in the past that has widely acceptable license terms and provides a well-formed DOM which can be probed with XPath or manipulated in other useful ways ("Extract all text, dropping out tags" being a popular one for importing HTML mail for search, for example, that is nigh trivial after letting a DOM parser rip through the HTML and only coding the part that adds value for your specific business case).