C#: parsing an HTML page to get the content of a specific node

Latest recommended article published 2024-07-15 08:00:00

Simon Jing



Category: c#    Tags: c#, parsing HTML pages

Article link: https://blog.csdn.net/Simon1003/article/details/80359325


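This article shows how to use the HtmlAgilityPack library in C# to parse an HTML page, extract the node with a specific ID, and convert it to XML for further processing. It walks through loading the page, locating the element, parsing its content, and downloading PDFs.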

Summary generated automatically by CSDN.

First, add a reference to HtmlAgilityPack.dll.
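With the reference in place, a quick way to check it is to parse an in-memory string before pointing the code at a live URL. The sketch below is only illustrative: the sample markup and the names `NodeLookupDemo` and `GetInnerHtmlById` are assumptions and are not part of the original article.

```csharp
using System;
using HtmlAgilityPack;

public static class NodeLookupDemo
{
    // Hypothetical helper: returns the inner HTML of the element with the
    // given id, or null when no such element exists.
    public static string GetInnerHtmlById(string html, string id)
    {
        var doc = new HtmlDocument();
        doc.LoadHtml(html);                      // parse from a string, no network access
        HtmlNode node = doc.GetElementbyId(id);  // null if the id is not present
        return node == null ? null : node.InnerHtml;
    }

    public static void Main()
    {
        // Made-up sample page that reuses the id from the article.
        string html =
            "<html><body><table id=\"main-list-table\">" +
            "<tr><td>row 1</td></tr></table></body></html>";

        Console.WriteLine(GetInnerHtmlById(html, "main-list-table") ?? "(not found)");
        Console.WriteLine(GetInnerHtmlById(html, "missing-id") ?? "(not found)");
    }
}
```

Returning null for a missing id keeps the "element not found" case explicit, which matters because `GetElementbyId` does not throw when the id is absent.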

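// Requires: using HtmlAgilityPack; using System.Xml;
private void JieXiHTML(string htmlURL)
{
    WirteLog("Load page content -- start");
    HtmlWeb webClient = new HtmlWeb();
    HtmlAgilityPack.HtmlDocument doc = webClient.Load(htmlURL);
    // Look up the element whose id is "main-list-table".
    var rootNode = doc.GetElementbyId("main-list-table");
    WirteLog("Load page content -- end");
    if (rootNode == null)
    {
        // GetElementbyId returns null when the id is missing.
        WirteLog("Element main-list-table not found");
        return;
    }

    WirteLog("Parse page content -- start");
    // Wrap the node's inner HTML in a <table> root so it can be loaded as XML.
    // LoadXml throws XmlException if the markup is not well-formed XML.
    string xml = "<?xml version=\"1.0\" encoding=\"utf-8\" ?>" + "<table>" + rootNode.InnerHtml + "</table>";
    XmlDocument xmlDoc = new XmlDocument();
    xmlDoc.LoadXml(xml);
    WirteLog("Parse page content -- end");

    WirteLog("Download page PDFs -- start");
    // Select every <tr> that is a direct child of the <table> root.
    XmlNodeList nodelist = xmlDoc.SelectNodes("//table/tr");
    // The source listing is cut off here; the PDF download loop is not shown.
}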
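The listing stops right after `SelectNodes`. As a minimal sketch of what walking such a node list can look like, the example below uses only `System.Xml` on a hard-coded fragment. The sample rows, the `RowLinks` class, and the `ExtractLinks` method are hypothetical and do not come from the original code. It also assumes the fragment is already well-formed XML, which real-world HTML often is not.

```csharp
using System;
using System.Collections.Generic;
using System.Xml;

public static class RowLinks
{
    // Hypothetical helper: wraps a fragment of table rows in a <table> root,
    // then returns the href of the first <a> found in each row.
    public static List<string> ExtractLinks(string rowsFragment)
    {
        string xml = "<?xml version=\"1.0\" encoding=\"utf-8\" ?>"
                     + "<table>" + rowsFragment + "</table>";
        var xmlDoc = new XmlDocument();
        xmlDoc.LoadXml(xml); // throws XmlException on malformed markup

        var links = new List<string>();
        XmlNodeList rows = xmlDoc.SelectNodes("//table/tr");
        foreach (XmlNode row in rows)
        {
            // ".//a[@href]" is evaluated relative to the current row.
            XmlNode anchor = row.SelectSingleNode(".//a[@href]");
            if (anchor != null)
            {
                links.Add(anchor.Attributes["href"].Value);
            }
        }
        return links;
    }

    public static void Main()
    {
        // Made-up sample rows; the real page structure is not shown in the article.
        string fragment =
            "<tr><td><a href=\"/files/a.pdf\">A</a></td></tr>" +
            "<tr><td>no link here</td></tr>" +
            "<tr><td><a href=\"/files/b.pdf\">B</a></td></tr>";

        foreach (string link in ExtractLinks(fragment))
        {
            Console.WriteLine(link);
        }
    }
}
```

Rows without a link are skipped rather than treated as errors, so a header or separator row in the table does not interrupt the loop.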
