html给td内容加删除线,搜索HTML线和删除线不与</form></td><a

weixin_39978276

于 2021-06-10 17:35:17 发布

阅读量112

点赞数

C++ HTML解析正则表达式用户信息提取轻量级库

关键词由CSDN通过智能技术生成

I have an HTML file with very bad formatted code that I get from a website, I want to extract some very small pieces of information.搜索HTML线和删除线不与

I am only interested in lines that start like this:

user897HouseA2HouseA Type12 1 of 2user12310

and I want to extract 3 fields:

A:HouseA

B:HouseA Type12

C:user123

D:10

I know I've seen people recommend HTML Agility Pack and lib2xml but I really don't think I need all that. My app is in C/C++.

I am already using getline to start reading lines, I am just not sure what's the best way to proceed. Thanks!

std::ifstream data("Home.html");

std::string line;

while(std::getline(data,line))

{

linenum++;

std::stringstream lineStream(line);

std::string user;

if (strncmp(line.c_str(), "

",strlen("")) == 0)

{

printf("found a wanted line in line:%d\n", linenum);

}

}

2011-02-17

emge

+0

你有没有尝试用正则表达式解析你的HTML？ :-p –

2011-02-17 22:48:43

+0

你有什么库可以使用C++ stdlib吗？你的目标是什么平台？ –

2011-02-17 22:51:14

weixin_39978276

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
html给td内容加删除线,搜索HTML线和删除线不与</form></td><a

I have an HTML file with very bad formatted code that I get from a website, I want to extract some very small pieces of information.搜索HTML线和删除线不与I am only interested in lines that start like this: u...
复制链接

扫一扫

评论

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。