html给td内容加删除线,搜索HTML线和删除线不与</form></td>​​<a

I have an HTML file with very bad formatted code that I get from a website, I want to extract some very small pieces of information.搜索HTML线和删除线不与​​

I am only interested in lines that start like this:

user897HouseA2HouseA Type12 1 of 2user12310

and I want to extract 3 fields:

A:HouseA

B:HouseA Type12

C:user123

D:10

I know I've seen people recommend HTML Agility Pack and lib2xml but I really don't think I need all that. My app is in C/C++.

I am already using getline to start reading lines, I am just not sure what's the best way to proceed. Thanks!

std::ifstream data("Home.html");

std::string line;

while(std::getline(data,line))

{

linenum++;

std::stringstream lineStream(line);

std::string user;

if (strncmp(line.c_str(), "

",strlen("")) == 0)

{

printf("found a wanted line in line:%d\n", linenum);

}

}

2011-02-17

emge

+0

你有没有尝试用正则表达式解析你的HTML? :-p –

2011-02-17 22:48:43

+0

你有什么库可以使用C++ stdlib吗?你的目标是什么平台? –

2011-02-17 22:51:14

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值