matlab爬虫

https://www.bilibili.com/video/BV1ti4y1t7L4/?spm_id_from=333.999.0.0&vd_source=0fe41db04b329b4a48aa7e33b09042ab

 ie_operation

ie=actxserver('internetexplorer.application');
ie.Navigate('https://www.baidu.com/');
while ~strcmp(ie.readystate,'READYSTATE_COMPLETE')
pause(0.01)
end
ie.visible = 1;
SearchItem = ie.document.body.getElementsByClassName('s_ipt').item(0);
SearchItem.value = '打浦桥程序员';
ButtonItem = ie.document.body.getElementsByClassName('bg s_btn').item(0);
ButtonItem.click
ie=actxserver('internetexplorer.application');
ie.Navigate('https://www.baidu.com/');
ie.visible = 1;
while ~strcmp(ie.readystate,'READYSTATE_COMPLETE')
    pause(0.01)
end
NumComment = ie.document.body.getElementsByClassName('item-name').length;
CommentAll = cell(NumComment,4);
for i = 0:1:NumComment-1
    Ele_User = ie.document.body.getElementsByClassName('item-name').item(1);
    Ele_User = Ele_User.innerText

    Ele_Position = ie.document.body.getElementsByClassName('item-tc').item(1);
    Ele_Position = Ele_Position.innerText

    Ele_Comment = ie.document.body.getElementsByClassName('item-content').item(1);
    Ele_Comment = Ele_Comment.innerText

    Ele_Date = ie.document.body.getElementsByClassName('item-date').item(i+1);
    Ele_Date = Ele_Date.innerText
    
    CommentAll(i+1,:) = {Ele_User,Ele_Position,Ele_Comment,Ele_Date};
end

https://www.bilibili.com/video/BV1ti4y1V7uS/?spm_id_from=333.999.0.0&vd_source=0fe41db04b329b4a48aa7e33b09042ab

 Webread,Regexpi,Cell,Writetable pachong

url=
da=webread(url);
city= 'city"';
d=regexpi(da,city);
d=d';
for i=1: size(d,1);
    y = da(d(i,1)+35:d(i,1)+50);
    x = regexpi(y,'【^\x00-\xff】');
    h = y(x);

    supplier(i)={h};

end
T= cell2table(supplier','variableName',{'Supplier list'})
   
filename = 'supplier.xlsx';
%writetable(T,filename,'Sheet',1,'Range','A1')

  • 0
    点赞
  • 3
    收藏
    觉得还不错? 一键收藏
  • 0
    评论

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值