php抓取网页内容并json过滤,php抓取这个网页的数据,只要数据,不要html内容,然后json后写入文件,新手求教...

#isU', $s, $m);foreach(array_map('strip_tags', $m[0]) as $r) { $a = preg_split('/\s+/', $r, -1, PREG_SPLIT_NO_EMPTY); $res[] = array_chunk(array_slice($a, 0, -1), 3);}print_r($res);echo json_encode($res); Array( [0] => Array ( [0] => Array ( [0] => 排名 [1] => 球队 [2] => 积分 ) [1] => Array ( [0] => 1 [1] => 尤文图斯 [2] => 102 ) [2] => Array ( [0] => 2 [1] => 罗马 [2] => 85 ) [3] => Array ( [0] => 3 [1] => 那不勒斯 [2] => 78 ) [4] => Array ( [0] => 4 [1] => 佛罗伦萨 [2] => 65 ) [5] => Array ( [0] => 5 [1] => 国际米兰 [2] => 60 ) [6] => Array ( [0] => 6 [1] => 帕尔马 [2] => 58 ) [7] => Array ( [0] => 7 [1] => 都灵 [2] => 57 ) [8] => Array ( [0] => 8 [1] => AC米兰 [2] => 57 ) [9] => Array ( [0] => 9 [1] => 拉齐奥 [2] => 56 ) [10] => Array ( [0] => 10 [1] => 维罗纳 [2] => 54 ) ) [1] => Array ( [0] => Array ( [0] => 进球数 [1] => 球员 [2] => 球队 ) [1] => Array ( [0] => 22 [1] => 伊莫比莱 [2] => 都灵 ) [2] => Array ( [0] => 20 [1] => 托尼 [2] => 维罗纳 ) [3] => Array ( [0] => 19 [1] => 特维斯 [2] => 尤文图斯 ) [4] => Array ( [0] => 17 [1] => 帕拉西奥 [2] => 国际米兰 ) [5] => Array ( [0] => 17 [1] => 伊瓜因 [2] => 那不勒斯 ) [6] => Array ( [0] => 16 [1] => 略伦特 [2] => 尤文图斯 ) [7] => Array ( [0] => 16 [1] => 迪纳塔莱 [2] => 乌迪内斯 ) [8] => Array ( [0] => 16 [1] => 朱塞佩·罗西 [2] => 佛罗伦萨 ) [9] => Array ( [0] => 16 [1] => 贝拉尔迪 [2] => 莎索罗 ) [10] => Array ( [0] => 15 [1] => 卡列洪 [2] => 那不勒斯 ) ))[[["\u6392\u540d","\u7403\u961f","\u79ef\u5206"],["1","\u5c24\u6587\u56fe\u65af","102"],["2","\u7f57\u9a6c","85"],["3","\u90a3\u4e0d\u52d2\u65af","78"],["4","\u4f5b\u7f57\u4f26\u8428","65"],["5","\u56fd\u9645\u7c73\u5170","60"],["6","\u5e15\u5c14\u9a6c","58"],["7","\u90fd\u7075","57"],["8","AC\u7c73\u5170","57"],["9","\u62c9\u9f50\u5965","56"],["10","\u7ef4\u7f57\u7eb3","54"]],[["\u8fdb\u7403\u6570","\u7403\u5458","\u7403\u961f"],["22","\u4f0a\u83ab\u6bd4\u83b1","\u90fd\u7075"],["20","\u6258\u5c3c","\u7ef4\u7f57\u7eb3"],["19","\u7279\u7ef4\u65af","\u5c24\u6587\u56fe\u65af"],["17","\u5e15\u62c9\u897f\u5965","\u56fd\u9645\u7c73\u5170"],["17","\u4f0a\u74dc\u56e0","\u90a3\u4e0d\u52d2\u65af"],["16","\u7565\u4f26\u7279","\u5c24\u6587\u56fe\u65af"],["16","\u8fea\u7eb3\u5854\u83b1","\u4e4c\u8fea\u5185\u65af"],["16","\u6731\u585e\u4f69\u00b7\u7f57\u897f","\u4f5b\u7f57\u4f26\u8428"],["16","\u8d1d\u62c9\u5c14\u8fea","\u838e\u7d22\u7f57"],["15","\u5361\u5217\u6d2a","\u90a3\u4e0d\u52d2\u65af"]]]写文件你就自己练练手吧

就一个 file_put_contents 函数

也不是非得用正则,试试用DOM处理,搜下Simple HTML DOM看

本文原创发布php中文网,转载请注明出处,感谢您的尊重!

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值