微信公众号文章内容采集API接口源码其他有用第三方编辑器的微信公众号请在正则里面继续添加规则,此代码良好适配默认的微信公众号发布平台!
是为了指定目标采集或者搬迁到第三方有可用接口的平台。
PHP代码
<?php
$url = @$_GET['url']?$_GET['url'];
$headers = array(
'Host' => 'mmbiz.qpic.cn',
'Connection' => 'keep-alive',
'Pragma' => 'no-cache',
'Accept-Encoding' => 'gzip, deflate, sdch',
'Accept-Language' => 'zh-CN,zh;q=0.8,en;q=0.6,zh-TW;q=0.4'
);
$ch = curl_init();
curl_setopt($ch, CURLOPT_URL,$url);
curl_setopt($ch, CURLOPT_RETURNTRANSFER,1);
$dom = new DOMDocument();
try {
$dom->loadHTML($result);
} catch (Exception $e) {
die('加载HTML时出错:' . $e->getMessage());
}
foreach ($dom->getElementsByTagName('*') as $tag) {
if ($tag->hasAttribute('style')) {
$tag->removeAttribute('style');
}
}
$newstext = '';
$divtext = $dom->getElementById('js_content');
foreach ($divtext->childNodes as $child) {
$newstext .=$child->ownerDocument->saveHTML($child);
}
$Json=json_encode($content,JSON_PRETTY_PRINT|JSON_UNESCAPED_UNICODE);
echo stripslashes($Json);
运行需要PHP配置DOMDocument扩展
扩展下载:公众号文章采集API.zip