网上找到这个问题的答案多是,要模拟浏览器.
可是 有个网站我模拟了还是有错误? 为什么
String url="https://pixabay.com/videos/"; //通过 解析 xml 文本 // Connection conn = Jsoup.connect(url).timeout(5000); // conn.header("Accept", "text/html,application/xhtml+xml,application/xml;q=0.9,image/webp,*/*;q=0.8"); // conn.header("Accept-Encoding", "gzip, deflate, sdch"); // conn.header("Accept-Language", "zh-CN,zh;q=0.8"); // conn.header("User-Agent", "Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/55.0.2883.87 Safari/537.36"); // // Document doc= conn.get(); Document doc= Jsoup.connect(url).maxBodySize(0).timeout(0).userAgent("Mozilla/5.0 (compatible; MSIE 9.0; Windows NT 6.1; WOW64; Trident/5.0").get();
尝试了模拟还是无法正确获取这个页面?