Jsoup抓取网页内容超时问题的处理

在利用Jsoup抓取网页的时候,遇到超时问题:

 

java.net.SocketTimeoutException: Read timed out
	at java.net.SocketInputStream.socketRead0(Native Method)
	at java.net.SocketInputStream.read(Unknown Source)
	at java.net.SocketInputStream.read(Unknown Source)
	at java.io.BufferedInputStream.fill(Unknown Source)
	at java.io.BufferedInputStream.read1(Unknown Source)
	at java.io.BufferedInputStream.read(Unknown Source)
	at sun.net.www.http.HttpClient.parseHTTPHeader(Unknown Source)
	at sun.net.www.http.HttpClient.parseHTTP(Unknown Source)
	at sun.net.www.protocol.http.HttpURLConnection.getInputStream(Unknown Source)
	at java.net.HttpURLConnection.getResponseCode(Unknown Source)
	at org.jsoup.helper.HttpConnection$Response.execute(HttpConnection.java:655)
	at org.jsoup.helper.HttpConnection$Response.execute(HttpConnection.java:628)
	at org.jsoup.helper.HttpConnection.execute(HttpConnection.java:260)
	at org.jsoup.helper.HttpConnection.get(HttpConnection.java:249)
	at com.accord.jsoup.JsoupUtils.getHtmlByUrl2(JsoupUtils.java:70)
	at com.accord.jsoup.JsoupMain.getListDatas(JsoupMain.java:63)
	at com.accord.jsoup.JsoupMain.main(JsoupMain.java:32)
Exception in thread "main" java.lang.NullPointerException
	at com.accord.jsoup.JsoupMain.parserHtml(JsoupMain.java:169)
	at com.accord.jsoup.JsoupMain.getListDatas(JsoupMain.java:64)
	at com.accord.jsoup.JsoupMain.main(JsoupMain.java:32)

 

 

 

 

 

我一开始代码里面是:

Jsoup.connect(url).cookie("JSESSIONID", sessionId).get();

方法获取网页内容,这种方式使用的是默认超时时间30秒。

由于请求数据慢,就会造成超时的,修改成如下就可以了:

Jsoup.connect(url).cookie("JSESSIONID", sessionId).timeout(50000).get(); // 50S

这个时间,自己看着设置!

评论 2
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值