问题描述:本文设计java网页抓取,其中涉及到.net网站抓取,涉及到__VIEWSTATE等参数。
参考文章:
httpclient入门:http://www.ibm.com/developerworks/cn/opensource/os-httpclient/
httpclient怎样自动登陆__VIEWSTATE验证的系统:http://wangrusheng5200.iteye.com/admin/blogs/new
核心代码:
String url = "http://www.2552.net/Book/LC/1.aspx";
// 构造HttpClient的实例
HttpClient httpClient = new HttpClient();
PostMethod postMethod = new PostMethod(url);
postMethod
.addParameter(
"__VIEWSTATE",
"省略");
postMethod.addParameter("__EVENTTARGET", "_ctl0:pager");
postMethod.addParameter("__EVENTARGUMENT", "2");
httpClient.executeMethod(postMethod);
String body = postMethod.getResponseBodyAsString();
System.out.println(body);