该程序是爬取京东上的Java图书信息
book模型:
private String bookID;
private String bookName;
private String bookPrice;
文件结构
1)httpclient maven配置:(不同版本创建HttpClient方法不同)
<dependency>
<groupId>org.apache.httpcomponents</groupId>
<artifactId>httpclient</artifactId>
<version>4.1.2</version>
</dependency>
2)main方法:(获取数据,存放数据)
public class bookMain {
static final Log logger = LogFactory.getLog(bookMain.class); //log4j
public static void main(String[] args) throws Exception {
HttpClient httpclient = new DefaultHttpClient(); //创建HttpClient
String url = "https://search.jd.com/Search?keyword=java&enc&