http://www.xinglongjian.com/index.php/2015/02/06/hadoop2-6hbase0-98-9nutch2-3/
Integrating Nutch/Hadoop/HBase/Solr to build a search engine: installation and running [cluster environment]:
http://blog.csdn.net/jediael_lu/article/details/43086439
A web crawler and search engine based on Nutch+Hadoop+HBase+ElasticSearch:
Setting up a Nutch development environment in Eclipse:
HtmlUnit and crawler issues: http://shenbai.iteye.com/blog/1985844
Problems with AJAX execution in HtmlUnit: http://www.codeweblog.com/htmlunit%E4%B8%ADajax%E6%89%A7%E8%A1%8C%E7%9A%84%E9%97%AE%E9%A2%98/