1. Edit nutch-site.xml
<property>
<name>storage.data.store.class</name>
<value>org.apache.gora.hbase.store.HBaseStore</value>
<description>Default class for storing data</description>
</property>
<property>
<name>http.agent.name</name>
<value>JustinNutchAgent</value>
</property>
<property>
<name>plugin.includes</name>
<value>protocol-httpclient|urlfilter-regex|index-(basic|more)|query-(basic|site|url|lang)|indexer-solr|nutch-extensionpoints|protocol-httpclient|urlfilter-regex|parse-(text|html|msexcel|msword|mspowerpoint|pdf)|summary-basic|scoring-opic|urlnormalizer-(pass|regex|basic)|protocol-http|urlfilter-regex|parse-(html|tika|metatags)|index-(basic|anchor|more|metadata)</value>
</property>
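2. In ivy.xml, change the rev of the org.apache.hadoop dependencies to match your Hadoop version (mine is 1.2.1). The Gora dependencies should look like this: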
<dependency org="org.apache.gora" name="gora-hbase" rev="0.5" conf="*->default" />
<dependency org="org.apache.gora" name="gora-core" rev="0.5" conf="*->default"/>
<dependency org="org.apache.gora" name="gora-compiler-cli" rev="0.5" conf="*->default"/>
<dependency org="org.apache.gora" name="gora-compiler" rev="0.5" conf="*->default"/>
3. Add the following to gora.properties
gora.datastore.default=org.apache.gora.hbase.store.HBaseStore
4. In build.xml, change hadoop-*test*.jar to hadoop-*.jar
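
Before rebuilding, it can help to sanity-check the settings from steps 1 and 3. The following is only a rough sketch in Python, not part of Nutch: the function names (`parse_site_properties`, `parse_java_properties`, `check_store_config`, `plugin_selected`) are made up for illustration. It assumes nutch-site.xml wraps its properties in a `<configuration>` root element and that plugin.includes is matched as a whole-string regular expression against each plugin id. Python's `re` is not identical to Java's regex engine, but for simple alternations like the one above it should behave the same way.

```python
import re
import xml.etree.ElementTree as ET

# Store class used in both nutch-site.xml and gora.properties above.
HBASE_STORE = "org.apache.gora.hbase.store.HBaseStore"


def parse_site_properties(xml_text):
    """Return {name: value} for every <property> element in the XML text."""
    root = ET.fromstring(xml_text)
    props = {}
    for prop in root.iter("property"):
        name = prop.findtext("name")
        value = prop.findtext("value")
        if name is not None:
            props[name.strip()] = (value or "").strip()
    return props


def parse_java_properties(text):
    """Parse simple key=value lines, skipping blank lines and # or ! comments."""
    props = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line[0] in "#!":
            continue
        key, sep, value = line.partition("=")
        if sep:
            props[key.strip()] = value.strip()
    return props


def check_store_config(site_xml, gora_props):
    """Return a list of problems found; an empty list means the checks passed."""
    site = parse_site_properties(site_xml)
    gora = parse_java_properties(gora_props)
    problems = []
    if site.get("storage.data.store.class") != HBASE_STORE:
        problems.append("storage.data.store.class is not HBaseStore")
    if gora.get("gora.datastore.default") != HBASE_STORE:
        problems.append("gora.datastore.default is not HBaseStore")
    if not site.get("http.agent.name"):
        problems.append("http.agent.name is empty")
    return problems


def plugin_selected(includes, plugin_id):
    """True if plugin_id as a whole matches the plugin.includes expression."""
    return re.fullmatch(includes, plugin_id) is not None
```

To use it, read conf/nutch-site.xml and conf/gora.properties as text (adjust the paths to your checkout) and pass both strings to `check_store_config`. `plugin_selected` is handy for spotting a missing `|` in plugin.includes: if two plugin names run together, neither of them matches on its own.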