20130518--开源搜索引擎 zettair 安装 (0)

zettair快速安装

20130518写

0 安装环境

centos6.0

intel i5

1官网安装指导(见文章尾)

http://www.seg.rmit.edu.au/zettair/quick_start.html

2 bug&fix

1, 找不到g++
     ===fix=== yum install g++
     2, src/libtextcodec/gunzipfilter.c:34:18: error: zlib.h: No such file or directory
     ===fix=== yum install zlib-devel.i686
     3, /home/wyk/work/zettair-0.9.3/.libs/libzet.so: undefined reference to `inflateEnd'    等其他reference
     ===fix=== Makefile中在LIBS=-lm 中加入-lz,  LIBS=-lm -lz 

附录:官方安装指导
  1. Download Zettair as a zip file.
  2. Change into the directory where you've saved Zettair and unzip it:
    $ cd ~
    $ unzip zettair-0.9.3.zip
    

  3. Download this zipped collection (40,872 bytes) of HTML documents (which are part of the HTML 4.01 standard at http://www.w3.org/TR/html4/)
  4. Change into the directory where you've saved the collection and unzip it:
    $ cd ~
    $ unzip html.zip
    Archive:  html.zip
      inflating: collection/about.html
      inflating: collection/charset.html
      inflating: collection/conform.html
    ...
    

  5. Make and install the Zettair software:
    $ cd zettair-0.9.3
    $ ./configure --prefix=$HOME/local/zettair-0.9.3
    $ make
    $ make install
    

  6. Build an index on the files in the collection:
    $ mkdir ~/index
    $ cd ~/index
    $ find ~/collection/* | ~/local/zettair-0.9.3/bin/zet -i 
    zettair version 0.9.3
    created new index 'index'
    sources (type html): collection/about.html collection/charset.html 
    collection/conform.html collection/cover.html collection/references.html 
    collection/types.html 
    parsing collection/about.html...
    parsing collection/charset.html...
    parsing collection/conform.html...
    parsing collection/cover.html...
    parsing collection/references.html...
    parsing collection/types.html...
    merging...
    
    summary: 6 documents, 2049 distinct index terms, 0 10541 terms
    

    A Unix note: the command find ~/collection/* lists all files in the directory ~/collection, and this is piped as input into the Zettair index construction process. The result is that Zettair indexes all files in the directory. This command does the same thing:

    $ ~/local/zettair-0.9.3/bin/zet -i -c ../config/parser_settings.html -t 
    HTML collection/about.html collection/charset.html collection/conform.html 
    collection/cover.html collection/references.html collection/types.html
    

  7. Search the collection:
    $ ~/local/zettair-0.9.3/bin/zet
    > Tim Berners-Lee
    1. file:///collection/about.html (score 2.455709, docid 0)
    2. file:///collection/references.html (score 1.087303, docid 4)
    
    2 results of 2 shown (took 0.001164 seconds)
    > tags
    1. file:///collection/conform.html (score 0.952401, docid 2)
    2. file:///collection/references.html (score 0.664334, docid 4)
    
    2 results of 2 shown (took 0.000962 seconds)
    

  8. Enjoy!



评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值