sphinx+scws

系统:ubutu 14.06

一.sphinx

sudo apt-get install apache2
apt-get install php5 libapache2-mod-php5

apt-get install mysql-server

apt-get install php5-mysql php5-curl php5-gd

apt-get install sphinxsearch

apt-get install libsphinxclient-dev

apt-get install php-pear php5-dev

sudo pecl install sphinx

echo "extension=sphinx.so" | sudo tee /etc/php5/mods-available/sphinx.ini

sudo ln -s /etc/php5/mods-available/sphinx.ini /etc/php5/apache2/conf.d/20-sphinx.ini

sudo ln -s /etc/php5/mods-available/sphinx.ini /etc/php5/cli/conf.d/20-sphinx.ini

二.scws

mkdir /root/install

cd /root/install

wget -c http://www.xunsearch.com/scws/down/scws-1.2.3.tar.bz2

tar jxvf scws-1.2.3.tar.bz2

cd scws-1.2.3

./configure --prefix=/usr/local/scws

make && make install

词库
cd /usr/local/scws/etc

wget http://www.xunsearch.com/scws/down/scws-dict-chs-gbk.tar.bz2

wget http://www.xunsearch.com/scws/down/scws-dict-chs-utf8.tar.bz2

tar xvjf scws-dict-chs-gbk.tar.bz2

tar xvjf scws-dict-chs-utf8.tar.bz2

检查
ls -al /usr/local/scws/lib/libscws.la

/usr/local/scws/bin/scws -h

cd /usr/local/scws/bin

./scws -i ../etc/test.txt -o ../etc/out.txt -r ../etc/rules.utf8.ini -d ../etc/dict.utf8.xdb -c utf8

提示
+--[scws(scws-cli/1.2.3)]----------+
| TextLen:   15                  |
| Prepare:   0.0048    (sec)     |
| Segment:   0.0001    (sec)     |
+--------------------------------+


php扩展

cd /root/install/scws-1.2.3/phpext/

phpize

./configure --with-scws=/usr/local/scws

make

make install

touch /etc/php5/mods-available/scws.ini

vi /etc/php5/mods-available/scws.ini
添加
extension = /usr/lib/php5/20121212(版本日期路径,这个可能不同)/scws.so
/usr/lib/php5/20121212/
scws.default.charset = utf8
scws.default.fpath = /usr/local/scws/etc


sudo ln -s /etc/php5/mods-available/scws.ini /etc/php5/apache2/conf.d/20-scws.ini

sudo ln -s /etc/php5/mods-available/scws.ini /etc/php5/cli/conf.d/20-scws.ini


phpinfo()显示

/etc/php5/apache2/conf.d/20-scws.ini

scws
SCWS support    Enabled
SCWS Description     Simple Chinese Words Segmentation
PECL Module version     0.2.3
SCWS Library     1.2.3
SCWS BugReport     http://www.xunsearch.com/scws

Directive    Local Value    Master Value
scws.default.charset    utf8    utf8
scws.default.fpath    /usr/local/scws/etc    /usr/local/scws/etc

php -m | grep scws
显示红色 scws(一定要有,如果没有要排查原因)

cd /root/install/scws-1.2.3/phpext

php scws_test.php 通过


Test [1] ... FAILURE!
----------------------------------------
ORGINAL TEXT: 大家好,我是马明练
EXPECTED RESULT: 大家 好 , 我 是 马明练
ACTUAL RESULT: 大家 好 , 我 是 马 明练
========================================
Test [2] ... PASS!
Test [3] ... PASS!
Test [4] ... PASS!
Test [5] ... PASS!
Test [6] ... PASS!
Test [7] ... PASS!
Test [8] ... PASS!
Test [9] ... PASS!
Test [10] ... PASS!
Test [11] ... PASS!
Test [12] ... PASS!
Test [13] ... PASS!
Test [14] ... PASS!
Test [15] ... FAILURE!
----------------------------------------
ORGINAL TEXT: 李姚明
EXPECTED RESULT: 李姚明
ACTUAL RESULT: 李 姚明
========================================
Test [16] ... PASS!
// -------------------------------------
// TEST result report
// SCWS (Module version:0.2.3, Library version:1.2.3) - by hightman
// -------------------------------------
// Total test: 16
// Passed Num: 14 (87.50%)
// Failed Num: 2 (12.50%)

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值