系统:ubutu 14.06
一.sphinx
sudo apt-get install apache2
apt-get install php5 libapache2-mod-php5
apt-get install mysql-server
apt-get install php5-mysql php5-curl php5-gd
apt-get install sphinxsearch
apt-get install libsphinxclient-dev
apt-get install php-pear php5-dev
sudo pecl install sphinx
echo "extension=sphinx.so" | sudo tee /etc/php5/mods-available/sphinx.ini
sudo ln -s /etc/php5/mods-available/sphinx.ini /etc/php5/apache2/conf.d/20-sphinx.ini
sudo ln -s /etc/php5/mods-available/sphinx.ini /etc/php5/cli/conf.d/20-sphinx.ini
二.scws
mkdir /root/install
cd /root/install
wget -c http://www.xunsearch.com/scws/down/scws-1.2.3.tar.bz2
tar jxvf scws-1.2.3.tar.bz2
cd scws-1.2.3
./configure --prefix=/usr/local/scws
make && make install
词库
cd /usr/local/scws/etc
wget http://www.xunsearch.com/scws/down/scws-dict-chs-gbk.tar.bz2
wget http://www.xunsearch.com/scws/down/scws-dict-chs-utf8.tar.bz2
tar xvjf scws-dict-chs-gbk.tar.bz2
tar xvjf scws-dict-chs-utf8.tar.bz2
检查
ls -al /usr/local/scws/lib/libscws.la
/usr/local/scws/bin/scws -h
cd /usr/local/scws/bin
./scws -i ../etc/test.txt -o ../etc/out.txt -r ../etc/rules.utf8.ini -d ../etc/dict.utf8.xdb -c utf8
提示
+--[scws(scws-cli/1.2.3)]----------+
| TextLen: 15 |
| Prepare: 0.0048 (sec) |
| Segment: 0.0001 (sec) |
+--------------------------------+
php扩展
cd /root/install/scws-1.2.3/phpext/
phpize
./configure --with-scws=/usr/local/scws
make
make install
touch /etc/php5/mods-available/scws.ini
vi /etc/php5/mods-available/scws.ini
添加
extension = /usr/lib/php5/20121212(版本日期路径,这个可能不同)/scws.so
/usr/lib/php5/20121212/
scws.default.charset = utf8
scws.default.fpath = /usr/local/scws/etc
sudo ln -s /etc/php5/mods-available/scws.ini /etc/php5/apache2/conf.d/20-scws.ini
sudo ln -s /etc/php5/mods-available/scws.ini /etc/php5/cli/conf.d/20-scws.ini
phpinfo()显示
/etc/php5/apache2/conf.d/20-scws.ini
scws
SCWS support Enabled
SCWS Description Simple Chinese Words Segmentation
PECL Module version 0.2.3
SCWS Library 1.2.3
SCWS BugReport http://www.xunsearch.com/scws
Directive Local Value Master Value
scws.default.charset utf8 utf8
scws.default.fpath /usr/local/scws/etc /usr/local/scws/etc
php -m | grep scws
显示红色 scws(一定要有,如果没有要排查原因)
cd /root/install/scws-1.2.3/phpext
php scws_test.php 通过
Test [1] ... FAILURE!
----------------------------------------
ORGINAL TEXT: 大家好,我是马明练
EXPECTED RESULT: 大家 好 , 我 是 马明练
ACTUAL RESULT: 大家 好 , 我 是 马 明练
========================================
Test [2] ... PASS!
Test [3] ... PASS!
Test [4] ... PASS!
Test [5] ... PASS!
Test [6] ... PASS!
Test [7] ... PASS!
Test [8] ... PASS!
Test [9] ... PASS!
Test [10] ... PASS!
Test [11] ... PASS!
Test [12] ... PASS!
Test [13] ... PASS!
Test [14] ... PASS!
Test [15] ... FAILURE!
----------------------------------------
ORGINAL TEXT: 李姚明
EXPECTED RESULT: 李姚明
ACTUAL RESULT: 李 姚明
========================================
Test [16] ... PASS!
// -------------------------------------
// TEST result report
// SCWS (Module version:0.2.3, Library version:1.2.3) - by hightman
// -------------------------------------
// Total test: 16
// Passed Num: 14 (87.50%)
// Failed Num: 2 (12.50%)