Installing word2vec on CentOS: a Linux tutorial

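Google's word2vec project needs no introduction here, so this post goes straight to the practical part: how to install word2vec on CentOS. The project home page says very little about installing word2vec on specific Linux distributions and gives no detailed installation walkthrough.

The word2vec project site offers only these few short notes on getting started: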

Quick start

Download the code: svn checkout http://word2vec.googlecode.com/svn/trunk/

Run ‘make’ to compile word2vec tool

Run the demo scripts: ./demo-word.sh and ./demo-phrases.sh

For questions about the toolkit, see http://groups.google.com/group/word2vec-toolkit

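The detailed steps for installing word2vec on Linux follow. As noted above, everything here was done on CentOS.

1. Make sure gcc is installed and at the latest version the repositories provide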

     yum install gcc

2. Install subversion, which is used to fetch the word2vec source

    yum install subversion
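
Before moving on, it can help to confirm that the required tools are actually on the PATH. The sketch below is only an illustration: `missing_tools` is a hypothetical helper name, and including `make` in the list is an assumption, since the steps here install only gcc and subversion and take make for granted.

```shell
#!/bin/sh
# Print the name of every command in the argument list that is not on PATH.
missing_tools() {
    for tool in "$@"; do
        command -v "$tool" >/dev/null 2>&1 || echo "$tool"
    done
}

# gcc, make and svn are the commands the following steps rely on.
missing=$(missing_tools gcc make svn)
if [ -z "$missing" ]; then
    echo "all build tools found"
else
    echo "still missing:" $missing
fi
```

Anything reported as missing can be installed with yum in the same way as gcc and subversion above.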

3. Pick any directory as the word2vec working directory and fetch the source there with svn

    svn checkout http://word2vec.googlecode.com/svn/trunk/

4. Change into the trunk directory created by the checkout and build with make

    cd trunk
    make

The build stops with the following error:

gcc word2vec.c -o word2vec -lm -pthread -Ofast -march=native -Wall -funroll-loops -Wno-unused-result

cc1: error: invalid option argument ‘-Ofast’

cc1: warning: unrecognized command line option "-Wno-unused-result"

make: *** [word2vec] Error 1

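This happens because the system's gcc does not support the -Ofast option. The flag was introduced in GCC 4.6, so older compilers reject it. The -Wno-unused-result message is only a warning and does not stop the build by itself.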
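
Whether a given compiler accepts -Ofast can be checked without running the whole build. The following is a minimal sketch, not part of word2vec: `gcc_accepts` is a hypothetical helper name, and the `CC` environment variable is assumed to hold a single command name (it defaults to gcc).

```shell
#!/bin/sh
# Return success if the compiler in $CC (default: gcc) accepts the given flag.
# -fsyntax-only parses a tiny program from stdin without writing any output file.
# A missing compiler is also reported as a failure.
gcc_accepts() {
    echo 'int main(void) { return 0; }' |
        "${CC:-gcc}" "$1" -fsyntax-only -x c - >/dev/null 2>&1
}

if gcc_accepts -Ofast; then
    echo "-Ofast is accepted"
else
    echo "-Ofast is rejected by this compiler"
fi
```

Running `gcc --version` alongside this shows which gcc release is installed.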

5. Edit the makefile and change the gcc options

   vi ./makefile

The makefile that ships with word2vec reads:

CC = gcc
#The -Ofast might not work with older versions of gcc; in that case, use -O2
CFLAGS = -lm -pthread -Ofast -march=native -Wall -funroll-loops -Wno-unused-result

all: word2vec word2phrase distance word-analogy compute-accuracy

word2vec : word2vec.c
	$(CC) word2vec.c -o word2vec $(CFLAGS)
word2phrase : word2phrase.c
	$(CC) word2phrase.c -o word2phrase $(CFLAGS)
distance : distance.c
	$(CC) distance.c -o distance $(CFLAGS)
word-analogy : word-analogy.c
	$(CC) word-analogy.c -o word-analogy $(CFLAGS)
compute-accuracy : compute-accuracy.c
	$(CC) compute-accuracy.c -o compute-accuracy $(CFLAGS)
	chmod +x *.sh

clean:
	rm -rf word2vec word2phrase distance word-analogy compute-accuracy

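Look closely at the second line:

#The -Ofast might not work with older versions of gcc; in that case, use -O2

(The character after the dash is the capital letter O, not the digit 0.)

The Google engineers left a note here telling developers that the -Ofast option may not be supported by older versions of gcc, and that in that case -Ofast should be replaced with -O2.

So change the third line from

CFLAGS = -lm -pthread -Ofast -march=native -Wall -funroll-loops -Wno-unused-result

to

CFLAGS = -lm -pthread -O2 -march=native -Wall -funroll-loops -Wno-unused-result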
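
The same edit can be made without opening an editor. This is a sketch under assumptions: `use_o2` is a hypothetical helper name, and it relies on the `-i` (in-place) option of the GNU sed that CentOS ships. The substitution is limited to the line starting with CFLAGS, so the comment on the second line is left alone.

```shell
#!/bin/sh
# Replace -Ofast with -O2 on the CFLAGS line of the given makefile
# (default: ./makefile), keeping a backup copy with a .bak suffix.
use_o2() {
    sed -i.bak '/^CFLAGS/s/-Ofast/-O2/' "${1:-makefile}"
}

# Only touch the file if it is present in the current directory.
if [ -f makefile ]; then
    use_o2 makefile && grep '^CFLAGS' makefile
fi
```

Another option that leaves the file untouched is to pass the flags on the command line, for example `make CFLAGS="-lm -pthread -O2 -march=native -Wall -funroll-loops -Wno-unused-result"`, since variables given on the make command line take precedence over assignments inside the makefile.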

6. Run make again; this time the build goes through

 make

The output of the successful build:

gcc word2vec.c -o word2vec -lm -pthread -O2 -march=native -Wall -funroll-loops -Wno-unused-result
gcc word2phrase.c -o word2phrase -lm -pthread -O2 -march=native -Wall -funroll-loops -Wno-unused-result
gcc distance.c -o distance -lm -pthread -O2 -march=native -Wall -funroll-loops -Wno-unused-result
gcc word-analogy.c -o word-analogy -lm -pthread -O2 -march=native -Wall -funroll-loops -Wno-unused-result
gcc compute-accuracy.c -o compute-accuracy -lm -pthread -O2 -march=native -Wall -funroll-loops -Wno-unused-result
chmod +x *.sh
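
To double-check the result, one can confirm that the five programs named in the makefile's `all` target now exist and are executable. A minimal sketch, where `unbuilt_tools` is a hypothetical helper name and the directory argument is assumed to be the checked-out source directory:

```shell
#!/bin/sh
# List any of the five word2vec tools that are missing or not executable
# in the given directory (default: current directory).
unbuilt_tools() {
    dir=${1:-.}
    for tool in word2vec word2phrase distance word-analogy compute-accuracy; do
        [ -x "$dir/$tool" ] || echo "$tool"
    done
}

unbuilt=$(unbuilt_tools .)
if [ -z "$unbuilt" ]; then
    echo "all five tools were built"
else
    echo "not built:" $unbuilt
fi
```

Once all five are present, the demo scripts from the quick start (./demo-word.sh and ./demo-phrases.sh) can be run from the same directory.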

That concludes this tutorial on installing word2vec on CentOS. The steps above were tried first-hand and the build succeeded.

