----------------------------------------------------------------------
The Records of Running Moses
University of Macau
2010-1-27
----------------------------------------------------------------------
这是统计机器翻译系统Moses运行的记录过程,首先第一部分给出运行所需要的准备文件,第二部分给出安装的过程和步骤,第三部分给出构建语言模型和翻译过程,第四部分给出翻译的结果和评测。首先这是第一部分的内容:Moses运行前的准备:
The purpose of this guide is to offer a step-by step example of downloading, compiling, and running the Moses decoder and related support tools. What I want to do is to record the steps in order to use some day. Here, I make no claims that all of the steps here will work perfectly on every machine you try it on, or that things will stay the same as the software changes. Please remember that Moses is research software under active development.
In this record, there are six parts to introduce the process .The structure of the records is like below:
1. Structure of this passage
2. Working Environment and dealing with corpus
PART I - Download Tools and Data
In this part, I download some tools that will be needed, including translation tools GIZA++ and MKCLS,
language model SRILM or IRSTLM; deal with language sentences tools SCRIPTS, evaluation tools NIST and
BLEU. I also give the URL where to download them.
PART II – Support Tools Installation
In this part, I introduce how to install the tools. Honestly speaking, the installation process is complex, for it
contains a lot of folders that even a test folder is included. At first run this program, you may be confuse about it!
The most complex is that it needs many additional tools to support the installation. So be careful to install all the
extra tools first.
PART III – Build language model
In this part, I build a language model using C