- text tokenization, including deep semantic features like parse trees
- inverted and forward indexes with compression and various caching strategies
- a collection of ranking functions for searching the indexes
- topic models
- classification algorithms
- graph algorithms
- language models
- CRF implementation (POS-tagging, shallow parsing)
- wrappers for liblinear and libsvm (including libsvm dataset parsers)
- UTF8 support for analysis on various languages
- multithreaded algorithms
项目主页
Ubuntu 14.04 LTS Build Guide
Ubuntu 14.04 has a recent enough GCC for building MeTA, but we’ll need to add a ppa for a more recent version of CMake.
Start by running the following commands to install the dependencies for MeTA.
# this might take a while
sudo apt-get update
sudo apt-get install software-properties-common
# add the ppa for cmake
sudo add-apt-repository ppa:george-edison55/cmake-3.x
sudo apt-get update
# install dependencies
sudo apt-get install cmake libicu-dev git
Once the dependencies are all installed, you should double check your versions by running the following commands.
g++ --version
should output
g++ (Ubuntu 4.8.2-19ubuntu1) 4.8.2
Copyright (C) 2013 Free Software Foundation, Inc.
This is free software; see the source for copying conditions. There is NO
warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.
and
cmake --version
should output
cmake version 3.1.1
CMake suite maintained and supported by Kitware (kitware.com/cmake).
Once the dependencies are all installed, you should be ready to build. Run the following commands to get started:
# clone the project
git clone https://github.com/meta-toolkit/meta.git
cd meta/
# set up submodules
git submodule update --init --recursive
# set up a build directory
mkdir build
cd build
cp ../config.toml .
# configure and build the project
cmake ../ -DCMAKE_BUILD_TYPE=Release
make
You can now test the system by running the following command:
ctest --output-on-failure
If everything passes, congratulations! MeTA seems to be working on your system.