https://github.com/wilkeraziz/mosesdecoder
Raw File
Tip revision: b82ef874f4c58d2cd562672a13aa894fdb9a29a6 authored by Ales Tamchyna on 06 December 2012, 11:41:34 UTC
Merge branch 'jointlm_improvements' of github.com:moses-smt/mosesdecoder into factored_hacks
Tip revision: b82ef87
BUILD-INSTRUCTIONS.txt
PRELIMINARIES

Moses is primarily targeted at gcc on UNIX.  

Moses requires gcc, Boost >= 1.36, and zlib including the headers that some
distributions package separately (i.e. -dev or -devel packages).  Source is
available at http://boost.org .

There are several optional dependencies:

GIZA++ from http://code.google.com/p/giza-pp/ is used to align words in the parallel corpus during training.

Moses server requires xmlrpc-c with abyss-server.  Source is available from
http://xmlrpc-c.sourceforge.net/.  

The scripts support building ARPA format language models with SRILM or IRSTLM.
To apply models inside the decoder, you can use SRILM, IRSTLM, or KenLM.  The
ARPA format is exchangable so that e.g. you can build a model with SRILM and
run the decoder with IRSTLM or KenLM.

If you want to use SRILM, you will need to download its source and build it.  
The SRILM can be downloaded from 
http://www.speech.sri.com/projects/srilm/download.html .
On x86_64, the default machine type is broken.  Edit sbin/machine-type, find
this code
    else if (`uname -m` == x86_64) then
        set MACHINE_TYPE = i686
and change it to
    else if (`uname -m` == x86_64) then
        set MACHINE_TYPE = i686-m64
You may have to chmod +w sbin/machine-type first.  

If you want to use IRSTLM, you will need to download its source and build it.  
The IRSTLM can be downloaded from either the SourceForge website
http://sourceforge.net/projects/irstlm
or the official IRSTLM website
http://hlt.fbk.eu/en/irstlm

KenLM is included with Moses.  

--------------------------------------------------------------------------

ADVICE ON INSTALLING EXTERNAL LIBRARIES

Generally, for trouble installing external libraries, you should get support
directly from the library maker:

Boost: http://www.boost.org/doc/libs/1_48_0/more/getting_started/unix-variants.html
IRSTLM: https://list.fbk.eu/sympa/subscribe/user-irstlm
SRILM: http://www.speech.sri.com/projects/srilm/#srilm-user

However, here's some general advice on installing software (for bash users):

#Determine where you want to install packages
PREFIX=$HOME/usr
#If your system has lib64 directories, lib64 should be used AND NOT lib  
if [ -d /lib64 ]; then
  LIBDIR=$PREFIX/lib64
else
  LIBDIR=$PREFIX/lib
fi
#If you're installing to a non-standard path, tell programs where to find things:
export PATH=$PREFIX/bin${PATH:+:$PATH}
export LD_LIBRARY_PATH=$LIBDIR${LD_LIBRARY_PATH:+:$LD_LIBRARY_PATH}
export LIBRARY_PATH=$LIBDIR${LIBRARY_PATH:+:$LIBRARY_PATH}
export CPATH=$PREFIX/include${CPATH:+:$CPATH}

Add all the above code to your .bashrc or .bash_login as appropriate.  Then
you're ready to install packages in non-standard paths:

#For autotools packages e.g. xmlrpc-c
./configure --prefix=$PREFIX --libdir=$LIBDIR [other options here]

#For tcmalloc (we only need the minimal version)
./configure --prefix=$PREFIX --libdir=$LIBDIR --enable-shared --enable-static --enable-minimal

#For Boost:
./bootstrap.sh
./b2 --prefix=$PREFIX --libdir=$LIBDIR --layout=tagged link=static,shared threading=multi,single install

--------------------------------------------------------------------------

BUILDING

Building consists of running
  ./bjam [options]

Common options are:
--with-srilm=/path/to/srilm to compile the decoder with SRILM support
--with-irstlm=/path/to/irstlm to compile the decoder with IRSTLM support
-jN where N is the number of CPUs

--with-macports=/path/to/macports use MacPorts on Mac OS X.

If you leave out /path/to/macports bjam will use the /opt/local as default.
You don't have to use --with-boost with-macports as it is implicitly set.
Also note that using --with-macports automatically triggers "using darwin".

Binaries will appear in dist/bin.  

You can clean up data from previous builds using
  ./bjam --clean

For further documentation, run 
  ./bjam --help

--------------------------------------------------------------------------

ALTERNATIVE WAYS TO BUILD ON UNIX AND OTHER PLATFORMS

Microsoft Windows
-----------------
Moses is primarily targeted at gcc on UNIX.  Windows users should consult
http://ssli.ee.washington.edu/people/amittai/Moses-on-Win7.pdf .  

Binaries for all external libraries needed can be downloaded from 
	http://www.statmt.org/moses/?n=Moses.LibrariesUsed

Only the decoder is developed and tested under Windows. There are difficulties
using the training scripts under Windows, even with Cygwin.
back to top