GenerationModel
This page is under development. Its objective is to show how a trigram model for generation ranking can be built. The first part partly overlaps with DelphinTools and RedwoodsTop; the second part describes a procedure devised by ErikVelldal.
The procedure requires three tools:
- The CMU Toolkit ([http://mi.eng.cam.ac.uk/~prc14/toolkit.html]),
- SMT QuickRun ([http://ufal.ms.mff.cuni.cz/~curin/SMT_QuickRun/]), and
The installation of the CMU Toolkit may conflict with Logon, so you may need to temporarily comment out the Logon settings in the .bashrc file while the installation is in progress.
TSDBHOME=$LOGONROOT/lingo/lkb/src/tsdb/home
export PATH=$LOGONROOT/lingo/lkb/src/tsdb/home:$PATH
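The two settings above rely on LOGONROOT already being defined. As a minimal sketch, the check below confirms that the variable is set and that the tsdb home directory exists before any further commands are run. The function name `check_tsdb_env` is hypothetical and not part of Logon; only the directory path is taken from the settings above.

```shell
# Hypothetical helper (not part of Logon): verify the environment
# that the TSDBHOME and PATH settings above depend on.
check_tsdb_env() {
    # LOGONROOT must be set and non-empty.
    if [ -z "${LOGONROOT:-}" ]; then
        echo "LOGONROOT is not set" >&2
        return 1
    fi
    # The directory used for TSDBHOME must exist.
    if [ ! -d "$LOGONROOT/lingo/lkb/src/tsdb/home" ]; then
        echo "missing directory: $LOGONROOT/lingo/lkb/src/tsdb/home" >&2
        return 2
    fi
    echo "ok"
}
```

Running `check_tsdb_env` in a fresh shell after re-enabling the Logon settings in .bashrc is one way to see whether the environment has been restored.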
This step is needed only if you do not already have a profile. It shows how to create a profile from the object sentences of a bilingual corpus file (see also DelphinTools).
logon_do --bitext PATH/TO/BILINGUAL/CORPUS/FILE $TSDBHOME/bitxt
logon_do --count 2 --limit 5 --grammar jaen --task omrs $TSDBHOME/bitxt
$TSDBHOME/export --binary --condition "result-id=0" --format triples bitxt/omrs