https://github.com/vsiivola/variKN
History
Tip revision: c525568ed62fddfb0351946efe049c4e9ead9ddf authored by Vesa Siivola on 16 January 2014, 08:27:59 UTC
Implement leave-one-out estimates for the discounts. If optimization corpus is not set, use these estimates. Also, initialize numerical search for the parameters with these values. In the latter case, preliminary tests seem to indicate that better accuracy is reached than with the original heuristic search start point.
Tip revision: c525568
File Mode Size
CMakeLists.txt -rw-r--r-- 232 bytes
add_zeroprob_grams.cc -rw-r--r-- 1.0 KB
arpa2arpa.cc -rw-r--r-- 876 bytes
arpa2bin.cc -rw-r--r-- 680 bytes
arpasize.cc -rw-r--r-- 3.3 KB
bin2arpa.cc -rw-r--r-- 637 bytes
check_model.cc -rw-r--r-- 3.0 KB
counts2kn.cc -rw-r--r-- 8.3 KB
perplexity.cc -rw-r--r-- 5.3 KB
varigram_kn.cc -rw-r--r-- 5.6 KB

back to top