https://github.com/sigmorphon/conll2017
Tip revision: 519bb1677a7c747b3863d37069ffd395062288d8 authored by Ryan Cotterell on 27 June 2017, 04:32:31 UTC
added answers
README
The official evaluation script lives in this directory. We have provided sample output from the baseline model on the development data in sample-output/. You may run the evaluation script on this sample output as shown in the examples below.
Task 1 Evaluation:
python evalm.py --guess sample-output/task1/persian-medium-out --gold ../all/task1/persian-dev --task 1
accuracy: 66.20
levenshtein: 1.03
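The two Task 1 metrics are per-form exact-match accuracy (as a percentage) and the average Levenshtein distance between each guessed form and its gold form. A minimal sketch of how such metrics could be computed, assuming parallel lists of guessed and gold forms (the authoritative definitions are in evalm.py; the `evaluate` helper below is hypothetical):

```python
def levenshtein(a, b):
    # Standard dynamic-programming edit distance between two strings.
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        curr = [i]
        for j, cb in enumerate(b, 1):
            curr.append(min(prev[j] + 1,                # deletion
                            curr[j - 1] + 1,            # insertion
                            prev[j - 1] + (ca != cb)))  # substitution
        prev = curr
    return prev[-1]

def evaluate(guesses, golds):
    # guesses, golds: parallel lists of inflected word forms.
    # Returns (accuracy as a percentage, mean Levenshtein distance).
    n = len(golds)
    correct = sum(g == t for g, t in zip(guesses, golds))
    dist = sum(levenshtein(g, t) for g, t in zip(guesses, golds))
    return 100.0 * correct / n, dist / n
```

A correct guess contributes distance 0, so a low average Levenshtein score can still accompany imperfect accuracy when errors are close to the target.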
Task 2 Evaluation:
python evalm.py --guess sample-output/task2/albanian-high-out --gold ../all/task2/albanian-uncovered-dev --gold_input ../all/task2/albanian-covered-dev --task 2
accuracy: 94.23
levenshtein: 0.12
paradigm: 76.00
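Task 2 additionally reports paradigm accuracy: the fraction of paradigms in which every guessed form matches the gold. A sketch of the idea, assuming (lemma, guess, gold) triples (the `paradigm_accuracy` helper and the input shape are illustrative assumptions, not the evalm.py interface):

```python
from collections import defaultdict

def paradigm_accuracy(rows):
    # rows: iterable of (lemma, guessed_form, gold_form) triples.
    # A paradigm is scored correct only if all of its forms match.
    by_lemma = defaultdict(list)
    for lemma, guess, gold in rows:
        by_lemma[lemma].append(guess == gold)
    correct = sum(all(matches) for matches in by_lemma.values())
    return 100.0 * correct / len(by_lemma)
```

This is why paradigm accuracy (76.00 above) is always at most the per-form accuracy (94.23): a single wrong cell makes the whole paradigm count as incorrect.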