https://github.com/sordonia/hed-qs
Raw File
Tip revision: b217a94387eca1e37975d7dd770d94981ec7eac1 authored by Alessandro Sordoni on 07 April 2017, 14:07:22 UTC
Update README.md
Tip revision: b217a94
README.md
# Hierarchical Recurrent Encoder-Decoder code (HRED) for Query Suggestion.

This code accompanies the paper:

"A Hierarchical Recurrent Encoder-Decoder For Generative Context-Aware Query Suggestion", by Alessandro Sordoni, Yoshua Bengio, Hossein Vahabi, Christina Lioma, Jakob G. Simonsen, Jian-Yun Nie, to appear in CIKM'15.

The pre-print of the paper is available at: http://arxiv.org/abs/1507.02221.

-- Data processing

The dataset must consist in two files:

data.ses: each line is a sequence of tab-separated strings (queries). Each line represents a query session.
data.rnk: each line is a sequence of tab-separated integers (not currently used in the model, can be set to a tab-separated list of 0).

Basically, the .rnk file is not used by the model but it contains the rank of the clicked documents for each of the queries.

./convert-text2dict.py data

This will create the preprocessed dataset for training.

-- Training

Create a prototype by modifying state.py and launch:

python train.py --prototype your_prototype
back to top