Revision - ac48403 - Better default hyper-parameters for vae_vanilla example - origin: https://github.com/casperkaae/parmesan

visit type:

https://github.com/casperkaae/parmesan

11 April 2020, 13:53:59 UTC

Revision ac48403bed22081758cc905c57145217739942c5 authored by wuaalb on 16 October 2015, 11:07:55 UTC, committed by wuaalb on 16 October 2015, 11:07:55 UTC

Better default hyper-parameters for vae_vanilla example

With previous parameters the test ELBO would start going up after
approx. 60 epochs.
Decreased learning rate and number of hidden units in deterministic
layers of encoder/decoder.

Set `analytic_kl_term=True` by default as it seems to improve results
and is what the Kingma et al. paper does in its examples.

Changed non-linearity `softplus` for deterministic hidden layers. Also
tried `tanh` and `very_leaky_rectify`, but this seemed to perform best.

Results with these settings
```
*Epoch: 999 Time: 9.03 LR: 0.00030 LL Train: -90.331 LL test: -93.592
```

1 parent a2c1086

Files
Changes

Permalinks

Tip revision: ac48403bed22081758cc905c57145217739942c5 authored by wuaalb on 16 October 2015, 11:07:55 UTC
Better default hyper-parameters for vae_vanilla example

Tip revision: ac48403

File	Mode	Size
examples
misc
parmesan
.gitignore	-rw-r--r--	129 bytes
LICENSE	-rwxr-xr-x	1.4 KB
MANIFEST.in	-rwxr-xr-x	139 bytes
README.rst	-rwxr-xr-x	3.2 KB
requirements-dev.txt	-rwxr-xr-x	121 bytes
requirements.txt	-rwxr-xr-x	114 bytes
setup.py	-rwxr-xr-x	1.5 KB

Showing with 0 additions and 0 deletions (0 / 0 diffs computed)

Computing file changes ...

https://github.com/casperkaae/parmesan

Better default hyper-parameters for vae_vanilla example

README.rst