Skip to content

Latest commit

 

History

History
21 lines (16 loc) · 879 Bytes

README.md

File metadata and controls

21 lines (16 loc) · 879 Bytes

Neural Turing Machines

Attempt at implementing system described in "Neural Turing Machines." by Alex Graves, Greg Wayne, and Ivo Danihelka.

This is still very much a work in progress:

  • Only the copy task has been implemented.
  • Cost does not consistently converge.
  • The structure of the controller is merely speculation on my part, the paper does not provide the details.

I've documented some of my adventures on my blog, you can check out the articles on this particular project here.

There's been some discussion going on in the Reddit thread on /r/machinelearning. You can check that out here.

Usage

python train_copy.py [output_model_file]