LSTM #7

Open
txir-met opened this issue Mar 9, 2022 · 0 comments
Labels
documentation Improvements or additions to documentation

Comments

txir-met commented Mar 9, 2022

The LSTM code is based on this tutorial and this GitHub repo.

An attempt to bring the tutorial's libraries up to date did not work: the network trained more quickly but performed significantly worse, producing predictions with large white sections. The exact cause was not identified, so we kept the old library versions.

The network struggles with image sharpness, giving blurry predictions for both the MNIST dataset used in the tutorial and our own Cyclone data. On our own data, the output is usually a somewhat blurrier copy of the most recent input frame.
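One way to quantify the "blurrier copy of the most recent input" observation is to compare the network against a persistence baseline, which simply repeats the last input frame. The snippet below is a minimal sketch in plain NumPy, not the code used in this project; the function names and the `(batch, time, height, width)` array layout are assumptions made for illustration.

```python
import numpy as np


def persistence_forecast(inputs):
    """Baseline forecast: the next frame equals the most recent input frame.

    Assumes `inputs` has shape (batch, time, height, width).
    """
    return inputs[:, -1]


def mse(a, b):
    """Mean squared error over all pixels."""
    return float(np.mean((a - b) ** 2))


def skill_vs_persistence(inputs, target, prediction):
    """Return 1 - mse(model) / mse(persistence).

    Values near 1 mean the model is much better than copying the last frame;
    values near 0 (or negative) mean it is no better than persistence.
    """
    baseline = mse(persistence_forecast(inputs), target)
    model = mse(prediction, target)
    if baseline == 0.0:
        # Persistence is already perfect, so the ratio is undefined.
        return 0.0 if model == 0.0 else float("-inf")
    return 1.0 - model / baseline
```

A score close to zero on the Cyclone data would be consistent with the network mostly reproducing its last input.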

Moving from one-hourly to three-hourly steps does not increase the loss significantly. The loss fell below 0.005 even though the predictions remained blurry, so it may be worth trying a different or additional loss term in the future.
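As an illustration of what an additional loss term could look like, the sketch below adds a gradient-difference penalty to the mean squared error, so that blurred edges cost more than they do under a pixel-wise loss alone. This is an assumed example in plain NumPy and was not tried in this project; `gradient_difference`, `combined_loss`, and `edge_weight` are hypothetical names, and a training version would need to be rewritten with the tensor operations of whichever framework the network uses.

```python
import numpy as np


def gradient_difference(pred, target):
    """Mean absolute difference between the spatial gradients of two images.

    Works on arrays whose last two axes are (height, width).
    """
    # Finite differences along the vertical and horizontal axes.
    dy_pred, dy_true = np.diff(pred, axis=-2), np.diff(target, axis=-2)
    dx_pred, dx_true = np.diff(pred, axis=-1), np.diff(target, axis=-1)
    vertical = np.mean(np.abs(dy_pred - dy_true))
    horizontal = np.mean(np.abs(dx_pred - dx_true))
    return float(vertical + horizontal)


def combined_loss(pred, target, edge_weight=1.0):
    """Pixel-wise MSE plus a weighted edge (gradient-difference) penalty."""
    pixel_term = float(np.mean((pred - target) ** 2))
    return pixel_term + edge_weight * gradient_difference(pred, target)
```

The `edge_weight` value would need tuning; setting it to zero recovers the plain mean squared error.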

Overall, this approach did not look the most promising, so we did not investigate it in much greater depth, and we stuck with 64x64 pixels.

@txir-met txir-met added the documentation Improvements or additions to documentation label Mar 9, 2022