Feed-Forward Attention

PyTorch implementation of the Feed-Forward Attention Mechanism.

This is based on work by Colin Raffel and Daniel P. Ellis, Feed-Forward Networks with Attention Can Solve Some Long-Term Memory Problems (https://arxiv.org/abs/1512.08756)

Usage

Subclassing FFAttention

The main class has been implemented as the FFAttention class. You can subclass this to use in your own problems. The learning is done sequentially, with five methods forming the forward pass of the algorithm. You will need to implement some of them.

1 embedding (not implemented) : computes embeddings $$h_t$$ for element of sequence $$x_t$$.

In : torch.Size([batch_size, sequence_length, sequence_dim])
Out: torch.Size([batch_size, sequence_length, hidden_dimensions])

2 activation (not implemented) : computes the embedding activations $$e_t$$.

In : torch.Size([batch_size, sequence_length, hidden_dimensions])
Out: torch.Size([batch_size, sequence_length, 1])

3 attention (Already implemented) : computes the probabilities $$\alpha_t$$.

In : torch.Size([batch_size, sequence_length, 1])
Out: torch.Size([batch_size, sequence_length, 1])

4 context (Already implemented) : computes the context vector $c$.

In : torch.Size([batch_size, sequence_length, 1]), torch.Size([batch_size, sequence_length, sequence_dim])
Out: torch.Size([batch_size, 1, hidden_dimensions])

5 out (not implemented) : Feed-forward prediction.

In : torch.Size([batch_size, 1, hidden_dimensions])
Out: torch.Size([batch_size, 1, 1])

Note that you can reimplement or extend any of the other methods, for logging purposes for example.

Data

Typical pipelines include data processing steps. In the examples provided, several steps are separated. Loading and saving is done manually. Feel free to dump to pickle files pre-processed data and reload it.

Feed this data to the algorithm is done by implementing a custom PyTorch Dataset class. See the examples to see how to use this.

Data visualisation

Several plotting utilities are available in utils.py:

`plot_loss`

`plot_predictions`

`plot_error`

`plot_attention`

`plot_confusion`

`plot_context`

Attention log

A utility to track the algorithm's progress in provided in the form of a logger object. It is easy to extend this object with new fields. A standard architecture is provided here. Note that the plotting utilities do depend on the presence of some fields for some plots so it is recommended to extend the object not re-implement it.

Name		Name	Last commit message	Last commit date
Latest commit History 59 Commits
examples		examples
res/img		res/img
.gitignore		.gitignore
.gitmodules		.gitmodules
LICENSE.md		LICENSE.md
README.md		README.md
__init__.py		__init__.py
ff_attention.py		ff_attention.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Feed-Forward Attention

Usage

Subclassing FFAttention

Data

Data visualisation

`plot_loss`

`plot_predictions`

`plot_error`

`plot_attention`

`plot_confusion`

`plot_context`

Attention log

About

Releases

Packages

Languages

License

dtsbourg/ff-attention

Folders and files

Latest commit

History

Repository files navigation

Feed-Forward Attention

Usage

Subclassing FFAttention

Data

Data visualisation

plot_loss

plot_predictions

plot_error

plot_attention

plot_confusion

plot_context

Attention log

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

`plot_loss`

`plot_predictions`

`plot_error`

`plot_attention`

`plot_confusion`

`plot_context`

Packages