Implementation of Incremental Decoding #1582

barry-jin · 2021-11-30T18:56:18Z

Description

The implementation of incremental decoding in gluon-nlp is somewhat different from fairseq. In fairseq, the keys/values both before and after linear projection are memorialized, but in gluon-nlp, only the keys/values before the linear projection is memorialized. This difference leads to different execution number of FC operators (In fairseq, keys/values are directly pulled from prev_keys/prev_values; In gluon-nlp, two more linear projections are needed to get the projectioned keys/values). We may need to correct the gluon-nlp's implementation of incremental decoding.

barry-jin added the bug Something isn't working label Nov 30, 2021

barry-jin linked a pull request Mar 15, 2022 that will close this issue

[Decoding] Update incremental decoding implementation #1583

Draft

6 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implementation of Incremental Decoding #1582

Implementation of Incremental Decoding #1582

barry-jin commented Nov 30, 2021

Implementation of Incremental Decoding #1582

Implementation of Incremental Decoding #1582

Comments

barry-jin commented Nov 30, 2021

Description