Can you explain the algorithm the prefix_recognize used? #15

qzfnihao · 2021-01-12T08:32:02Z

In streamin_transformer.py, prefix_recognize looks like frame-synchronize decoding algorithm, and merges chunk decoding and trigger decoding。I try to search papers about chunk transformer and trigger attention, but not found! Can you show me the paper that introduced the algorithm?
I also have some questions about ctc prefix search in the code.
In line 662-664:
if l_plus not in hype:
Pb[l_plus] += lpz[i][0] + ...
Pb[l_plus] += lpz[i][c] * Pnb_prev[l_plus]

I doubt the line 664 should be:
Pnb[l_plus] += lpz[i][c] * Pnb_prev[l_plus]
This satisfies Algorithm 1 in paper: First-pass large vocabulary continuous speech recognition using bi-directional recurrent Dnns

cywang97 · 2021-01-15T08:15:50Z

Yes, you are right. I've fixed the bug in my repo now.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Can you explain the algorithm the prefix_recognize used? #15

Can you explain the algorithm the prefix_recognize used? #15

qzfnihao commented Jan 12, 2021

cywang97 commented Jan 15, 2021

Can you explain the algorithm the prefix_recognize used? #15

Can you explain the algorithm the prefix_recognize used? #15

Comments

qzfnihao commented Jan 12, 2021

cywang97 commented Jan 15, 2021