Crash in subtitle generation - IndexError: list index out of range #282
Is that when used with "--highlight_words"? Share the whole command used.
No, the only command args I use are --model, --language and file name. And yes, it is consistently reproducible on the file I use.
In fact, I see that some characters in the console output look like question marks (copied here as 很 or ル), which obviously do not occur in the audio and cannot occur in the selected language. Perhaps they break something when the output is written to a file?
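One way to test the stray-character hypothesis is to scan the transcribed text for characters from scripts that the selected language should never produce. The following is a minimal sketch, not part of Faster-Whisper-XXL: the function name `unexpected_chars` and the default list of script prefixes are assumptions chosen for illustration, and the check relies only on Python's standard `unicodedata` module.

```python
import unicodedata

def unexpected_chars(text, suspicious_prefixes=("CJK", "KATAKANA", "HIRAGANA", "HANGUL")):
    """Return (index, character, unicode_name) for each suspicious character.

    `suspicious_prefixes` is a hypothetical default; adjust it to the scripts
    that cannot legitimately occur in the language being transcribed.
    """
    hits = []
    for index, ch in enumerate(text):
        # unicodedata.name() returns "" here for characters without a name.
        name = unicodedata.name(ch, "")
        if name.startswith(suspicious_prefixes):
            hits.append((index, ch, name))
    return hits

# Example with the two characters quoted in the comment above.
sample = "some text 很 more text ル"
for index, ch, name in unexpected_chars(sample):
    print(index, ch, name)
```

Running this over the text of each transcribed segment would show whether the crash coincides with such characters, which would either support or rule out this guess.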
Can you share the json file produced with |
This time it crashed AFTER producing the output and "Operation finished in:" ... line. Apparently this last crash is a case of SYSTRAN/faster-whisper#71 or something similar, but seems to be unrelated to this issue. |
Can you share the message of this new crash? |
There is no message in the console, it's just a standard Windows popup saying that "program has stopped working". For each of these new crashes Windows Event Viewer contains pairs of error messages like these:
Note that by the time it happens everything is already done and the program is exiting, and at no point does it max out on memory. For this reason this new crash is not so bad, just inconvenient.
Can reproduce it with
There is a "beep" sound code after the "Operation finished in:" ... line.
By default this second crash comes after the beep. With |
I encountered a similar error message on Ubuntu 22.04 using Faster-Whisper-XXL_r192.3.1_linux.
mis@ai-ai:~/下載/Faster-Whisper-XXL_r192.3.1_linux/Whisper-Faster-XXL$ sudo ./whisper-faster-xxl "2024-08-01 09-32-20.mkv" --language Chinese --initial_prompt "這是一段主要是繁體中文(台灣)的影片:" --model large-v2
Standalone Faster-Whisper-XXL r192.3.1 running on: CUDA
Starting work on: 2024-08-01 09-32-20.mkv
[00:00.520 --> 00:02.800] 但是其實呢
Transcription speed: 36.67 audio seconds/s
Traceback (most recent call last):
Additional information
I've been randomly getting these too. I think one was reproducible, but a power failure made me lose track of it. I'll keep an eye out.
Pasted post from another thread:
I'm wondering why I get these errors when I run
Particularly since I don't have a `d:\whisper-fast_XXL` folder.
They happen for certain songs (1 out of 10-15), but not for others. I can't say the exact cause, and I also can't fathom why it would be referencing a folder that doesn't exist on my D: drive ...
@ClaireCJS Those are internal paths inside exe, not on your PC. |
I know. It's just weird. I don't even have whisper on my D: ... I understand it's not real, it's just... weird. It's failing and knowing why would be nice? Sorry 😅 |
Fixed in v193.1 |
Unfortunately, it is still reproducible in v193.1, albeit with a slightly different stacktrace, but the error appears to be the same.
This is on attempt to use |
Can you share json file? |
I was actually hoping to do that by asking for all formats, to save time on transcription, but apparently the "bad" one comes earlier in the queue. In what sequence are they processed with |
I think json is the last, I'll put it as first in the next release. |
It's not, because it's not the same bug. |
Indeed, this may be related to the length of produced chunks. The model I am using does not split the text into sentences properly for some reason, therefore I am using Do you want me to create a separate issue for this? |
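To check whether chunk length is involved, the segments in the JSON output can be screened for unusual entries before sharing the file. The sketch below is only an illustration under stated assumptions: it assumes each segment is a dictionary with a "text" key and an optional "words" list, which may not match the actual JSON layout, and the function name `flag_suspect_segments` and the `max_chars` threshold are hypothetical.

```python
def flag_suspect_segments(segments, max_chars=200):
    """Return (position, reasons) for segments that look unusual.

    Assumed (unverified) layout: each segment is a dict with a "text" string
    and, optionally, a "words" list. `max_chars` is an arbitrary threshold.
    """
    flagged = []
    for position, segment in enumerate(segments):
        text = segment.get("text", "").strip()
        words = segment.get("words")
        reasons = []
        if not text:
            reasons.append("empty text")
        if len(text) > max_chars:
            reasons.append("long text")
        # An empty word list is a plausible source of an IndexError
        # if later code indexes the first or last word.
        if words is not None and len(words) == 0:
            reasons.append("empty word list")
        if reasons:
            flagged.append((position, reasons))
    return flagged

# Hand-made example data, not taken from the attached JSON file.
example = [
    {"text": "ok", "words": [{"word": "ok"}]},
    {"text": "", "words": []},
    {"text": "x" * 300},
]
print(flag_suspect_segments(example))
```

Any segment flagged this way would be a natural candidate to include in a minimal reproduction.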
Share your command.
Nah. |
The one to reproduce with the attached JSON file is The one where I encountered it originally in this release is |
The program (r192.3.4) crashes at the end of execution, but before generating a subtitle file, on some videos with the tiny model. It usually exits correctly with other models on the same video (this may not be directly related to the model used, only to whether its output has some offending attribute).
I think this is different to the crashes that may happen at the end of processing, also reported in the original faster-whisper.