
1.3.3 - Bug Fix

@Dadangdut33 Dadangdut33 released this 11 Nov 09:43
· 74 commits to master since this release

Fixed bugs with the logger, the language code, and loading of the Whisper model.

What's Changed

Full Changelog: 1.3.2...1.3.3

Notes

  • Before downloading or installing, please take a look at the wiki and read the Getting Started section.
  • If you previously installed Speech Translate as a module, you can update it by running pip install -U git+https://github.com/Dadangdut33/Speech-Translate.git --upgrade --force-reinstall
  • If you installed using the installer, you can download and run the installer below to update.
  • If you have any suggestions or have found any bugs, please feel free to open a discussion or an issue.

Requirements

  • Compatible OS:
OS        Prebuilt binary   As a module
Windows   ✔️                ✔️
MacOS                       ✔️
Linux                       ✔️

* Python 3.8 or later (3.11 is recommended) is required for installation as a module.

  • Speaker input only works on Windows 8 and above.
  • Internet connection (for translation with API)
  • FFmpeg must be installed and added to the PATH environment variable. You can do this when prompted in the app, or you can download it here and add it to your PATH manually. Alternatively, you can download it and add it to your PATH automatically by using one of the following commands:
# on Windows using PowerShell (Also included in the release page, and can be run by right clicking and selecting "Run with PowerShell")
# Must be run in an elevated PowerShell prompt (Run as administrator)
Set-ExecutionPolicy RemoteSigned -Scope CurrentUser # Optional: Needed to run a remote script the first time
& ([scriptblock]::Create(
     (New-Object System.Net.WebClient).DownloadString('https://raw.githubusercontent.com/Dadangdut33/Speech-Translate/master/install_ffmpeg.ps1')
  )) -webdl

# on Windows using Winget (Default package manager for Windows 10 and above)
winget install --id=Gyan.FFmpeg -e

# on Windows using Chocolatey (https://chocolatey.org/)
choco install ffmpeg

# on Windows using Scoop (https://scoop.sh/)
scoop install ffmpeg

# on Ubuntu or Debian
sudo apt update && sudo apt install ffmpeg

# on Arch Linux
sudo pacman -S ffmpeg

# on MacOS using Homebrew (https://brew.sh/)
brew install ffmpeg
  • A capable GPU with CUDA support is recommended for running the models (the prebuilt version uses CUDA 11.8). Each Whisper model has different requirements; for more information, see the Whisper repository.
Size     Parameters   English-only model   Multilingual model   Required VRAM   Relative speed
tiny     39 M         tiny.en              tiny                 ~1 GB           ~32x
base     74 M         base.en              base                 ~1 GB           ~16x
small    244 M        small.en             small                ~2 GB           ~6x
medium   769 M        medium.en            medium               ~5 GB           ~2x
large    1550 M       N/A                  large                ~10 GB          1x

* This information is also available in the app: hover over the model selection to see a tooltip with the model info. Also note that faster-whisper runs significantly faster and needs less VRAM, depending on usage; for more information, please visit the faster-whisper repository.
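
If you install as a module, a quick way to check the requirements listed above is a small script like the one below. This is only a rough sketch and not part of Speech Translate: the function names (python_ok, ffmpeg_on_path, largest_model_for) are made up for illustration, and the VRAM numbers are the approximate values from the model list above, so treat the suggested model as a starting point. With faster-whisper, the actual VRAM needs may be lower.

```python
import shutil
import sys

# Approximate VRAM needs in GB, copied from the model list above
# (ordered from smallest to largest model). These are rough figures.
MODEL_VRAM_GB = [("tiny", 1), ("base", 1), ("small", 2), ("medium", 5), ("large", 10)]


def python_ok(version=sys.version_info):
    """Return True if the given version is Python 3.8 or later."""
    return tuple(version[:2]) >= (3, 8)


def ffmpeg_on_path():
    """Return True if an `ffmpeg` executable can be found on PATH."""
    return shutil.which("ffmpeg") is not None


def largest_model_for(vram_gb):
    """Return the largest model whose approximate VRAM need fits, or None."""
    fitting = [name for name, need in MODEL_VRAM_GB if need <= vram_gb]
    return fitting[-1] if fitting else None


if __name__ == "__main__":
    print("Python 3.8+:", python_ok())
    print("FFmpeg on PATH:", ffmpeg_on_path())
    # 6 is a hypothetical amount of VRAM in GB; replace it with your GPU's.
    print("Suggested model for 6 GB VRAM:", largest_model_for(6))
```

Run it with the same Python interpreter you plan to install the module into, so that the version check and the PATH lookup reflect that environment.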