New features
Add sentence weighting: the data parameter example_weights accepts a file with one weight per line; each weight scales the loss of the corresponding training example
Summarize out-of-vocabulary (OOV) tokens at the end of the training logs:
ratio of OOV tokens in the training data
10 most frequent OOV tokens
[API] Add argument fallback_to_cpu to Runner.train to declare whether CPU training is allowed or not (defaults to True for backward compatibility)
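To illustrate the two data-related features above, here is a minimal pure-Python sketch of what sentence weighting and the OOV summary compute. It is not the library's implementation: the helper names (read_example_weights, weighted_mean_loss, summarize_oov) are hypothetical, tokens are assumed to be whitespace-separated, and averaging the scaled losses over the batch is an assumption about normalization.

```python
from collections import Counter


def read_example_weights(lines):
    """Parse one float weight per line (hypothetical helper, not a library call)."""
    return [float(line.strip()) for line in lines if line.strip()]


def weighted_mean_loss(losses, weights):
    """Scale each per-example loss by its weight, then average over the examples.

    Averaging over the number of examples is an assumption made for this sketch.
    """
    if len(losses) != len(weights):
        raise ValueError("expected exactly one weight per training example")
    scaled = [loss * weight for loss, weight in zip(losses, weights)]
    return sum(scaled) / len(scaled)


def summarize_oov(sentences, vocabulary, top_k=10):
    """Return the ratio of OOV tokens and the top_k most frequent OOV tokens."""
    oov_counts = Counter()
    total_tokens = 0
    for sentence in sentences:
        for token in sentence.split():  # assumes whitespace-tokenized text
            total_tokens += 1
            if token not in vocabulary:
                oov_counts[token] += 1
    ratio = sum(oov_counts.values()) / total_tokens if total_tokens else 0.0
    return ratio, oov_counts.most_common(top_k)


# The weights file has one weight per line, aligned with the training examples.
weights = read_example_weights(["1.0\n", "0.5\n"])
batch_loss = weighted_mean_loss([2.0, 4.0], weights)

# OOV statistics over a toy corpus with a two-token vocabulary.
oov_ratio, top_oov = summarize_oov(["a b c", "a d d"], {"a", "b"})
```

In this toy run the second example contributes half of its loss, and half of the six corpus tokens fall outside the vocabulary, with "d" being the most frequent OOV token.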
Fixes and improvements
Fix error when computing BLEU score with SacreBLEU
Fix vocabulary generation when using SentencePiece with pre-tokenization
Remove verbose checkpoint warnings when an exception occurs before checkpoint weights are actually loaded
Enable the pyonmttok and ctranslate2 dependencies on macOS