-
Notifications
You must be signed in to change notification settings - Fork 78
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support 100 translation languages with m2m-100 #5
Comments
Is this something currently being worked on? If not, I would love to contribute. |
In the long-term, I am looking into better translation support by LLMs like the tower of unbabel. Though it takes additional steps till we have general models with this enhancement. |
@Rohith04MVK This is not actively being worked on, but if folks want this I'm happy for it to be added. I haven't thought about this deeply, but I would imagine this could be implemented as something like: def translate(text, src_lang, dst_lang):
"""Translate `text` from `src_lang` to `dst_lang`"""
... It should be a lot like the code function. |
I'd love to help! While I think M2M-100 418M model with CTranslate2 (>512 MB) has potential, are there any other models or approaches we should consider before moving forward? |
My approach has been to try to define the simplest possible interface without worrying too much about specific models. New and improved models are created regularly, and one of my goals for this project is to provide easy access to the current state-of-the-art model for its size without users of the package needing to keep track of the latest and greatest models. There's a priority list of available models that is used to determine which model to use. The package searches through this list in order until a model is found the matches the current inference requirement (max RAM, license, tuning, etc). I would hope that we would be able to do the same for m2m100 looks like a reasonable place to start from my point of view. I just uploaded the ct2 int8 quantized models. |
NLLB models are also supported by cTranslate. They support up to 200 languages but are a magnitude bigger. |
Was the |
That's an oversight on my part. I have a notebook that I use to quickly convert these models. I didn't see that this file needed to be added to the files copied by ct2-transformers-converter. I've added those files now. |
We could support more translation directions with m2m-100 in cTranslate or use easy translate.
The text was updated successfully, but these errors were encountered: