Once the app can generate images (#24), meaning it can receive an image and not just text and can display the result, we should be able to edit images as well. Maybe this feature won't even need any extra coding?
I've seen other assistant projects that also use STT and TTS, like mine.
We can consider Imagen3 for image-related generation or edits, but that would be a separate interaction mode, since the prompt would need to be passed to Imagen3 and not to Gemini.
Similarly, music or audio generation would need a dedicated interaction mode. Maybe it could be an extension of the Shazam mode (#38)?
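A rough sketch of what a separate interaction mode could look like, in Python for illustration only (the app's actual language and structure are not shown in this thread). All names here (`Mode`, `route_prompt`, `send_to_gemini`, `send_to_imagen`) are hypothetical, and the backend functions are stubs rather than real API calls:

```python
from enum import Enum
from typing import Callable, Dict


class Mode(Enum):
    CHAT = "chat"    # regular prompts, handled by Gemini
    IMAGE = "image"  # image generation or edits, handled by Imagen3


def send_to_gemini(prompt: str) -> str:
    # Stub (assumption): a real app would call the Gemini API here.
    return f"[gemini] {prompt}"


def send_to_imagen(prompt: str) -> str:
    # Stub (assumption): a real app would call the Imagen3 API here.
    return f"[imagen3] {prompt}"


# One backend per interaction mode; an audio mode could be added the same way.
BACKENDS: Dict[Mode, Callable[[str], str]] = {
    Mode.CHAT: send_to_gemini,
    Mode.IMAGE: send_to_imagen,
}


def route_prompt(mode: Mode, prompt: str) -> str:
    """Pass the prompt to the backend that owns the active interaction mode."""
    return BACKENDS[mode](prompt)
```

With this shape, the active mode decides where the prompt goes, so the image mode never sends its prompt to Gemini.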