Vanilla JS web interface for Gemini 2.0 Flash Experimental Multimodal API with text, audio, camera, screen inputs and audio response and function calling from the LLM
-
Updated
Jan 9, 2025 - JavaScript
Vanilla JS web interface for Gemini 2.0 Flash Experimental Multimodal API with text, audio, camera, screen inputs and audio response and function calling from the LLM
A Python-based tool that generates engaging podcast conversations using Google's Gemini 2.0 Flash Experimental model for script generation and text-to-speech conversion.
Unlock the potential of Google's Gemini AI models with this versatile toolkit. Offering seamless chat, text generation, and multimodal interactions, supporting various file types, including PDF's, images, videos, audio, text and more. Enjoy real-time responses, customizable parameters, and easy integration for diverse AI tasks.
AI-Powered Podcast Generator: A Python-based tool that converts text scripts into realistic audio podcasts using Google's Generative AI API. This project leverages advanced text-to-speech technology to create dynamic, multi-speaker conversations with customizable voices.
Agentic framework for dynamic function calling across latest LLMs (gpt-4o, gemini-2.0-flash, groq modes, and anthropic models). Converts Python functions into provider-specific schemas for autonomous tool use. Features unified API, JSON schema generation, and integrated tool execution handling.
A chatbot using Streamlit and Google Gemini 2 flash with chat sessions
A Python-based agent showcasing the use of Google's Gemini 2.0 Flash Thinking Experimental model to reason by combining search results with user queries
The Phidata Video AI Summarizer Agent revolutionizes video content analysis by offering fast, AI-driven summarization and insightful query-based information retrieval. Built with the advanced capabilities of Google’s Gemini 2.0 Flash Exp and integrated with Phidata,
BookGen: A Python application using Google's Gemini 2.0 Flash to automatically generate complete books (PDF and TXT) from a given title and outline specifications.
Implementation of Anthropic agentic workflow's using Google Gemini 2.0 Flash Experimental
Simple CLI demo of the new free Gemini 2.0 Flash Expiremental
AtmosPhase isn't just another weather app 🌤️ - it's your personal weather command center, powered by cutting-edge technology and AI.
This project demonstrates object detection in images using Google's Gemini 2.0 spatial understanding. The code identifies objects within an image, draws bounding boxes around them, and labels them with their detected names. The example is made for detecting a weld quality inspection with Gemini 2.0 spatial understanding.
Add a description, image, and links to the gemini-2-0-flash-exp topic page so that developers can more easily learn about it.
To associate your repository with the gemini-2-0-flash-exp topic, visit your repo's landing page and select "manage topics."