This repository shows how to use the Gemini Flash API for multimodal queries. You'll learn how to upload and process multiple file types, such as audio, video, images, and text, and how to use those files together in a single query with Google's Generative AI.
- Upload various file types to the Gemini API
- Wait for files to be processed and ready
- Start a chat session with the generative model using uploaded files
- Compare different media types in a single query
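The wait-then-query flow in the list above can be sketched roughly as follows. This is a minimal sketch, not the code in index.js: `waitUntilActive` and `buildMessageParts` are hypothetical helper names, and the file lookup is passed in as a `getFile` function so the example runs without the SDK installed. It assumes the file metadata carries a `state` of `PROCESSING`, `ACTIVE`, or `FAILED`, plus `mimeType` and `uri` fields, as the Gemini File API reports them.

```javascript
// Sketch only: `getFile` is injected so this runs without any SDK.
// With the real SDK you might pass something like
// (name) => fileManager.getFile(name); that wiring is an assumption.

const sleep = (ms) => new Promise((resolve) => setTimeout(resolve, ms));

// Poll until the file leaves the PROCESSING state, then return its metadata.
// Throws if the file ends in any state other than ACTIVE, or if it is
// still processing after `maxAttempts` checks.
async function waitUntilActive(
  getFile,
  name,
  { intervalMs = 2000, maxAttempts = 30 } = {}
) {
  for (let attempt = 1; attempt <= maxAttempts; attempt++) {
    const file = await getFile(name);
    if (file.state === "ACTIVE") return file;
    if (file.state !== "PROCESSING") {
      throw new Error(`File ${name} ended in state ${file.state}`);
    }
    await sleep(intervalMs);
  }
  throw new Error(`File ${name} was still processing after ${maxAttempts} checks`);
}

// Turn processed files plus a text prompt into the parts of one user message,
// so several media types can be referenced in a single query.
function buildMessageParts(files, prompt) {
  const fileParts = files.map((file) => ({
    fileData: { mimeType: file.mimeType, fileUri: file.uri },
  }));
  return [...fileParts, { text: prompt }];
}
```

Passing the lookup function in, rather than importing a client inside the helper, keeps the polling logic easy to test with a fake that returns canned states.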
- Clone the repository:
  git clone https://github.com/developersdigest/gemini-flash-api.git
- Install dependencies:
  npm install
  or if you are using Bun:
  bun install
- Set up your API key: Ensure your GEMINI_API_KEY is set in your environment variables.
- Run the script:
  node index.js
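For the API key step above, a script can fail early with a readable message when the variable is missing. The helper below is a hypothetical sketch (`requireApiKey` is not a name taken from this repository); it only assumes that the key is read from the `GEMINI_API_KEY` environment variable, as described in the setup steps.

```javascript
// Hypothetical helper: returns the trimmed key, or throws if it is missing.
// `env` defaults to process.env but can be replaced in tests.
function requireApiKey(env = process.env) {
  const key = (env.GEMINI_API_KEY || "").trim();
  if (!key) {
    throw new Error(
      "GEMINI_API_KEY is not set. Export it before running node index.js."
    );
  }
  return key;
}
```

Calling `requireApiKey()` once at the top of a script surfaces a missing key before any upload is attempted.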
This project is licensed under the MIT License. See the LICENSE file for details.