Gemini with File

Overview

Gemini with File allows you to use Google’s Gemini AI to analyze and generate text based on various file types including PDFs, images, audio, and video files. This multimodal AI capability enables comprehensive document analysis and content generation.

Key Features

GEMINI_INVOKE_WITH_FILE
- Analyze PDF documents with AI
- Process images and extract information
- Handle audio and video files
- Generate structured responses based on file content
- Support for multiple file formats

Authentication

If you do not have an API key, you can still run Gemini tools by using Jinba API credit. If you’d like to use your own API key, you can obtain one from the official Gemini website. Note: Treat API keys as sensitive information and never commit them to public repositories.

Example: Document Analysis

- id: analyze_document
  name: analyze_document
  tool: GEMINI_INVOKE_WITH_FILE
  config:
    - name: version
      value: gemini-1.5-flash
    - name: token
      value: "{{secrets.GEMINI_API_KEY}}"
  input:
    - name: prompt
      value: |
        Please analyze this document and extract the following information:
        1. Main topics and themes
        2. Key data points and statistics
        3. Any actionable insights
        4. Summary of conclusions
    - name: file_url
      value: "{{steps.input_file.result.file_url}}"

Use Cases

Document Review: Analyze contracts, reports, and proposals
Content Extraction: Extract key information from research papers
Data Analysis: Process charts and graphs in images
Media Processing: Analyze video and audio content
Compliance Checking: Review documents for specific criteria

Docs

​Overview

​Key Features

​Authentication

​Example: Document Analysis

​Use Cases

Overview

Key Features

Authentication

Example: Document Analysis

Use Cases