Overview

Gemini with File allows you to use Google’s Gemini AI to analyze and generate text based on various file types including PDFs, images, audio, and video files. This multimodal AI capability enables comprehensive document analysis and content generation.

Key Features

  • GEMINI_INVOKE_WITH_FILE
    • Analyze PDF documents with AI
    • Process images and extract information
    • Handle audio and video files
    • Generate structured responses based on file content
    • Support for multiple file formats

Authentication

If you do not have an API key, you can still run Gemini tools by using Jinba API credit. If you’d like to use your own API key, you can obtain one from the official Gemini website. Note: Treat API keys as sensitive information and never commit them to public repositories.

Example: Document Analysis

- id: analyze_document
  name: analyze_document
  tool: GEMINI_INVOKE_WITH_FILE
  config:
    - name: version
      value: gemini-1.5-flash
    - name: token
      value: "{{secrets.GEMINI_API_KEY}}"
  input:
    - name: prompt
      value: |
        Please analyze this document and extract the following information:
        1. Main topics and themes
        2. Key data points and statistics
        3. Any actionable insights
        4. Summary of conclusions
    - name: file_url
      value: "{{steps.input_file.result.file_url}}"

Use Cases

  • Document Review: Analyze contracts, reports, and proposals
  • Content Extraction: Extract key information from research papers
  • Data Analysis: Process charts and graphs in images
  • Media Processing: Analyze video and audio content
  • Compliance Checking: Review documents for specific criteria