Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.jinba.io/llms.txt

Use this file to discover all available pages before exploring further.

Overview

The File Extraction tools allow you to extract information from files using their base64 representation. This is particularly useful for analyzing images or documents where you need to extract text or other relevant data.

Key Features

  • PDF_EXTRACT_TEXT: Extract text from PDF files.
  • DOCX_EXTRACT_TEXT: Extract text from DOCX files.
  • EXCEL_EXTRACT_TEXT: Extract structured data from Excel files.

Example: Extract Data from Base64 Files

- id: extract_pdf
  tool: PDF_EXTRACT_TEXT
  config:
    []
  input:
    - name: pdf_base64
      value: "{{secrets.PDF_FILE_BASE64}}"
- id: extract_docx
  tool: DOCX_EXTRACT_TEXT
  config:
    []
  input:
    - name: docx_base64
      value: "{{secrets.DOCX_FILE_BASE64}}"
- id: extract_pdf
  tool: PDF_EXTRACT_TEXT
  config:
    []
  input:
    - name: pdf_base64
      value: "{{secrets.PDF_FILE_BASE64}}"