Overview
Jinba Vector Search enables semantic search capabilities for knowledge bases using vector embeddings. This tool uses OpenAI’s text-embedding-3-large to vectorize queries and retrieve relevant data from your knowledge base based on similarity scores, providing powerful RAG (Retrieval-Augmented Generation) functionality.Key Features
- Semantic Search: Find conceptually similar content, not just exact matches
- Knowledge Base Integration: Search across uploaded documents and files
- Similarity Scoring: Filter results by similarity threshold
- Configurable Results: Control the number of results returned
- RAG Support: Perfect for building question-answering and information retrieval workflows
Authentication
This tool requires a Jinba API Token to access knowledge bases. Required Configuration:- token: Your Jinba API Token (stored as a secret)
Input Parameters
- query (required): Search query for semantic search
- knowledgeBaseId (required): ID of the knowledge base to search
- topK (optional): Number of top results to return (1-50, default: 3)
- threshold (optional): Similarity threshold for filtering results (0-1, default: 0.3)
Output Structure
Returns an array of search results with:- chunk: Content chunk with ID, file ID, content, and metadata
- score: Similarity score (0-1, higher is more similar)
- file: File information including filename and content type
- query: The original search query
- totalResults: Total number of results found
Example: Document Q&A System
Example: Research Assistant
Example: Content Recommendation
Best Practices
Query Optimization
- Use natural language: Write queries as you would ask a human
- Be specific: More specific queries often yield better results
- Include context: Add relevant keywords and context terms
Threshold Selection
- 0.7-1.0: Very high similarity, exact or near-exact matches
- 0.4-0.7: High similarity, closely related content
- 0.2-0.4: Moderate similarity, potentially relevant content
- 0.0-0.2: Low similarity, may include tangentially related content
Performance Tips
- Limit topK: Don’t retrieve more results than needed
- Adjust threshold: Higher thresholds = fewer, more relevant results
- Use metadata: Leverage chunk metadata for additional filtering
Knowledge Base Setup
Before using vector search, ensure your knowledge base contains relevant documents:- Upload Documents: Add PDFs, text files, or other supported formats
- Processing: Allow time for document chunking and vectorization
- Test Queries: Start with simple queries to understand your data
- Iterate: Refine queries and thresholds based on results
Use Cases
- Customer Support: Find relevant documentation for user questions
- Research Assistant: Discover related research papers and documents
- Content Discovery: Recommend similar articles or resources
- FAQ Automation: Automatically answer common questions
- Document Analysis: Find specific information across large document sets
- Knowledge Management: Quick access to institutional knowledge
- Legal Research: Search through contracts and legal documents
- Product Information: Find technical specifications and manuals