Overview
Pinecone is a fully managed vector database that enables semantic search and RAG (Retrieval-Augmented Generation) applications. The Pinecone tool allows you to query vector indexes for similar vectors using semantic similarity.Key Features
-
PINECONE_CREATE_INDEX- Create new serverless vector indexes
- Configure vector dimension and distance metric
- Select cloud provider and deployment region
- Optional deletion protection
- Wait-until-ready option for immediate use
-
PINECONE_QUERY- Search for similar vectors using semantic similarity
- Dual-mode operation: Vector query OR text query with integrated inference
- Support for top-K results retrieval
- Metadata filtering with Pinecone filter syntax
- Namespace isolation for data partitioning
- Similarity threshold filtering (minScore)
- Optional inclusion of metadata and vector values
-
PINECONE_UPSERT- Insert or update vectors in Pinecone index
- Dual-mode operation: Vector upsert OR document upsert with integrated inference
- Batch operations (up to 100 vectors/documents per request)
- Metadata support for rich vector annotations
- Namespace isolation for data organization
Authentication
You need a Pinecone API key to use this tool. You can obtain one from the Pinecone Console. Note: Treat API keys as sensitive information and never commit them to public repositories.Examples
Example: Create Index and Upsert Documents (Integrated Inference)
This example shows the complete workflow using Pinecone’s integrated inference feature - no external embedding tool needed!Example: Query with Reranking for Improved Relevance
Reranking re-scores initial results to improve relevance. This is especially useful for complex queries.Example: Traditional Semantic Search with OpenAI Embeddings
Example: Search with Metadata Filtering
Parameters
PINECONE_CREATE_INDEX
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
indexName | string | Yes | - | Name of the index (lowercase alphanumeric and hyphens, max 45 chars) |
dimension | number | Yes | - | Vector dimension (e.g., 1536 for OpenAI ada-002) |
metric | string | No | cosine | Distance metric: cosine, euclidean, or dotproduct |
cloud | string | No | aws | Cloud provider: aws, gcp, or azure |
region | string | Yes | - | Deployment region (e.g., us-east-1) |
deletionProtection | string | No | disabled | Enable deletion protection: enabled or disabled |
waitUntilReady | boolean | No | true | Wait until index is ready before returning |
PINECONE_QUERY
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
indexName | string | Yes | - | The name of the Pinecone index to query |
vector | number[] | No* | - | Query vector as array of numbers |
query | string | No* | - | Text query for integrated inference (automatic embedding) |
topK | number | No | 10 | Number of results to return (1-100) |
namespace | string | No | - | Namespace to query within the index |
filter | object | No | - | Metadata filter using Pinecone filter syntax |
minScore | number | No | - | Minimum similarity score threshold (0-1) |
includeMetadata | boolean | No | true | Include metadata in results |
includeValues | boolean | No | false | Include vector values in results |
vector or query must be provided, but not both.
PINECONE_UPSERT
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
indexName | string | Yes | - | The name of the Pinecone index |
vectors | array | No* | - | Array of vector objects (id, values, metadata) |
documents | array | No* | - | Array of document objects for integrated inference (id, text, metadata) |
namespace | string | No | - | Namespace for data organization |
vectors or documents must be provided, but not both.
Output
The tool returns an object with the following structure:Use Cases
RAG (Retrieval-Augmented Generation)
Combine Pinecone with OpenAI to build RAG applications:- Generate embeddings from user query
- Search Pinecone for relevant documents
- Use retrieved context in LLM prompts
Semantic Search
Search documents by meaning rather than keywords:- Find similar articles
- Recommend related content
- Discover relevant information
Question Answering
Build Q&A systems with context retrieval:- Technical documentation search
- Customer support knowledge base
- Research paper discovery
Jinba Toolbox