This repository demonstrates how to integrate Retrieval-Augmented Generation (RAG) using Pinecone into a custom LLM for Agora's Conversational AI. The project provides a foundation for creating more contextually aware and knowledge-enhanced AI conversations by leveraging vector embeddings and semantic similarity search.
- Vector Database Integration: Store and retrieve vector embeddings with Pinecone
- RAG Implementation: Enhance LLM responses with relevant context from your knowledge base
- API Endpoints: Ready-to-use REST APIs for managing records and generating AI completions
- Streaming Support: Real-time streaming responses from LLM APIs
- Easy Setup: Simple configuration with environment variables
- Node.js (v18+)
- Pinecone account and API key
- OpenAI API key (or other LLM provider)
- Agora account (for Conversational AI integration)
```bash
git clone https://github.com/TJ-Agora/Convo-AI-Custom-LLM-Pinecone.git
cd Convo-AI-Custom-LLM-Pinecone
npm install
```

Create a `.env` file in the root directory:
```bash
# LLM API (OpenAI by default)
LLM_API_KEY=your_llm_api_key

# Pinecone Configuration
PINECONE_API_KEY=your_pinecone_api_key
PINECONE_INDEX_NAME=your_pinecone_index_name

# Server Configuration
PORT=3000
```
```bash
npm run dev
```

The server will be available at `http://localhost:3000` (or the port you specified).
```
.
├── libs/
│   └── pinecone/
│       ├── config.js            # Pinecone initialization
│       └── pineconeService.js   # Vector database operations
├── routes/
│   ├── chatCompletionRouter.js  # LLM integration with RAG
│   └── pineconeRouter.js        # CRUD operations for Pinecone
├── .env.example                 # Example environment variables
├── package.json                 # Dependencies and scripts
├── server.js                    # Express server setup
└── README.md                    # Project documentation
```
- `POST /rag/pinecone/store`: Store a new record with its vector embedding

  ```json
  {
    "text": "Your text to be embedded and stored",
    "id": "optional-custom-id"
  }
  ```

- `POST /rag/pinecone/query`: Search for records by semantic similarity

  ```json
  {
    "query": "Your search query",
    "options": { "limit": 5 }
  }
  ```

- `DELETE /rag/pinecone/:id`: Delete a specific record by ID

- `DELETE /rag/pinecone/clear/all`: Clear all records (use with caution)

- `POST /rag/pinecone/embed`: Generate an embedding for text without storing it

  ```json
  { "text": "Text to generate embedding for" }
  ```
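To illustrate the request shapes above, here is a minimal client sketch using `fetch`. The helper names (`storeRequest`, `queryRequest`) and the base URL are illustrative, not part of the repository:

```javascript
// Illustrative helpers that build fetch options for the record endpoints.
// BASE_URL assumes the default local setup from the Quick Start.
const BASE_URL = process.env.RAG_BASE_URL || "http://localhost:3000";

// Build fetch options for POST /rag/pinecone/store
function storeRequest(text, id) {
  return {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify(id ? { text, id } : { text }),
  };
}

// Build fetch options for POST /rag/pinecone/query
function queryRequest(query, limit = 5) {
  return {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify({ query, options: { limit } }),
  };
}

// Usage (requires the server to be running):
// await fetch(`${BASE_URL}/rag/pinecone/store`, storeRequest("Pinecone is a vector DB"));
// const res = await fetch(`${BASE_URL}/rag/pinecone/query`, queryRequest("vector databases", 3));
// console.log(await res.json());
```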
- `POST /chat/completions`: Get an AI response with RAG enhancement

  ```json
  {
    "messages": [
      { "role": "system", "content": "You are a helpful assistant." },
      { "role": "user", "content": "Tell me about vector databases." }
    ],
    "model": "gpt-4o-mini",
    "stream": false,
    "queryRag": true
  }
  ```
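A small sketch of building that request body in code; the helper name and default option values are assumptions chosen to match the example payload above:

```javascript
// Build a /chat/completions request body with RAG enabled by default.
// The field names mirror the JSON example in this README.
function chatCompletionBody(
  userQuestion,
  { model = "gpt-4o-mini", stream = false, queryRag = true } = {}
) {
  return {
    messages: [
      { role: "system", content: "You are a helpful assistant." },
      { role: "user", content: userQuestion },
    ],
    model,
    stream,
    queryRag,
  };
}

// Usage (requires the server to be running):
// const res = await fetch("http://localhost:3000/chat/completions", {
//   method: "POST",
//   headers: { "Content-Type": "application/json" },
//   body: JSON.stringify(chatCompletionBody("Tell me about vector databases.")),
// });
```

Set `stream: true` to receive the response as it is generated rather than in one payload.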
- The user query is received by the chat completion endpoint
- If `queryRag` is enabled, the system:
  - Converts the query to a vector embedding
  - Searches Pinecone for semantically similar records
  - Formats and injects relevant records as context
- The enhanced prompt is sent to the LLM (e.g., OpenAI)
- The LLM response, now informed by your knowledge base, is returned
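The context-injection step can be sketched as a pure function. This is an illustrative sketch, not the repository's actual implementation; the `matches` shape assumes Pinecone-style results where each match carries its source text in `metadata.text`:

```javascript
// Format retrieved records and prepend them as a system message,
// leaving the original conversation untouched when nothing was retrieved.
function injectRagContext(messages, matches) {
  if (!matches || matches.length === 0) return messages;

  // Number each retrieved snippet so the LLM can reference them.
  const context = matches
    .map((m, i) => `[${i + 1}] ${m.metadata.text}`)
    .join("\n");

  const contextMessage = {
    role: "system",
    content: `Use the following retrieved context when answering:\n${context}`,
  };
  return [contextMessage, ...messages];
}
```

In the real flow, `matches` would come from querying the Pinecone index with the embedded user query before calling the LLM.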
This project is designed to work with Agora's Conversational AI by:
- Providing enhanced context to improve responses
- Supporting streaming for real-time conversation
- Maintaining conversation history and context
For full integration with Agora's Convo AI Engine, refer to the official Agora documentation.
Follow these steps to deploy your application to Heroku:
```bash
heroku login
heroku create your-app-name-here
```

Replace `your-app-name-here` with your desired app name.
```bash
heroku config:set PINECONE_API_KEY=your_pinecone_api_key
heroku config:set PINECONE_INDEX_NAME=your_pinecone_index_name
heroku config:set LLM_API_KEY=your_llm_api_key
heroku git:remote -a your-app-name-here
git push heroku main
```

Or use `git push heroku master` if your default branch is `master`.
```bash
heroku open
heroku logs --tail
```

- Make sure your `package.json` has the correct Node.js version in the `engines` section
- Ensure you have a `Procfile` in your root directory (it should contain `web: node server.js`)
- Remember that Heroku's filesystem is ephemeral: any file changes will be lost on dyno restart
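For reference, a one-line `Procfile` and a matching `engines` entry in `package.json` could look like the following (the `18.x` range is an assumption based on the v18+ prerequisite above):

```
web: node server.js
```

```json
{
  "engines": {
    "node": "18.x"
  }
}
```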
Built for demonstration purposes - customize for your specific needs.