An Express.js proxy server that connects Ollama with Helicone for LLM observability and monitoring of requests to your local Llama models.
This proxy server acts as a bridge between your Ollama instance and Helicone's observability platform. It lets you monitor, track, and analyze all of your Ollama LLM interactions while continuing to run your Llama models locally.
You point your LLM requests at this proxy server instead of Ollama directly; the proxy logs each request to Helicone and serves the response from your local Ollama instance.
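As a rough illustration (not the actual implementation), a proxy route of this kind might look like the sketch below. It assumes Express, Node 18+ for the built-in `fetch`, Ollama on `localhost:11434`, and uses a hypothetical `logToHelicone()` helper in place of the real Helicone logging step:

```js
// A minimal sketch of the proxy idea, not the actual implementation.
// Assumptions: Express, Node 18+ (built-in fetch), Ollama on localhost:11434,
// and a hypothetical logToHelicone() helper standing in for the real logging step.
const express = require("express");

const app = express();
app.use(express.json());

const OLLAMA_URL = "http://localhost:11434";

// Hypothetical stand-in: the actual proxy presumably sends this data to
// Helicone using the HELICONE_API_KEY from the environment.
async function logToHelicone(entry) {
  console.log("[helicone] would log:", JSON.stringify(entry).slice(0, 200));
}

app.post("/api/chat", async (req, res) => {
  const startedAt = Date.now();

  // Forward the request to the local Ollama server. Only the non-streaming
  // path is shown here for brevity; the real server also handles streaming,
  // as noted near the end of this README.
  const ollamaRes = await fetch(`${OLLAMA_URL}/api/chat`, {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify({ ...req.body, stream: false }),
  });
  const data = await ollamaRes.json();

  // Record the exchange in Helicone, then return Ollama's response to the caller.
  await logToHelicone({
    request: req.body,
    response: data,
    latencyMs: Date.now() - startedAt,
  });
  res.status(ollamaRes.status).json(data);
});

app.listen(3100);
```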
- Seamless integration between Ollama and Helicone
- Full support for Ollama's chat and generation endpoints
- Automatic request/response logging
- Version and model tag checking endpoints
- Node.js (14.x or higher)
- Ollama installed locally
- Helicone API key
- Sign up for a free account at Helicone and get your API key from the dashboard settings.
- Create a `.env` file in your project root containing `HELICONE_API_KEY=your_helicone_api_key_here` (the startup sketch after these steps shows one way the key might be loaded).
- Install dependencies with `npm install`.
- Start your local Ollama server with `ollama serve`. This starts Ollama on port 11434, and it must be running for the proxy to work. You can also run `ollama list` to see which models are available locally.
- Start the proxy server with `npm start`. The server listens on port 3100 by default.
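To illustrate how these setup steps fit together, here is a hypothetical startup sketch (not the project's actual entry point): it loads `HELICONE_API_KEY` from `.env` (assuming the `dotenv` package and Node 18+), checks that Ollama is reachable on port 11434, and would then start the proxy on port 3100:

```js
// Hypothetical startup sketch; not the project's actual entry point.
// Assumes the dotenv package and Node 18+ (built-in fetch).
require("dotenv").config();

// The proxy needs the Helicone API key from the .env file created above.
if (!process.env.HELICONE_API_KEY) {
  console.error("HELICONE_API_KEY is missing - check your .env file.");
  process.exit(1);
}

// Confirm the local Ollama server (started with `ollama serve`) is reachable
// and list the locally available models, much like `ollama list` does.
fetch("http://localhost:11434/api/tags")
  .then((res) => res.json())
  .then((data) => {
    const names = (data.models || []).map((m) => m.name).join(", ");
    console.log(`Ollama is running. Local models: ${names || "none"}`);
    // At this point the Express app would start listening on port 3100.
  })
  .catch(() => {
    console.error("Could not reach Ollama on port 11434 - is `ollama serve` running?");
  });
```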
To verify that everything is working correctly, you can send a test request:
```bash
curl -X POST http://localhost:3100/api/chat \
  -H "Content-Type: application/json" \
  -d '{"model":"llama2","messages":[{"role":"user","content":"Hello"}]}'
```
After running this command:
- You should see the Ollama model's response, served from your local machine, printed in your terminal
- Log into your Helicone dashboard to see the request logged with all its details in the Requests view.
- The dashboard will show metrics such as response time, token usage, and request status.
If you can see the request in your Helicone dashboard, congratulations! Your setup is working correctly.
POST http://localhost:3100/api/chat
Example request:
```json
{
  "model": "llama2",
  "messages": [
    {
      "role": "user",
      "content": "Hello, how are you?"
    }
  ]
}
```
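For reference, the same request can be sent from Node as well. This is a usage sketch, assuming Node 18+ for the built-in `fetch` and that the proxy passes the request body through to Ollama unchanged:

```js
// Example client call to the proxy's chat endpoint (assumes Node 18+).
async function chat() {
  const res = await fetch("http://localhost:3100/api/chat", {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify({
      model: "llama2",
      messages: [{ role: "user", content: "Hello, how are you?" }],
      stream: false, // ask Ollama for a single JSON response instead of a stream
    }),
  });
  const data = await res.json();
  // Ollama's non-streaming chat responses carry the reply in message.content.
  console.log(data.message.content);
}

chat();
```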
POST http://localhost:3100/api/generate
Example request:
```json
{
  "model": "llama2",
  "prompt": "Write a story about a space cat"
}
```
- `GET /api/version` - Get the Ollama version
- `GET /api/tags` - List available models
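Assuming these two routes are passed through to Ollama's own `/api/version` and `/api/tags` endpoints, they can double as a quick health check via the proxy:

```js
// Quick health check through the proxy (assumes Node 18+ for built-in fetch).
const base = "http://localhost:3100";

fetch(`${base}/api/version`)
  .then((res) => res.json())
  .then((data) => console.log("Ollama version:", data.version));

fetch(`${base}/api/tags`)
  .then((res) => res.json())
  .then((data) => console.log("Models:", data.models.map((m) => m.name)));
```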
This proxy automatically integrates with Helicone, a powerful LLM observability platform that provides:
- Detailed request/response logging
- Cost tracking and analytics
- Latency monitoring
- User behavior analytics
- Custom property tracking
- Advanced filtering and search capabilities
By using Helicone, you gain detailed visibility into your LLM operations, helping you:
- Optimize costs and performance
- Debug issues faster
- Understand usage patterns
- Make data-driven decisions about your LLM implementation
The proxy server handles both streaming and non-streaming responses from Ollama, ensuring compatibility with various client implementations while maintaining detailed logging through Helicone.
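One common way to implement this (a sketch under the same assumptions as above, not necessarily how this proxy does it) is to pipe Ollama's newline-delimited JSON chunks straight through to the client while buffering a copy, so the full exchange can be logged to Helicone once the stream ends:

```js
// Streaming pass-through sketch (assumes Express and Node 18+; not necessarily
// how this proxy implements it). Ollama streams newline-delimited JSON chunks.
const express = require("express");
const { Readable } = require("node:stream");

const app = express();
app.use(express.json());

app.post("/api/chat", async (req, res) => {
  const ollamaRes = await fetch("http://localhost:11434/api/chat", {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify(req.body),
  });

  res.status(ollamaRes.status);
  res.setHeader(
    "Content-Type",
    ollamaRes.headers.get("content-type") || "application/x-ndjson"
  );

  const chunks = [];
  const body = Readable.fromWeb(ollamaRes.body);

  // Forward each chunk to the client as soon as it arrives, keeping a copy
  // so the complete exchange can be logged after the stream finishes.
  body.on("data", (chunk) => {
    chunks.push(chunk);
    res.write(chunk);
  });

  body.on("end", () => {
    res.end();
    const fullResponse = Buffer.concat(chunks).toString("utf8");
    // Hypothetical logging step; the real proxy presumably sends this to Helicone.
    console.log("[helicone] would log", fullResponse.length, "bytes for", req.body.model);
  });
});

app.listen(3100);
```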
This project is open source under the MIT License.
Contributions are welcome! Please feel free to submit a Pull Request.