This repository is made for TogetherCrew's LLM bot.
Run our RAG evaluations locally or in GitHub Actions. Results are written to `results.csv` and `results_cost.json`.
Prerequisites:
- Create a `.env` file with your `OPENAI_API_KEY` (and any other required envs), for example:
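A minimal `.env` might look like this (assuming only the OpenAI key is required; add whatever other variables your setup needs):

```env
# Required: used by the evaluation run to call OpenAI models
OPENAI_API_KEY=your-openai-api-key
```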
Run:
`docker compose -f docker-compose.evaluation.yml up --build`

This will:
- Start a local Qdrant at port 6333
- Run `evaluation/evaluation.py --community-id 1234 --platform-id 4321`
- Persist `results.csv` and `results_cost.json` to the repo root on your host
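For reference, a rough sketch of what `docker-compose.evaluation.yml` does; service names, the build context, and the volume mount are illustrative assumptions, and the file checked into the repo is authoritative:

```yaml
services:
  qdrant:
    image: qdrant/qdrant      # local vector store, exposed on the default port
    ports:
      - "6333:6333"

  evaluation:
    build: .                  # illustrative; the real build context/Dockerfile may differ
    env_file:
      - .env                  # supplies OPENAI_API_KEY and any other required envs
    depends_on:
      - qdrant
    volumes:
      - ./:/app               # illustrative mount so results land in the repo root on the host
    command: >
      python evaluation/evaluation.py
      --community-id 1234 --platform-id 4321
```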
GitHub Actions:
- Workflow: RAG Evaluation (manual trigger)
- Steps performed:
- Boot a Qdrant service
- Install Python dependencies and spaCy model
- Run the evaluation
- Compute and publish averages (faithfulness, answer_relevancy, context_precision, context_recall) to the job summary
- Upload `results.csv` and `results_cost.json` as artifacts
Ensure `OPENAI_API_KEY` is set as a repository secret.
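A rough sketch of what such a workflow looks like; action versions, the Python version, the dependency-install commands, the spaCy model name, and the summary helper script are all assumptions, and the workflow file in the repo is the source of truth:

```yaml
name: RAG Evaluation
on: workflow_dispatch  # manual trigger

jobs:
  evaluate:
    runs-on: ubuntu-latest
    services:
      qdrant:
        image: qdrant/qdrant
        ports:
          - 6333:6333
    env:
      OPENAI_API_KEY: ${{ secrets.OPENAI_API_KEY }}
    steps:
      - uses: actions/checkout@v4
      - uses: actions/setup-python@v5
        with:
          python-version: "3.11"
      - name: Install dependencies and spaCy model
        run: |
          pip install -r requirements.txt          # assumption: pip + requirements.txt
          python -m spacy download en_core_web_sm  # assumption: model name
      - name: Run evaluation
        run: python evaluation/evaluation.py --community-id 1234 --platform-id 4321
      - name: Publish metric averages to the job summary
        # hypothetical helper step; see the Python snippet further below
        run: python scripts/summarize_results.py >> "$GITHUB_STEP_SUMMARY"
      - uses: actions/upload-artifact@v4
        with:
          name: evaluation-results
          path: |
            results.csv
            results_cost.json
```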
Outputs:
- `results.csv`: per-sample evaluation results
- `results_cost.json`: aggregate token/cost info
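To inspect the outputs locally, something like the following works; the column names are assumed to match the metric names listed above, and the layout of the cost file depends on how `evaluation.py` writes it:

```python
import json

import pandas as pd

# Per-sample metrics: one row per evaluated question/answer pair.
results = pd.read_csv("results.csv")
metrics = ["faithfulness", "answer_relevancy", "context_precision", "context_recall"]
print(results[metrics].mean())  # the same averages the workflow publishes to its job summary

# Aggregate token/cost info; exact keys depend on the evaluation script.
with open("results_cost.json") as f:
    print(json.dumps(json.load(f), indent=2))
```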
Planned improvements:
- Fetch the Qdrant snapshot from S3 and persist it in the Docker Compose evaluation
- Fetch the test dataset from S3 and update `evaluation/evaluation.py` to load from S3 (configurable root); a sketch of the S3 piece follows this list
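A hypothetical sketch of that S3 piece, using `boto3` plus Qdrant's snapshot-upload endpoint; the bucket, object keys, local paths, and collection name are placeholders, and the restore mechanism is one of several options:

```python
import boto3
import requests

# Placeholders: real bucket/key/collection names would come from configuration.
BUCKET = "togethercrew-evaluation"            # hypothetical bucket
SNAPSHOT_KEY = "qdrant/collection.snapshot"   # hypothetical object key
DATASET_KEY = "datasets/test_dataset.json"    # hypothetical object key
COLLECTION = "community_1234"                 # hypothetical collection name

s3 = boto3.client("s3")

# Fetch the test dataset so evaluation.py can load it from a configurable root.
s3.download_file(BUCKET, DATASET_KEY, "evaluation/test_dataset.json")

# Fetch the Qdrant snapshot and restore it into the local Qdrant started by Compose.
s3.download_file(BUCKET, SNAPSHOT_KEY, "collection.snapshot")
with open("collection.snapshot", "rb") as snap:
    resp = requests.post(
        f"http://localhost:6333/collections/{COLLECTION}/snapshots/upload",
        files={"snapshot": snap},
    )
resp.raise_for_status()
```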