Skip to content
Closed
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
2 changes: 1 addition & 1 deletion .changeset/config.json
Original file line number Diff line number Diff line change
Expand Up @@ -7,5 +7,5 @@
"access": "restricted",
"baseBranch": "main",
"updateInternalDependencies": "patch",
"ignore": ["@roo-code/cli"]
"ignore": ["@klaus-code/cli"]
}
6 changes: 3 additions & 3 deletions .github/pull_request_template.md
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
<!--
Thank you for contributing to Roo Code!
Thank you for contributing to Klaus Code!

Before submitting your PR, please ensure:
- It's linked to an approved GitHub Issue.
Expand All @@ -12,10 +12,10 @@ Before submitting your PR, please ensure:

Closes: # <!-- Replace with the issue number, e.g., Closes: #123 -->

### Roo Code Task Context (Optional)
### Klaus Code Task Context (Optional)

<!--
If you used Roo Code to help create this PR, you can share public task links here.
If you used Klaus Code to help create this PR, you can share public task links here.
This helps reviewers understand your development process and provides additional context.
Example: https://app.roocode.com/share/task-id
-->
Expand Down
4 changes: 2 additions & 2 deletions .github/workflows/evals.yml
Original file line number Diff line number Diff line change
Expand Up @@ -64,10 +64,10 @@ jobs:
docker compose run --rm runner docker ps

- name: Run database migrations
run: docker compose run --rm runner pnpm --filter @roo-code/evals db:migrate
run: docker compose run --rm runner pnpm --filter @klaus-code/evals db:migrate

- name: Run evals
run: docker compose run --rm runner pnpm --filter @roo-code/evals cli --ci
run: docker compose run --rm runner pnpm --filter @klaus-code/evals cli --ci

- name: Cleanup
if: always()
Expand Down
46 changes: 0 additions & 46 deletions .github/workflows/website-deploy.yml

This file was deleted.

89 changes: 0 additions & 89 deletions .github/workflows/website-preview.yml

This file was deleted.

2 changes: 1 addition & 1 deletion .roo/commands/cli-release.md
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
---
description: "Create a new release of the Roo Code CLI"
description: "Create a new release of the Klaus Code CLI"
argument-hint: "[version-description]"
mode: code
---
Expand Down
2 changes: 1 addition & 1 deletion .roo/commands/release.md
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
---
description: "Create a new release of the Roo Code extension"
description: "Create a new release of the Klaus Code extension"
argument-hint: patch | minor | major
mode: code
---
Expand Down
29 changes: 4 additions & 25 deletions .roo/skills/evals-context/SKILL.md
Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
---
name: evals-context
description: Provides context about the Roo Code evals system structure in this monorepo. Use when tasks mention "evals", "evaluation", "eval runs", "eval exercises", or working with the evals infrastructure. Helps distinguish between the evals execution system (packages/evals, apps/web-evals) and the public website evals display page (apps/web-roo-code/src/app/evals).
description: Provides context about the Klaus Code evals system structure in this monorepo. Use when tasks mention "evals", "evaluation", "eval runs", "eval exercises", or working with the evals infrastructure. Helps distinguish between the evals execution system (packages/evals, apps/web-evals) and the internal eval results display.
---

# Evals Codebase Context
Expand All @@ -12,7 +12,6 @@ Use this skill when the task involves:
- Modifying or debugging the evals execution infrastructure
- Adding new eval exercises or languages
- Working with the evals web interface (apps/web-evals)
- Modifying the public evals display page on roocode.com
- Understanding where evals code lives in this monorepo

## When NOT to Use This Skill
Expand All @@ -31,7 +30,6 @@ This monorepo has **two distinct evals-related locations** that can cause confus
| --------------------------- | -------------------------------------------------------------- | -------------------------------------------------------------- |
| **Evals Execution System** | `packages/evals/` | Core eval infrastructure: CLI, DB schema, Docker configs |
| **Evals Management UI** | `apps/web-evals/` | Next.js app for creating/monitoring eval runs (localhost:3446) |
| **Website Evals Page** | `apps/web-roo-code/src/app/evals/` | Public roocode.com page displaying eval results |
| **External Exercises Repo** | [Roo-Code-Evals](https://github.com/RooCodeInc/Roo-Code-Evals) | Actual coding exercises (NOT in this monorepo) |

## Directory Structure Reference
Expand Down Expand Up @@ -84,18 +82,6 @@ apps/web-evals/
│ └── lib/ # Utilities and schemas
```

### `apps/web-roo-code/src/app/evals/` - Public Website Evals Page

```
apps/web-roo-code/src/app/evals/
├── page.tsx # Fetches and displays public eval results
├── evals.tsx # Main evals display component
├── plot.tsx # Visualization component
└── types.ts # EvalRun type (extends packages/evals types)
```

This page **displays** eval results on the public roocode.com website. It imports types from `@roo-code/evals` but does NOT run evals.

## Architecture Overview

The evals system is a distributed evaluation platform that runs AI coding tasks in isolated VS Code environments:
Expand All @@ -115,7 +101,7 @@ The evals system is a distributed evaluation platform that runs AI coding tasks
**Key components:**

- **Controller**: Orchestrates eval runs, spawns runners, manages task queue (p-queue)
- **Runner**: Isolated Docker container with VS Code + Roo Code extension + language runtimes
- **Runner**: Isolated Docker container with VS Code + Klaus Code extension + language runtimes
- **Redis**: Pub/sub for real-time events (NOT task queuing)
- **PostgreSQL**: Stores runs, tasks, metrics

Expand All @@ -141,13 +127,6 @@ Edit files in [`apps/web-evals/src/`](apps/web-evals/src/):
- [`app/runs/new/new-run.tsx`](apps/web-evals/src/app/runs/new/new-run.tsx) - New run form
- [`actions/runs.ts`](apps/web-evals/src/actions/runs.ts) - Run server actions

### Modifying the Public Evals Display Page

Edit files in [`apps/web-roo-code/src/app/evals/`](apps/web-roo-code/src/app/evals/):

- [`evals.tsx`](apps/web-roo-code/src/app/evals/evals.tsx) - Display component
- [`plot.tsx`](apps/web-roo-code/src/app/evals/plot.tsx) - Charts

### Database Schema Changes

1. Edit [`packages/evals/src/db/schema.ts`](packages/evals/src/db/schema.ts)
Expand Down Expand Up @@ -179,10 +158,10 @@ cd packages/evals && npx vitest run
cd apps/web-evals && npx vitest run
```

## Key Types/Exports from `@roo-code/evals`
## Key Types/Exports from `@klaus-code/evals`

The package exports are defined in [`packages/evals/src/index.ts`](packages/evals/src/index.ts):

- Database queries: `getRuns`, `getTasks`, `getTaskMetrics`, etc.
- Schema types: `Run`, `Task`, `TaskMetrics`
- Used by both `apps/web-evals` and `apps/web-roo-code`
- Used by `apps/web-evals`
2 changes: 1 addition & 1 deletion .vscode/tasks.json
Original file line number Diff line number Diff line change
Expand Up @@ -17,7 +17,7 @@
{
"label": "watch:webview",
"type": "shell",
"command": "pnpm --filter @roo-code/vscode-webview dev",
"command": "pnpm --filter @klaus-code/vscode-webview dev",
"group": "build",
"problemMatcher": {
"owner": "vite",
Expand Down
Loading
Loading