feat: better benchmarks with swe-bench approach by aeneasr · Pull Request #34 · ory/lumen

aeneasr · 2026-03-09T14:58:14Z

No description provided.

…hunk overlap Improve chunk partitioning for oversized code chunks: - findSplitPoint now recognizes block-ending patterns across language families: C-family (}, },, });, };), Ruby/Elixir (end), and a dedent heuristic for Python/YAML (detects indentation decreases at block boundaries) - Add 5-line overlap between adjacent sub-chunks to improve search recall for queries matching concepts that span a split boundary - Comprehensive test coverage: all boundary patterns, edge-of-lookback window, multiple consecutive splits, overlap content and line number verification Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Instead of telling users to manually run /lumen:reindex, the doctor skill now triggers force_reindex: true via semantic_search when the index is stale or missing. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

These results are superseded by the bench-swe pipeline and are no longer referenced from documentation. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

aeneasr and others added 5 commits March 7, 2026 12:46

docs: add doctoc TOC and fix markdown table formatting

7215e1e

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

fix(skills): doctor skill auto-reindexes stale indexes

e8e5630

Instead of telling users to manually run /lumen:reindex, the doctor skill now triggers force_reindex: true via semantic_search when the index is stale or missing. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

chore: remove old benchmark result files

e343966

These results are superseded by the bench-swe pipeline and are no longer referenced from documentation. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

feat: better benchmarks with swe-bench approach

2f84be7

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: better benchmarks with swe-bench approach#34

feat: better benchmarks with swe-bench approach#34
aeneasr wants to merge 5 commits intomainfrom
better-benchmarks

aeneasr commented Mar 9, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

aeneasr commented Mar 9, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant