fix: include tool definition tokens in cost optimization token count #3733
jyoti369 wants to merge 10 commits into archestra-ai:main from
Conversation
The cost optimization logic only counted message tokens when evaluating maxLength rules, ignoring tool definitions entirely. Requests with small messages but large tool schemas (e.g. many MCP tools) could be incorrectly routed to cheaper models. Now estimateToolTokens() serializes tool names, descriptions, and input schemas, using the same chars/4 approximation as the base tokenizer. The tool token count is added to the message token count before rule evaluation. Closes archestra-ai#3423
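A rough sketch of the approach described above (the types and function body are a reconstruction from this description, not the actual archestra-ai source):

```typescript
// Illustrative reconstruction: serialize each tool's name, description,
// and input schema, then apply the same chars/4 heuristic as the base
// tokenizer. Field names here are assumptions.
interface ToolDefinition {
  name: string;
  description?: string;
  inputSchema?: Record<string, unknown>;
}

function estimateToolTokens(tools: ToolDefinition[]): number {
  let chars = 0;
  for (const tool of tools) {
    chars += tool.name.length;
    chars += tool.description?.length ?? 0;
    if (tool.inputSchema) {
      chars += JSON.stringify(tool.inputSchema).length;
    }
  }
  return Math.ceil(chars / 4); // same chars/4 approximation as messages
}

// The estimate is then added to the message token count before rules run:
// totalTokens = messageTokenCount + estimateToolTokens(tools)
```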
joeyorlando left a comment
hey @jyoti369 👋 thanks for the contribution. Some quick/initial review comments.
I think there’s one blocking regression to address before merge.
hasTools changed semantics in llm-proxy-handler.ts. Previously we passed requestAdapter.hasTools(), which reflects whether the incoming request declared any tools at all. The new code derives it from requestAdapter.getTools().length > 0, but getTools() is a normalized subset used for persistence/token estimation and it can intentionally drop some provider-native tools.
For example, the Anthropic adapter filters out non-custom tools like bash / text_editor in getTools(), while hasTools() still reports that the request included tools. That means existing optimization rules that key off hasTools can stop matching for requests that still do have tools.
I’d keep the original hasTools = requestAdapter.hasTools() for rule evaluation, and pass the normalized tools array separately only for token estimation.
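The suggested split could look roughly like this (the adapter and estimator here are mocked; the names mirror this comment, not the actual archestra-ai API):

```typescript
// Mocked sketch: hasTools() keeps the raw request semantics for rule
// matching, while getTools() (the normalized subset) feeds only the
// token estimation.
interface Tool {
  name: string;
  description?: string;
}

// Mock adapter: the request declared tools, but normalization dropped a
// provider-native tool (e.g. Anthropic's bash / text_editor).
const requestAdapter = {
  hasTools: () => true, // reflects the incoming request as declared
  getTools: (): Tool[] => [{ name: "my_custom_tool" }], // normalized subset
};

// chars/4 heuristic over the normalized tools only
const estimateToolTokens = (tools: Tool[]): number =>
  Math.ceil(
    tools.reduce((c, t) => c + t.name.length + (t.description?.length ?? 0), 0) / 4
  );

const hasTools = requestAdapter.hasTools(); // drives optimization rules
const toolTokenCount = estimateToolTokens(requestAdapter.getTools());
```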
Once that’s fixed, the main direction here looks good to me.
(lastly, please address the linting errors on CI)
Hey @joeyorlando, good catch! You're right: `getTools()` only returns the normalized/filtered subset and would drop provider-native tools like Anthropic's `bash`/`text_editor`, so deriving `hasTools` from its length was wrong. Reverted to `requestAdapter.hasTools()` for rule evaluation, keeping `tools = requestAdapter.getTools()` only for token estimation. Fix is pushed.
Problem
The cost optimization logic (`getOptimizedModel`) only counted message tokens when evaluating `maxLength` rules. Tool definitions were completely ignored in the token count: a request with 100 tokens of messages but 5000 tokens worth of tool definitions (e.g. many MCP tools with detailed JSON schemas) would be incorrectly routed to a cheaper/smaller model.

Issue: #3423
Changes
cost-optimization.ts
- Added `estimateToolTokens()`, which serializes tool names, descriptions, and input schemas, then uses the same `chars/4` approximation as the base tokenizer
- Updated `getOptimizedModel()` to accept an optional `tools` parameter and add the tool token count to the total before rule evaluation
- Logs `messageTokenCount` and `toolTokenCount` separately

llm-proxy-handler.ts
- Passes `requestAdapter.getTools()` into the optimizer (previously only passed a `hasTools` boolean)

cost-optimization.test.ts
- Added tests for `estimateToolTokens`: empty array, single tool, large vs small schemas, multiple tools accumulation, tools without description

How it works
Before:
After:
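A minimal sketch of what the `maxLength` rules compare against before versus after this change, using the 100/5000-token example from the description (numbers and names are illustrative):

```typescript
// Illustrative only: the totals seen by maxLength rule evaluation.
const messageTokenCount = 100;  // small messages
const toolTokenCount = 5000;    // many MCP tools with detailed JSON schemas

// Before: tool definitions were ignored, so rules saw only message
// tokens and could route this request to a cheaper/smaller model.
const beforeTotal = messageTokenCount;

// After: the tool token count is added before rule evaluation.
const afterTotal = messageTokenCount + toolTokenCount;
```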