demo scripts #3375

therealnb · 2026-01-21T15:47:14Z

These are some demo scripts that bootstrap a kind k8s cluster, build and install the components.

#3253) * feat: Add optimizer package with semantic tool discovery and ingestion This PR introduces the optimizer package, a Go port of the mcp-optimizer Python service that provides semantic tool discovery and ingestion for MCP servers. - **Semantic tool search** using vector embeddings (384-dim) - **Token counting** for LLM cost estimation - **Full-text search** via SQLite FTS5 - **Multiple embedding backends**: Ollama, vLLM, or placeholder (testing) - **Production-ready database** with sqlite-vec for vector similarity search

* feat: Add optimizer integration endpoints and tool discovery - Add find_tool and call_tool endpoints to vmcp optimizer - Add semantic search and string matching for tool discovery - Update optimizer integration documentation - Add test scripts for optimizer functionality

) * fix: Resolve tool names in optim.find_tool to match routing table

* feat: Add token metrics and observability to optimizer integration

…failures The checkPodsReady function was checking all pods with matching labels, including old pods that had completed (Phase: Succeeded) from previous deployments. This caused the auth discovery e2e test to fail when old pods were still present during deployment updates. Fix: Skip pods that are not in Running phase and ensure at least one running pod exists after filtering.

The test was failing with 'connection reset by peer' errors when trying to connect to the health endpoint. This can happen if pods crash or restart between the BeforeAll setup and the actual test execution. Fix: Add explicit pod readiness verification right before the health check and also check pod readiness inside the Eventually loop to catch pods that crash during health check retries. This makes the test more robust by ensuring pods are stable before attempting HTTP connections.

The health check was using http.Get() without a timeout, which could cause hangs. Add an explicit HTTP client with 10s timeout and improve error messages to help diagnose connection reset issues.

* remove docs * fixes from review * simplify code and fixes from review * fixes from review * fix ci --------- Co-authored-by: taskbot <taskbot@users.noreply.github.com>

) * fix: Resolve tool names in optim.find_tool to match routing table

…nfig - Use DeepCopy() for automatic passthrough of config fields (Optimizer, Metadata, etc.) - Add resolveEmbeddingService() to resolve Kubernetes Service names to URLs - Ensures optimizer config is properly converted from CRD to runtime config - Resolves embeddingService references in Kubernetes deployments

- Add CLI fallback for embeddingService when not resolved by operator - Normalize localhost to 127.0.0.1 in embeddings to avoid IPv6 issues - Add HTTP timeout (30s) to prevent hanging connections - Remove WithContinuousListening() to use timeout-based approach

Add tracing spans to all aggregator methods to enable visibility of capability aggregation in Jaeger. This includes spans for: - AggregateCapabilities (parent span) - QueryAllCapabilities (parallel backend queries) - QueryCapabilities (per-backend queries) - ResolveConflicts (conflict resolution) - MergeCapabilities (final merge) All spans include relevant attributes like backend counts, tool/resource/prompt counts, and error recording. This fixes the issue where capability aggregation logs appeared but no spans were visible in Jaeger.

Signed-off-by: nigel brown <nigel@stacklok.com>

github-actions

Large PR Detected

This PR exceeds 1000 lines of changes and requires justification before it can be reviewed.

How to unblock this PR:

Add a section to your PR description with the following format:

## Large PR Justification

[Explain why this PR must be large, such as:]
- Generated code that cannot be split
- Large refactoring that must be atomic
- Multiple related changes that would break if separated
- Migration or data transformation

Alternative:

Consider splitting this PR into smaller, focused changes (< 1000 lines each) for easier review and reduced risk.

See our Contributing Guidelines for more details.

This review will be automatically dismissed once you add the justification section.

Signed-off-by: nigel brown <nigel@stacklok.com>

jerm-dro and others added 16 commits January 21, 2026 15:40

Update vmcp/README

e96922d

fix: Resolve tool names in optim.find_tool to match routing table (#3337

4466fef

) * fix: Resolve tool names in optim.find_tool to match routing table

Add token metrics and observability to optimizer integration (#3347)

b33f41e

* feat: Add token metrics and observability to optimizer integration

fix: Bump operator-crds chart version to 0.0.97 after rebase

01ebd17

fix: Add HTTP client timeout to health check in flaky e2e test

caea545

The health check was using http.Get() without a timeout, which could cause hangs. Add an explicit HTTP client with 10s timeout and improve error messages to help diagnose connection reset issues.

Add dynamic/static mode support to VirtualMCPServer operator (#3235)

81e24bd

* remove docs * fixes from review * simplify code and fixes from review * fixes from review * fix ci --------- Co-authored-by: taskbot <taskbot@users.noreply.github.com>

fix: Resolve tool names in optim.find_tool to match routing table (#3337

afb53d4

) * fix: Resolve tool names in optim.find_tool to match routing table

Fix unrecognized dotty names

3f3a011

Signed-off-by: nigel brown <nigel@stacklok.com>

demo scripts

74da6d8

Signed-off-by: nigel brown <nigel@stacklok.com>

therealnb marked this pull request as draft January 21, 2026 15:47

github-actions bot added the size/XL Extra large PR: 1000+ lines changed label Jan 21, 2026

github-actions bot requested changes Jan 21, 2026

View reviewed changes

therealnb mentioned this pull request Jan 21, 2026

Merge jerm/2026-01-13-optimizer-in-vmcp into main #3373

Open

therealnb force-pushed the jerm/2026-01-13-optimizer-in-vmcp branch from 1f6f22b to 91a210d Compare January 21, 2026 18:02

Updated after merge

a526f18

Signed-off-by: nigel brown <nigel@stacklok.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

demo scripts #3375

demo scripts #3375

therealnb commented Jan 21, 2026

Uh oh!

github-actions bot left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

demo scripts #3375

Are you sure you want to change the base?

demo scripts #3375

Conversation

therealnb commented Jan 21, 2026

Uh oh!

github-actions bot left a comment

Choose a reason for hiding this comment

Large PR Detected

How to unblock this PR:

Alternative:

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants