Skip to content

Conversation

@jerm-dro
Copy link
Contributor

No description provided.

ptelang and others added 29 commits January 20, 2026 12:53
Introduces a new MCPEmbedding custom resource to deploy HuggingFace
embedding models as MCP servers in Kubernetes. This enables semantic
search and similarity features for MCP tools and resources.

Key Features:
- Custom resource definition for embedding model deployments
- Integration with HuggingFace text-embeddings-inference
- Support for model caching via PersistentVolumeClaims
- Flexible resource configuration and pod customization
- GroupRef support for organizational grouping
- Comprehensive status conditions and phase tracking

Components:
- MCPEmbedding CRD with validation and webhook support
- Controller for managing deployment lifecycle
- Generated CRD manifests and Helm chart templates
- RBAC permissions for managing embeddings
- Example configurations for various use cases

This change is based on the original commit by rebasing onto
jerm/2026-01-13-optimizer-in-vmcp to remove intermediate commits.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
… support

This refactors the EmbeddingServer controller with the following changes:

- Convert from Deployment to StatefulSet for better persistent storage support
- Add HFTokenSecretRef field for secure HuggingFace token injection from Kubernetes secrets
- Use StatefulSet volumeClaimTemplates for model cache PVCs instead of separate PVC creation
- Remove Env field from EmbeddingDeploymentOverrides API
- Add comprehensive controller unit tests

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
Regenerate Helm chart READMEs with helm-docs to reflect version 0.5.26
and fix table formatting.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
Signed-off-by: Jeremy Drouillard <jeremy@stacklok.com>
@github-actions github-actions bot added the size/XL Extra large PR: 1000+ lines changed label Jan 22, 2026
Copy link
Contributor

@github-actions github-actions bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Large PR Detected

This PR exceeds 1000 lines of changes and requires justification before it can be reviewed.

How to unblock this PR:

Add a section to your PR description with the following format:

## Large PR Justification

[Explain why this PR must be large, such as:]
- Generated code that cannot be split
- Large refactoring that must be atomic
- Multiple related changes that would break if separated
- Migration or data transformation

Alternative:

Consider splitting this PR into smaller, focused changes (< 1000 lines each) for easier review and reduced risk.

See our Contributing Guidelines for more details.


This review will be automatically dismissed once you add the justification section.

Signed-off-by: Jeremy Drouillard <jeremy@stacklok.com>
@github-actions github-actions bot added size/XL Extra large PR: 1000+ lines changed and removed size/XL Extra large PR: 1000+ lines changed labels Jan 22, 2026
@codecov
Copy link

codecov bot commented Jan 22, 2026

Codecov Report

❌ Patch coverage is 44.37596% with 361 lines in your changes missing coverage. Please review.
✅ Project coverage is 64.40%. Comparing base (546f5cb) to head (3bb35de).
⚠️ Report is 3 commits behind head on main.

Files with missing lines Patch % Lines
...operator/controllers/embeddingserver_controller.go 42.97% 308 Missing and 45 partials ⚠️
cmd/thv-operator/main.go 0.00% 8 Missing ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##             main    #3393      +/-   ##
==========================================
- Coverage   64.82%   64.40%   -0.43%     
==========================================
  Files         375      377       +2     
  Lines       36626    37275     +649     
==========================================
+ Hits        23744    24006     +262     
- Misses      11011    11352     +341     
- Partials     1871     1917      +46     

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

size/XL Extra large PR: 1000+ lines changed

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants