Skip to content

DeepSeek v3 Architecture Adapter#1240

Open
jlarson4 wants to merge 2 commits intodev-3.x-canaryfrom
feature/deepseek-architecture-adapter
Open

DeepSeek v3 Architecture Adapter#1240
jlarson4 wants to merge 2 commits intodev-3.x-canaryfrom
feature/deepseek-architecture-adapter

Conversation

@jlarson4
Copy link
Copy Markdown
Collaborator

@jlarson4 jlarson4 commented Apr 8, 2026

Description

  • MLA Attention Bridge handling
  • DeepSeek V3 adapter
  • Added adapter to factory
  • Scan of HuggingFace for DeepSeek models
  • Testing of a handful of small test models to do as much accuracy testing as possible on a local device.

Type of change

  • New feature (non-breaking change which adds functionality)

Checklist:

  • I have commented my code, particularly in hard-to-understand areas
  • I have made corresponding changes to the documentation
  • My changes generate no new warnings
  • I have added tests that prove my fix is effective or that my feature works
  • New and existing unit tests pass locally with my changes
  • I have not rewritten tests relating to key interfaces which would affect backward compatibility

@jlarson4 jlarson4 changed the title Feature/deepseek architecture adapter DeepSeek v3 Architecture Adapter Apr 8, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant