Add Zagros and ZagrosNext model architectures to Transformers #41135
Conversation
[For maintainers] Suggested jobs to run (before merge): run-slow: auto, zagros, zagros_next
hi @ZagrosLLMModel, we generally don't accept PRs for new architectures without significant pre-trained checkpoints! Are there any pre-trained Zagros models available?
Hi, yes sir, we have:
What does this PR do?
This PR introduces two new model architectures, Zagros and ZagrosNext, to the Transformers library. The implementation includes:
Model architecture files (modeling_zagros.py and modeling_zagros_next.py) in src/transformers/models/zagros/ and src/transformers/models/zagros_next/.
Configuration files (configuration_zagros.py and configuration_zagros_next.py) for both models.
Comprehensive tests for both models in tests/models/zagros/ and tests/models/zagros_next/.
Updated documentation in docs/source/en/ with usage examples and details for Zagros and ZagrosNext.
Both models follow the standard structure and conventions of existing Transformers models, ensuring compatibility with the library's pipelines and utilities.
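To illustrate the intended integration, here is a minimal usage sketch. It assumes the PR exports `ZagrosConfig` and `ZagrosModel` under the usual Transformers naming conventions and that the configuration exposes standard fields such as `vocab_size` and `hidden_size`; the sizes below are illustrative, not the model's actual defaults.

```python
# Minimal sketch, assuming this PR exports ZagrosConfig and ZagrosModel
# following standard Transformers conventions. Sizes are illustrative only.
import torch
from transformers import ZagrosConfig, ZagrosModel

config = ZagrosConfig(hidden_size=256, num_hidden_layers=2)  # toy sizes for a quick smoke test
model = ZagrosModel(config)  # randomly initialized; no pretrained weights loaded

input_ids = torch.randint(0, config.vocab_size, (1, 8))  # dummy batch of 8 token ids
with torch.no_grad():
    outputs = model(input_ids=input_ids)

# Standard base models return last_hidden_state of shape (batch, seq_len, hidden_size)
print(outputs.last_hidden_state.shape)
```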
Motivation and Context
The Zagros and ZagrosNext models are designed to [briefly describe the purpose, e.g., "enhance performance on specific NLP tasks with novel architectural improvements"].
These models leverage standard Transformer conventions, making them easy to integrate into existing workflows.
The implementation is fully compatible with the Transformers library and supports all standard functionalities (e.g., training, inference, and pipeline integration).
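As a sketch of the claimed pipeline integration, text generation could look like the following. The checkpoint id is a placeholder rather than a published model, and it assumes a `ZagrosForCausalLM` class is registered with the auto classes so the pipeline can resolve it.

```python
# Hypothetical pipeline sketch: "ZagrosLLMModel/zagros-base" is a placeholder
# repo id, and ZagrosForCausalLM is assumed to be registered with
# AutoModelForCausalLM so the text-generation pipeline can resolve it.
from transformers import pipeline

generator = pipeline("text-generation", model="ZagrosLLMModel/zagros-base")
print(generator("The Zagros mountains are", max_new_tokens=20)[0]["generated_text"])
```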
Dependencies
No additional dependencies are required beyond the standard Transformers setup.
Tested with Python 3.9+ and PyTorch 2.0+.
Before submitting
This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline?
Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
Did you write any new necessary tests?
Who can review?
@ArthurZucker (for text models)
@Rocketknight1 (for pipelines and library compatibility)
@stevhliu (for documentation)
Thank you for reviewing!