@kurapov-peter

Putting up this rough draft for early feedback and questions. I'm putting together tests to run an e2e Llama 3.1 going through linalg on tensors. The goal is to generate clean linalg that is optimization friendly. At the moment there are only functional blocks and pieces, and they are just smoke-tested. These include naive implementations of rotary embeddings, feed forward, RMS norm, and a number of other small snippets that are useful for implementing the model. These are already enough to put an attention block together. It'd be nice to test it against the original implementation, but that would require fairscale as a dependency. For now I only added pytest and kept the pipeline as simple as possible. I also reused the example with the schedule, so it is now part of every test.
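For reviewers who want a dependency-free baseline to compare the smoke-tested blocks against, here is a minimal sketch of RMS norm and rotary embeddings in plain Python. It is not the code in this PR: the names `rms_norm`, `rotary_embed`, and `allclose`, the `eps` default, and the `theta` base are all assumptions for illustration, and the rotary part omits the frequency scaling that Llama 3.1 applies.

```python
import math


def rms_norm(x, weight, eps=1e-5):
    """Reference RMS norm over one vector: x / sqrt(mean(x^2) + eps) * weight."""
    mean_sq = sum(v * v for v in x) / len(x)
    inv_rms = 1.0 / math.sqrt(mean_sq + eps)
    return [v * inv_rms * w for v, w in zip(x, weight)]


def rotary_embed(x, pos, theta=10000.0):
    """Rotate adjacent pairs (x[i], x[i+1]) by an angle that depends on pos.

    theta is an assumed base; pair k uses the frequency theta ** (-2k / d).
    """
    d = len(x)
    out = []
    for i in range(0, d, 2):
        angle = pos * theta ** (-i / d)  # i == 2k for pair index k
        c, s = math.cos(angle), math.sin(angle)
        out += [x[i] * c - x[i + 1] * s, x[i] * s + x[i + 1] * c]
    return out


def allclose(actual, expected, rtol=1e-5, atol=1e-6):
    """Element-wise comparison with tolerances, for use in pytest asserts."""
    return len(actual) == len(expected) and all(
        math.isclose(a, e, rel_tol=rtol, abs_tol=atol)
        for a, e in zip(actual, expected)
    )
```

A pytest case could then flatten the jitted output to a list and check it with `assert allclose(result, rms_norm(x, w))`, which avoids pulling in fairscale just for a numerical comparison.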

adam-smnk and others added 7 commits November 14, 2025 14:34
Adds a simple end-to-end example demonstrating programmatic transform
schedule creation, MLIR JIT compilation, execution, and numerical
verification of the result.

Additionally, a 'utils' submodule is added with basic tools to simplify
the creation of ctypes arguments in the format accepted by the jitted function.
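
To make the "ctypes arguments" part concrete, here is a rough sketch of the ranked memref descriptor layout that MLIR-compiled functions expect (allocated pointer, aligned pointer, offset, sizes, strides), built with the standard `ctypes` module only. The helper names `make_memref_descriptor` and `to_memref` are hypothetical and are not the API of the 'utils' submodule; the sketch also assumes a contiguous row-major `f32` buffer.

```python
import ctypes


def make_memref_descriptor(rank, elem_type=ctypes.c_float):
    """Build a ctypes struct type mirroring a ranked memref descriptor."""

    class MemRefDescriptor(ctypes.Structure):
        _fields_ = [
            ("allocated", ctypes.POINTER(elem_type)),
            ("aligned", ctypes.POINTER(elem_type)),
            ("offset", ctypes.c_int64),
            ("sizes", ctypes.c_int64 * rank),
            ("strides", ctypes.c_int64 * rank),
        ]

    return MemRefDescriptor


def to_memref(values, shape):
    """Pack a flat list of floats into a row-major f32 memref descriptor.

    Returns (descriptor, buffer); the caller must keep the buffer alive
    for as long as the descriptor is in use.
    """
    count = 1
    for dim in shape:
        count *= dim
    if len(values) != count:
        raise ValueError("number of values does not match shape")

    buf = (ctypes.c_float * count)(*values)
    ptr = ctypes.cast(buf, ctypes.POINTER(ctypes.c_float))

    desc = make_memref_descriptor(len(shape))()
    desc.allocated = ptr
    desc.aligned = ptr
    desc.offset = 0

    # Strides are counted in elements, not bytes; innermost dimension is 1.
    running = 1
    for i in reversed(range(len(shape))):
        desc.sizes[i] = shape[i]
        desc.strides[i] = running
        running *= shape[i]
    return desc, buf
```

For example, `to_memref([1, 2, 3, 4, 5, 6], (2, 3))` yields sizes `[2, 3]` and strides `[3, 1]`. How the descriptor is then passed to the jitted function depends on the calling convention used, which this sketch does not cover.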
@rengolin
Member

Should this be in examples?

@kurapov-peter
Author

The e2e should be, yup, but this is mostly tests and getters.

3 participants