Skip to content
View Ammar-Alnagar's full-sized avatar
🎰
Deciphering the GPU manuscript.....
🎰
Deciphering the GPU manuscript.....

Block or report Ammar-Alnagar

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Ammar-Alnagar/README.md

hey, i'm Ammar 👋

Staff GenAI Systems Engineer by day, M.Sc. AI student by night. I build production AI systems that scale and research the stuff that'll matter in two years.

what i do

I work at the intersection of AI research and real-world systems engineering. That means taking bleeding-edge models and making them actually work in production—fast, reliable, and without burning through compute budgets.

Day job: Architecting GenAI systems that handle real user traffic. Multi-agent orchestration, RAG pipelines that don't hallucinate, and LLM inference that doesn't cost a fortune.

Research: Currently obsessed with making LLMs smarter through better architectures. Created and Working on Helios Engine—a Rust framework for LLM agents that prioritizes speed and reliability. Also exploring graph-based RAG, advanced reranking, and RL for domain-specific models.

my stack

Languages: Rust (when performance matters), Python (for everything else)

AI/ML: Advanced RAG systems, multi-agent orchestration, model optimization, distributed training, RLHF pipelines

Production: End-to-end system design, efficient inference, model serving at scale

current experiments

  • Graph-based retrieval systems with dynamic reranking
  • Chain-of-thought reasoning architectures
  • Ultra-low-latency inference in Rust
  • Model distillation for specialized domains

philosophy

Good AI systems should be:

  • Fast — nobody wants to wait 30 seconds for a response
  • Accessible — open source beats paywalls
  • Practical — research is cool, but only if it ships

Everything I build is open source. Knowledge hoarding is how we end up with mediocre AI.

find me

  • 🤗 HuggingFace — models and experiments
  • 💼 LinkedIn — the professional version
  • 📫 Open to collaborations on open source AI infrastructure

Building the future of AI systems — one commit at a time.

Pinned Loading

  1. Rust-Coder-CLI Rust-Coder-CLI Public

    A powerful terminal-based coding assistant that combines the convenience of a modern TUI with the intelligence of large language models. Rust TUI Coder provides an interactive environment where you…

    Rust 13

  2. Helios-Engine Helios-Engine Public

    Helios Engine is a powerful and flexible Rust framework for building LLM-powered agents with tool support, chat capabilities, and easy configuration management. Create intelligent agents that can i…

    Rust 34 3

  3. Marla Marla Public

    This project implements an agentic pipeline using the Google Agent Development Kit (ADK) framework. It features a master agent that supervises and delegates tasks to a team of specialized agents, e…

    Python

  4. SLRAG-with-COT SLRAG-with-COT Public

    Self Learning RAG With COT is a project that implements Retrieval-Augmented Generation (RAG) combined with Chain of Thought (COT) reasoning. This project aims to enhance the performance of language…

    Python 17 3

  5. Forge-Fire Forge-Fire Public

    Forge is a fully offline, on-device GraphRAG system written in Rust. It ingests documents, builds a knowledge graph of entities and relationships, and answers queries using a local LLM.

    Rust

  6. unsloth.rs unsloth.rs Public

    This is a Rust rewrite of the Unsloth framework, a tool for fine-tuning large language models.

    Rust