Fine-tune large language models (LLMs) such as Mistral or LLaMA efficiently on custom instruction datasets using Unsloth and Hugging Face Transformers, then export to GGUF for fast inference via `llama.cpp`.
This project demonstrates how to fine-tune an LLM on a custom instruction-based dataset using:
- 🧠 Unsloth for memory-efficient fine-tuning (see the setup sketch below)
- 🔧 TRL's `SFTTrainer` from Hugging Face for supervised training
- 💾 GGUF export for inference-ready deployment (supports `llama.cpp`, `llamafile`, etc.)
- 📊 Optional W&B tracking for experiment visualization
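The typical entry point is Unsloth's `FastLanguageModel`. Below is a minimal setup sketch; the checkpoint name and LoRA hyperparameters are illustrative assumptions, not values fixed by this project:

```python
from unsloth import FastLanguageModel

# Load a 4-bit quantized base checkpoint (name is an example; any
# Unsloth-supported Mistral/LLaMA variant works)
model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="unsloth/mistral-7b-bnb-4bit",
    max_seq_length=2048,
    load_in_4bit=True,
)

# Attach LoRA adapters so only a small fraction of the weights are trained
model = FastLanguageModel.get_peft_model(
    model,
    r=16,              # LoRA rank: an assumed value, tune for your task
    lora_alpha=16,
    lora_dropout=0.0,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj",
                    "gate_proj", "up_proj", "down_proj"],
)
```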
| Component | Tool/Library |
|---|---|
| Model | Mistral / LLaMA |
| Trainer | TRL's `SFTTrainer` |
| Optimization | AdamW (8-bit), LR schedulers |
| Quantization | 8-bit / 4-bit via GGUF |
| Logging | Weights & Biases (optional) |
| Hardware Target | Colab / Kaggle GPU |
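The shape of the instruction dataset is up to you; a common convention is to flatten each record into a single `text` column with an Alpaca-style template. A sketch, assuming a hypothetical `instructions.jsonl` with `instruction`/`input`/`output` fields:

```python
from datasets import load_dataset

dataset = load_dataset("json", data_files="instructions.jsonl", split="train")

PROMPT = """### Instruction:
{instruction}

### Input:
{input}

### Response:
{output}"""

def to_text(example):
    # Flatten each record into the single "text" field the trainer consumes
    return {"text": PROMPT.format(**example)}

dataset = dataset.map(to_text)
```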
- 🧠 Fine-tunes Mistral or LLaMA models with minimal VRAM requirements
- 🔧 Supports instruction tuning for domain-specific and structured tasks
- ⚡ Trains with an 8-bit optimizer via `bitsandbytes` for faster and lighter execution
- 📦 Exports the final model in GGUF format, compatible with `llama.cpp`, `llamafile`, etc. (see the export sketch after this list)
- 🎯 Runs on Kaggle, Google Colab, or custom local GPU environments
- 📊 Optional Weights & Biases (W&B) logging for real-time experiment tracking
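For the GGUF step, Unsloth provides a one-call export helper. A sketch, assuming the fine-tuned `model` and `tokenizer` from the setup snippet above (quantization method strings follow llama.cpp naming):

```python
# After training: merge the LoRA weights and write a GGUF file
# "q8_0" = 8-bit; "q4_k_m" is a common smaller 4-bit alternative
model.save_pretrained_gguf("gguf_model", tokenizer, quantization_method="q8_0")
```

The resulting `.gguf` file in `gguf_model/` can then be loaded by `llama.cpp`, `llamafile`, or any other GGUF-aware runtime.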
| Hyperparameter | Value |
|---|---|
| Epochs | 2–3 |
| Batch Size | 2 (gradient accumulation = 8) |
| Max Steps | 100 |
| Learning Rate | 2e-4 |
| Optimizer | AdamW (8-bit) |
| Precision | fp16 / bf16 |
| Quantization | GGUF export (8-bit) |
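These values map directly onto the trainer configuration. A sketch reusing `model`, `tokenizer`, and `dataset` from the snippets above; exact argument names vary across `trl` versions (newer releases move some of these into `SFTConfig`):

```python
from transformers import TrainingArguments
from trl import SFTTrainer

trainer = SFTTrainer(
    model=model,
    tokenizer=tokenizer,
    train_dataset=dataset,
    dataset_text_field="text",           # column produced by the formatting step
    max_seq_length=2048,
    args=TrainingArguments(
        num_train_epochs=3,              # 2-3 passes; max_steps caps the run first
        per_device_train_batch_size=2,
        gradient_accumulation_steps=8,   # effective batch size of 16
        max_steps=100,
        learning_rate=2e-4,
        lr_scheduler_type="linear",
        optim="adamw_8bit",              # 8-bit AdamW from bitsandbytes
        fp16=True,                       # or bf16=True on GPUs that support it
        logging_steps=10,
        output_dir="outputs",
        report_to="wandb",               # set to "none" to skip W&B logging
    ),
)
trainer.train()
```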