You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
# Phi-4-multimodal-instruct training config for SFT finetuning.
2
+
#
3
+
# Phi-4-multimodal-instruct is a multimodal model that combines text, visual, and audio # inputs. It uses a "Mixture of LoRAs" approach, allowing you to plug in adapters for
4
+
# each modality without needing to retrain the base model.
5
+
#
6
+
# Important Note: Oumi has currently integrated and fully tested Phi-4 for vision and
7
+
# text modalities only (!).
8
+
#
9
+
# Requirements:
10
+
# - Log into WandB (`wandb login`) or disable `enable_wandb`
11
+
# - Run `pip install -U flash-attn --no-build-isolation`
0 commit comments