Change the repository type filter
All
Repositories list
53 repositories
QuickCodec
PublicVisCoder2
Public- Quick Long Video Understanding
VideoScore2
PublicHierarchical-Reasoner
PublicVLM2Vec
PublicThis repo contains the code for "VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks" [ICLR 2025]ImagenHub
PublicA one-stop library to standardize the inference and evaluation of all the conditional image generation models. [ICLR 2024]VideoEval-Pro
PublicMore reliable Video Understanding EvaluationStructEval
PublicVisCoder
PublicPixelWorld
PublicOne-Shot-CFT
PublicVisualWebInstruct
PublicABC
PublicABC: Achieving Better Control of Multimodal Embeddings using VLMs [TMLR2025]Vamba
Public- Official Repo for "TheoremExplainAgent: Towards Video-based Multimodal Explanations for LLM Theorem Understanding" [ACL 2025 oral]
CritiqueFineTuning
PublicCode for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate" [COLM 2025]ScholarCopilot
PublicMEGA-Bench
PublicThis repo contains the code for "MEGA-Bench Scaling Multimodal Evaluation to over 500 Real-World Tasks" [ICLR 2025]DisProtEdit
PublicVL-Rethinker
PublicAceCoder
PublicMantis
Public