Video-CoM: Interactive Video Reasoning via Chain of Manipulations
AI & ML interests
Natural Language Processing, Machine Learning, and Computer Vision
Recent Activity
Papers
Counting to Four is still a Chore for VLMs
Paper Circle: An Open-source Multi-agent Research Discovery and Analysis Framework
Organization Card
The Mohamed bin Zayed University of Artificial Intelligence (MBZUAI) in Abu Dhabi, is a graduate-level, research-based academic institution that offers specialized degree programs for local and international students in the field of Artificial Intelligence.
spaces 8
pinned
Running on Zero
Agents
1
MediX-R1 Medical AI Demo
🏥
Medical image analysis and chat with MediX-R1
Paused
Agents
Nomos ZeroGPU Inference
🚀
Sleeping
Agents
BioMediaAnnotator
🏃
BioMediaAnnotator
Build error
Agents
3
ArtstASR
💭
Build error
Agents
5
ArtstTTS
🔥
Running on A10G
Agents
33
LLaVA++ (LLaMA-3-V)
👁
Start a chatbot server for text-based interactions
models 105
MBZUAI/dialseg-ar-gemma3-4B
Text Generation • 4B • Updated • 1
MBZUAI/Video-CoM-SFT
8B • Updated • 12
MBZUAI/Video-CoM
8B • Updated • 14 • 1
MBZUAI/MedMO-4B-Next
Image-Text-to-Text • 4B • Updated • 5.89k • 3
MBZUAI/MedMO-4B
Image-Text-to-Text • 4B • Updated • 2.92k • 15
MBZUAI/MedMO-8B
Image-Text-to-Text • 9B • Updated • 4.01k • 10
MBZUAI/MedMO-8B-Next
Image-Text-to-Text • 9B • Updated • 6.52k • 13
MBZUAI/CoME-VL
Image-Text-to-Text • Updated • 43 • 3
MBZUAI/MediX-R1-30B
Image-Text-to-Text • 31B • Updated • 295 • 5
MBZUAI/MediX-R1-8B
Image-Text-to-Text • 9B • Updated • 184 • 4
datasets 52
MBZUAI/dialseg-ar
Viewer • Updated • 1.01k • 12 • 1
MBZUAI/longshot-bench
Viewer • Updated • 2.08k • 41 • 2
MBZUAI/ThinkGeo
Viewer • Updated • 311 • 177 • 4
MBZUAI/Dialectal-Arabic-MMLU
Viewer • Updated • 21.9k • 1.13k
MBZUAI/OpenEarthAgent
Viewer • Updated • 1.2k • 1.6k • 4
MBZUAI/medix-rl-data
Viewer • Updated • 53.8k • 607 • 6
MBZUAI/DeepfakeJudge-Dataset
Viewer • Updated • 2.05k • 47 • 4
MBZUAI/VCGBench-Diverse
Viewer • Updated • 4.35k • 96 • 3
MBZUAI/DuwatBench
Viewer • Updated • 1.27k • 395 • 4
MBZUAI/Video-R2-Dataset
Viewer • Updated • 15.3k • 93 • 2