LongVU - a Vision-CAIR Collection

Vision-CAIR 's Collections

LongVU

updated about 14 hours ago

Vision-CAIR/LongVU_Qwen2_7B

Video-Text-to-Text • 8B • Updated Feb 28, 2025 • 183 • 76
Vision-CAIR/LongVU_Llama3_2_3B

Video-Text-to-Text • Updated Feb 28, 2025 • 25 • 8
Vision-CAIR/LongVU_Llama3_2_3B_img

Updated Feb 28, 2025 • 3 • 6
Vision-CAIR/LongVU_Qwen2_7B_img

Updated Feb 28, 2025 • 7 • 5
Vision-CAIR/LongVU_Llama3_2_1B

Video-Text-to-Text • Updated Feb 28, 2025 • 30 • 12
Vision-CAIR/LongVU_Llama3_2_1B_img

Updated Oct 24, 2024
Running on Zero

88

LongVU

🌖

88

Generate responses to video or image inputs
LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding

Paper • 2410.17434 • Published Oct 22, 2024 • 27