DAMO-NLP-SG 's Collections

VideoLLaMA 2

Optimized VideoLLaMA with improved spatial-temporal modeling and better audio understanding capability