Leng Sicong's picture

Leng Sicong

Sicong

·

AI & ML interests

None yet

Organizations

Sicong's activity

upvoted a paper 6 days ago

The Curse of Multi-Modalities: Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio

Paper • 2410.12787 • Published 6 days ago • 28

upvoted a paper 19 days ago

Video Instruction Tuning With Synthetic Data

Paper • 2410.02713 • Published 19 days ago • 33

upvoted a paper 3 months ago

SeaLLMs 3: Open Foundation and Chat Multilingual Large Language Models for Southeast Asian Languages

Paper • 2407.19672 • Published Jul 29 • 54

upvoted a paper 4 months ago

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

Paper • 2407.03320 • Published Jul 3 • 92

upvoted a collection 4 months ago

VideoLLaMA 2

Optimized VideoLLaMA with improved spatial-temporal modeling and better audio understanding capability • 14 items • Updated about 10 hours ago • 18

upvoted a paper 4 months ago

VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs

Paper • 2406.07476 • Published Jun 11 • 32

upvoted 2 papers 10 months ago

Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models

Paper • 2401.01335 • Published Jan 2 • 64

COSMO: COntrastive Streamlined MultimOdal Model with Interleaved Pre-Training

Paper • 2401.00849 • Published Jan 1 • 14

upvoted a paper 11 months ago

Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding

Paper • 2311.16922 • Published Nov 28, 2023 • 1