Submitted by akhaliq 51 LongVILA: Scaling Long-Context Visual Language Models for Long Videos · 18 authors 3
Submitted by NCJ 32 MeshFormer: High-Quality Mesh Generation with 3D-Guided Reconstruction Model · 12 authors 3
Submitted by akhaliq 15 Factorized-Dreamer: Training A High-Quality Video Generator with Limited and Low-Quality Data · 6 authors 3
Submitted by akhaliq 12 SpaRP: Fast 3D Object Reconstruction and Pose Estimation from Sparse Views · 7 authors 2
Submitted by Study-is-happy 11 NeuFlow v2: High-Efficiency Optical Flow Estimation on Edge Devices · 4 authors 2
Submitted by akhaliq 9 Photorealistic Object Insertion with Diffusion-Guided Inverse Rendering · 7 authors 2
Submitted by canyuchen 9 Authorship Attribution in the Era of LLMs: Problems, Methodologies, and Challenges · 3 authors 2
Submitted by akhaliq 4 Cybench: A Framework for Evaluating Cybersecurity Capabilities and Risk of Language Models · 27 authors 2