Submitted by zsytony 41 HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models · 14 authors 4
Submitted by akhaliq 32 MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling · 4 authors 2
Submitted by yizhilll 25 OmniBench: Towards The Future of Universal Omni-Language Models · 20 authors 2
Submitted by WenhaoWang 17 MonoFormer: One Transformer for Both Diffusion and Autoregression · 8 authors 4
Submitted by mhamilton723 15 Seeing Faces in Things: A Model and Dataset for Pareidolia · 7 authors 2
Submitted by akhaliq 11 Time-MoE: Billion-Scale Time Series Foundation Models with Mixture of Experts · 7 authors 2
Submitted by akhaliq 6 Gen2Act: Human Video Generation in Novel Scenarios enables Generalizable Robot Manipulation · 10 authors 2