Submitted by akhaliq 14 Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning · 27 authors 1
Submitted by akhaliq 10 Matcha-TTS: A fast TTS architecture with conditional flow matching · 5 authors
Submitted by akhaliq 8 Physically Grounded Vision-Language Models for Robotic Manipulation · 8 authors 1
Submitted by akhaliq 6 Bayes' Rays: Uncertainty Quantification for Neural Radiance Fields · 5 authors