Adina Yakefu

AdinaY

AI & ML interests

None yet

Articles

Organizations

AdinaY's activity

upvoted an article about 2 hours ago

Article

🧨 Diffusers welcomes Stable Diffusion 3.5 Large

about 18 hours ago

• 16

upvoted an article about 4 hours ago

Article

Deploying Speech-to-Speech on Hugging Face

22 days ago

• 13

upvoted 2 papers about 6 hours ago

Zero-shot Model-based Reinforcement Learning using Large Language Models

Paper • 2410.11711 • Published 7 days ago • 6

CompassJudger-1: All-in-one Judge Model Helps Model Evaluation and Evolution

Paper • 2410.16256 • Published about 24 hours ago • 51

upvoted a collection about 8 hours ago

Pangea

Collection

A Fully Open Multilingual Multimodal LLM for 39 Languages • 8 items • Updated about 14 hours ago • 4

upvoted a paper about 8 hours ago

Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages

Paper • 2410.16153 • Published 1 day ago • 25

upvoted a paper about 9 hours ago

AutoTrain: No-code training for state-of-the-art models

Paper • 2410.15735 • Published 1 day ago • 38

upvoted 2 papers 1 day ago

Large Language Models as Markov Chains

Paper • 2410.02724 • Published 19 days ago • 31

DPLM-2: A Multimodal Diffusion Protein Language Model

Paper • 2410.13782 • Published 5 days ago • 18

upvoted 5 papers 4 days ago

WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines

Paper • 2410.12705 • Published 6 days ago • 24

Mini-Omni2: Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities

Paper • 2410.11190 • Published 8 days ago • 17

Can MLLMs Understand the Deep Implication Behind Chinese Images?

Paper • 2410.13854 • Published 5 days ago • 7

Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation

Paper • 2410.13848 • Published 5 days ago • 27

VidEgoThink: Assessing Egocentric Video Understanding Capabilities for Embodied AI

Paper • 2410.11623 • Published 7 days ago • 45

upvoted 2 papers 7 days ago

MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks

Paper • 2410.10563 • Published 8 days ago • 34

LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models

Paper • 2410.09732 • Published 10 days ago • 53

upvoted 2 collections 8 days ago

Ovis1.6

Collection

With just 10B parameters, Ovis1.6-Gemma2-9B leads the OpenCompass benchmark among open-source MLLMs within 30B parameters. • 2 items • Updated 6 days ago • 2

🔊 Audio Models 音频模型

Collection

12 items • Updated 1 day ago • 2

upvoted 3 papers 8 days ago

F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching

Paper • 2410.06885 • Published 13 days ago • 33

Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis

Paper • 2410.08261 • Published 12 days ago • 45

Baichuan-Omni Technical Report

Paper • 2410.08565 • Published 11 days ago • 80

upvoted 2 collections 9 days ago

🖼️ MLLMs 多模态模型

Collection

30 items • Updated 11 days ago • 5

🏆 Leaderboards & Arenas 排行榜和评测基准

Collection

18 items • Updated about 9 hours ago • 5

upvoted a paper 11 days ago

Aria: An Open Multimodal Native Mixture-of-Experts Model

Paper • 2410.05993 • Published 14 days ago • 104

upvoted a collection 11 days ago

MathCoder2

Collection

8 items • Updated 7 days ago • 3

upvoted an article 11 days ago

Article

Welcome, Gradio 5

14 days ago

• 59

upvoted a paper 11 days ago

MathCoder2: Better Math Reasoning from Continued Pretraining on Model-translated Mathematical Code

Paper • 2410.08196 • Published 12 days ago • 44

upvoted a paper 14 days ago

Differential Transformer

Paper • 2410.05258 • Published 15 days ago • 159

upvoted 2 papers 15 days ago

General Preference Modeling with Preference Representations for Aligning Language Models

Paper • 2410.02197 • Published 20 days ago • 6

Addition is All You Need for Energy-efficient Language Models

Paper • 2410.00907 • Published 21 days ago • 138

upvoted an article 18 days ago

Article

Improving Parquet Dedupe on Hugging Face Hub

18 days ago

• 27

upvoted a collection 20 days ago

📑Trending Papers - September 9⃣️

Collection

10 items • Updated 20 days ago • 8

upvoted an article 21 days ago

Article

Does Daily Software Engineering Work Need Reasoning Models?

•

29 days ago

• 5

upvoted a collection 22 days ago

Emu3

Collection

4 items • Updated about 6 hours ago • 60

upvoted a paper 22 days ago

Emu3: Next-Token Prediction is All You Need

Paper • 2409.18869 • Published 25 days ago • 84

upvoted 2 papers 26 days ago

OmniBench: Towards The Future of Universal Omni-Language Models

Paper • 2409.15272 • Published 29 days ago • 25

Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale

Paper • 2409.17115 • Published 27 days ago • 59

upvoted a paper 27 days ago

HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models

Paper • 2409.16191 • Published 28 days ago • 41

upvoted an article 27 days ago

Article

Introducing the SQL Console on Datasets

Sep 17

• 18

upvoted an article 29 days ago

Article

Exploring the Daily Papers Page on Hugging Face

30 days ago

• 37

upvoted a paper 29 days ago

Prithvi WxC: Foundation Model for Weather and Climate

Paper • 2409.13598 • Published Sep 20 • 35

upvoted a paper about 1 month ago

ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis

Paper • 2409.02048 • Published Sep 3 • 1

upvoted a collection about 1 month ago

Oryx

Collection

Oryx: One Multi-Modal LLM for On-Demand Spatial-Temporal Understanding • 7 items • Updated about 2 hours ago • 11

upvoted a paper about 1 month ago

InfiMM-WebMath-40B: Advancing Multimodal Pre-Training for Enhanced Mathematical Reasoning

Paper • 2409.12568 • Published Sep 19 • 46

upvoted a collection about 1 month ago

jina-embeddings-v3

Collection

Multilingual multi-task general text embedding model • 6 items • Updated Sep 19 • 14

upvoted 2 papers about 1 month ago

Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution

Paper • 2409.12191 • Published Sep 18 • 72

Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published Sep 18 • 123

upvoted 2 collections about 1 month ago

Moshi v0.1 Release

Collection

MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated Sep 18 • 211

Qwen2.5

Collection

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated Sep 18 • 268

upvoted a paper about 1 month ago

Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders

Paper • 2408.15998 • Published Aug 28 • 83

upvoted an article about 1 month ago

Article

The 5 Most Under-Rated Tools on Hugging Face

Aug 22

• 84

upvoted 8 papers about 1 month ago

NVLM: Open Frontier-Class Multimodal LLMs

Paper • 2409.11402 • Published Sep 17 • 69

OmniGen: Unified Image Generation

Paper • 2409.11340 • Published Sep 17 • 82

DSBench: How Far Are Data Science Agents to Becoming Data Science Experts?

Paper • 2409.07703 • Published Sep 12 • 66

Seed-Music: A Unified Framework for High Quality and Controlled Music Generation

Paper • 2409.09214 • Published Sep 13 • 45

Robust Dual Gaussian Splatting for Immersive Human-centric Volumetric Videos

Paper • 2409.08353 • Published Sep 12 • 10

General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Paper • 2409.01704 • Published Sep 3 • 80

IFAdapter: Instance Feature Control for Grounded Text-to-Image Generation

Paper • 2409.08240 • Published Sep 12 • 15

Insights from Benchmarking Frontier Language Models on Web App Code Generation

Paper • 2409.05177 • Published Sep 8 • 5

upvoted an article about 1 month ago

Article

All LLMs Write Great Code, But Some Make (A Lot) Fewer Mistakes

•

Sep 12

• 4

Adina Yakefu

AI & ML interests

Articles

A Short Summary of Chinese AI Global Expansion

A Short Summary of Chinese AI Global Expansion

Exploring the Daily Papers Page on Hugging Face

Organizations

AdinaY's activity

🧨 Diffusers welcomes Stable Diffusion 3.5 Large

Deploying Speech-to-Speech on Hugging Face

Welcome, Gradio 5

Improving Parquet Dedupe on Hugging Face Hub

Does Daily Software Engineering Work Need Reasoning Models?

Introducing the SQL Console on Datasets

Exploring the Daily Papers Page on Hugging Face

The 5 Most Under-Rated Tools on Hugging Face

All LLMs Write Great Code, But Some Make (A Lot) Fewer Mistakes