Granite 3.0 models Collection A series of language models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 8 items • Updated about 17 hours ago • 50
ApolloMoE & Apollo2 Collection English, Chinese, French, Hindi, Spanish, Arabic, Russian, Japanese, Korean, German, Italian, Portuguese and 38 Minor Languages • 7 items • Updated 8 days ago • 2
LoLCATS Collection Linearizing LLMs with high quality and efficiency. We linearize the full Llama 3.1 model family -- 8b, 70b, 405b -- for the first time! • 4 items • Updated 8 days ago • 12
Qwen2 Collection Qwen2 language models, instruction-tuned models of 3 sizes: 0.5B, 1.5B, 7B. • 3 items • Updated Jun 13 • 1
Arctic Collection A collection of pre-trained dense-MoE Hybrid transformer models • 2 items • Updated Apr 24 • 23
Under 10b iq4_nl gguf Collection Under 10B GGUFs Non-linear Quantized to try on 4GiB VRAM. Leaderboard https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard • 41 items • Updated Sep 15 • 2
Molmo Collection Artifacts for open multimodal language models. • 5 items • Updated 27 days ago • 257
Moshi v0.1 Release Collection MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated Sep 18 • 211
DataGemma Release Collection A series of pioneering open models that help ground LLMs in real-world data through Data Commons. • 2 items • Updated Sep 12 • 76
view article Article Introducing RWKV — An RNN with the advantages of a transformer May 15, 2023 • 12
view article Article Accelerating SD Turbo and SDXL Turbo Inference with ONNX Runtime and Olive Jan 15 • 4
view article Article Open-sourcing Knowledge Distillation Code and Weights of SD-Small and SD-Tiny Aug 1, 2023 • 2
occiglot-eu5-7b-v0.1 Collection First release of 7B LLMs models for the 5 biggest European languages. All models initialised from mistral-7b-v0.1. • 10 items • Updated Mar 7 • 21
Jamba-1.5 Collection The AI21 Jamba family of models are state-of-the-art, hybrid SSM-Transformer instruction following foundation models • 2 items • Updated Aug 22 • 80
🦅 🐍 FalconMamba 7B Collection This collection features the FalconMamba 7B base model, the instruction-tuned version, their 4-bit and GGUF variants, and the demo. • 15 items • Updated 12 days ago • 26
XGen-MM-1 models and datasets Collection A collection of all XGen-MM (Foundation LMM) models! • 14 items • Updated 14 days ago • 34
view article Article Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging By akjindal53244 • Aug 19 • 73
💻 Local SmolLMs Collection SmolLM models in MLC, ONNX and GGUF format for local applications + in-browser demos • 14 items • Updated Aug 20 • 41
Minitron Collection A family of compressed models obtained via pruning and knowledge distillation • 9 items • Updated 19 days ago • 57
Parler-TTS: fully open-source high-quality TTS Collection If you want to find out more about how these models were trained and even fine-tune them yourself, check-out the Parler-TTS repository on GitHub. • 7 items • Updated Aug 8 • 44
Quantized models Collection Select models helpfully quantized by others as well as myself • 58 items • Updated Jul 28 • 2
INDUS: Effective and Efficient Language Models for Scientific Applications Paper • 2405.10725 • Published May 17 • 32
LLM Compiler Collection Meta LLM Compiler is a state-of-the-art LLM that builds upon Code Llama with improved performance for code optimization and compiler reasoning. • 4 items • Updated Jun 27 • 147