NikolayKozloff (Nikolay Kozlov)

upvoted a collection 1 day ago

Granite 3.0 models

A series of language models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 8 items • Updated about 17 hours ago • 50

upvoted a collection 2 days ago

v4

Collection

18 items • Updated 3 days ago • 20

upvoted a collection 5 days ago

Arch-Function

Collection

6 items • Updated 14 days ago • 7

upvoted a collection 6 days ago

ApolloMoE & Apollo2

Collection

English, Chinese, French, Hindi, Spanish, Arabic, Russian, Japanese, Korean, German, Italian, Portuguese and 38 Minor Languages • 7 items • Updated 8 days ago • 2

upvoted a collection 7 days ago

LoLCATS

Collection

Linearizing LLMs with high quality and efficiency. We linearize the full Llama 3.1 model family -- 8b, 70b, 405b -- for the first time! • 4 items • Updated 8 days ago • 12

upvoted 3 collections 12 days ago

upvoted an article 19 days ago

Article

Introducing Würstchen: Fast Diffusion for Image Generation

Sep 13, 2023

• 11

upvoted a collection 22 days ago

Emu3

Collection

4 items • Updated about 6 hours ago • 60

upvoted a collection 23 days ago

Under 10b iq4_nl gguf

Collection

Under 10B GGUFs Non-linear Quantized to try on 4GiB VRAM. Leaderboard https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard • 41 items • Updated Sep 15 • 2

upvoted a collection 26 days ago

lmrs

Collection

Language models in the LMRS format. • 10 items • Updated 8 days ago • 2

upvoted a collection 27 days ago

Molmo

Collection

Artifacts for open multimodal language models. • 5 items • Updated 27 days ago • 257

upvoted a collection about 1 month ago

Moshi v0.1 Release

Collection

MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated Sep 18 • 211

upvoted an article about 1 month ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Sep 18

• 177

upvoted 3 collections about 1 month ago

Flow-Judge-v0.1

Collection

Flow-Judge-v0.1 models • 5 items • Updated Sep 17 • 16

DataGemma Release

Collection

A series of pioneering open models that help ground LLMs in real-world data through Data Commons. • 2 items • Updated Sep 12 • 76

v3

Collection

12 items • Updated Sep 13 • 6

upvoted a collection about 2 months ago

Yi-Coder

Collection

4 items • Updated Sep 4 • 29

upvoted an article about 2 months ago

Article

Introducing RWKV — An RNN with the advantages of a transformer

May 15, 2023

• 12

upvoted a collection about 2 months ago

Qwen2-VL

Collection

Vision-language model series based on Qwen2 • 15 items • Updated Sep 18 • 141

upvoted 2 articles about 2 months ago

Article

Accelerating SD Turbo and SDXL Turbo Inference with ONNX Runtime and Olive

Jan 15

• 4

Article

Open-sourcing Knowledge Distillation Code and Weights of SD-Small and SD-Tiny

Aug 1, 2023

• 2

upvoted 3 collections about 2 months ago

occiglot-eu5-7b-v0.1

Collection

First release of 7B LLMs models for the 5 biggest European languages. All models initialised from mistral-7b-v0.1. • 10 items • Updated Mar 7 • 21

CogVideo

Collection

7 items • Updated Sep 18 • 22

magnum-v2

Collection

12 items • Updated Aug 23 • 7

upvoted 3 collections 2 months ago

Jamba-1.5

Collection

The AI21 Jamba family of models are state-of-the-art, hybrid SSM-Transformer instruction following foundation models • 2 items • Updated Aug 22 • 80

🦅 🐍 FalconMamba 7B

Collection

This collection features the FalconMamba 7B base model, the instruction-tuned version, their 4-bit and GGUF variants, and the demo. • 15 items • Updated 12 days ago • 26

XGen-MM-1 models and datasets

Collection

A collection of all XGen-MM (Foundation LMM) models! • 14 items • Updated 14 days ago • 34

upvoted an article 2 months ago

Article

Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging

By

•

Aug 19

• 73

upvoted 5 collections 2 months ago

💻 Local SmolLMs

Collection

SmolLM models in MLC, ONNX and GGUF format for local applications + in-browser demos • 14 items • Updated Aug 20 • 41

Minitron

Collection

A family of compressed models obtained via pruning and knowledge distillation • 9 items • Updated 19 days ago • 57

Hermes 3

Collection

The Hermes 3 Series of Models • 8 items • Updated Aug 23 • 85

magnum-v2.5

Collection

3 items • Updated Aug 21 • 7

InternLM2.5

Collection

14 items • Updated Sep 14 • 68

upvoted an article 2 months ago

Article

Your AI, Everywhere

By

•

Aug 9

• 10

upvoted 2 collections 2 months ago

Qwen2-Audio

Collection

Audio-language model series based on Qwen2 • 4 items • Updated Sep 18 • 41

Parler-TTS: fully open-source high-quality TTS

Collection

If you want to find out more about how these models were trained and even fine-tune them yourself, check-out the Parler-TTS repository on GitHub. • 7 items • Updated Aug 8 • 44

upvoted an article 2 months ago

Article

XetHub is joining Hugging Face!

Aug 8

• 79

upvoted 2 collections 3 months ago

Gemma 2 2B Release

Collection

The 2.6B parameter version of Gemma 2. • 6 items • Updated Jul 31 • 76

Quantized models

Collection

Select models helpfully quantized by others as well as myself • 58 items • Updated Jul 28 • 2

upvoted an article 3 months ago

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16

• 248

upvoted a collection 3 months ago

H2O Danube3

Collection

6 items • Updated 5 days ago • 52

upvoted an article 4 months ago

Article

How to run Gemini Nano locally in your browser

By

•

Jul 11

• 42

upvoted a paper 4 months ago

INDUS: Effective and Efficient Language Models for Scientific Applications

Paper • 2405.10725 • Published May 17 • 32

upvoted a collection 4 months ago

LLM Compiler

Collection

Meta LLM Compiler is a state-of-the-art LLM that builds upon Code Llama with improved performance for code optimization and compiler reasoning. • 4 items • Updated Jun 27 • 147

upvoted a collection 6 months ago

LLava Llama 3 8B Quants

Collection

12 items • Updated Apr 22 • 4

upvoted an article 6 months ago

Article

Mergoo: Efficiently Build Your Own MoE LLM

By

•

Jun 3

• 40

Nikolay Kozlov

AI & ML interests

Organizations

NikolayKozloff's activity

Introducing Würstchen: Fast Diffusion for Image Generation

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Introducing RWKV — An RNN with the advantages of a transformer

Accelerating SD Turbo and SDXL Turbo Inference with ONNX Runtime and Olive

Open-sourcing Knowledge Distillation Code and Weights of SD-Small and SD-Tiny

Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging

Your AI, Everywhere

XetHub is joining Hugging Face!

SmolLM - blazingly fast and remarkably powerful

How to run Gemini Nano locally in your browser

Mergoo: Efficiently Build Your Own MoE LLM