tensorkelechi (kelechic)

upvoted an article about 10 hours ago

Article

Allegro: Advanced Video Generation Model

By

•

about 17 hours ago

• 49

upvoted 3 articles 1 day ago

Article

Stable Diffusion in JAX/Flax 🚀

Oct 13, 2022

• 2

Article

Speech Synthesis, Recognition, and More With SpeechT5

Feb 8, 2023

• 7

Article

Introducing Würstchen: Fast Diffusion for Image Generation

Sep 13, 2023

• 11

upvoted an article 5 days ago

Article

Understanding InstaFlow/Rectified Flow

By

•

Oct 6, 2023

• 14

upvoted a collection 10 days ago

CogVideo

Collection

7 items • Updated Sep 18 • 22

upvoted a paper 11 days ago

LEDITS: Real Image Editing with DDPM Inversion and Semantic Guidance

Paper • 2307.00522 • Published Jul 2, 2023 • 31

upvoted an article 13 days ago

Article

Recoloring photos with diffusers

By

•

13 days ago

• 27

upvoted a collection 13 days ago

Qwen2-VL

Collection

Vision-language model series based on Qwen2 • 15 items • Updated Sep 18 • 141

upvoted an article 13 days ago

Article

A Dive into Pretraining Strategies for Vision-Language Models

Feb 3, 2023

• 38

upvoted an article 14 days ago

Article

Fine-Tune Whisper with 🤗 Transformers

Nov 3, 2022

• 105

upvoted an article 16 days ago

Article

AudioLDM 2, but faster ⚡️

Aug 30, 2023

• 7

upvoted a collection 21 days ago

Free Music Archive

Collection

ISMIR's 2017 FMA Dataset, Optimized for 🤗 Datasets / 🥐 Croissant, with Clear Licensing • 4 items • Updated Sep 13 • 3

upvoted an article 24 days ago

Article

wHy DoNt YoU jUsT uSe ThE lLaMa ToKeNiZeR??

By

•

25 days ago

• 33

upvoted a paper 28 days ago

Adding Conditional Control to Text-to-Image Diffusion Models

Paper • 2302.05543 • Published Feb 10, 2023 • 37

upvoted an article about 1 month ago

Article

How to generate text: using different decoding methods for language generation with Transformers

Mar 1, 2020

• 103

upvoted a collection about 1 month ago

llm-uncensored

Collection

1 item • Updated Dec 19, 2023 • 1

upvoted an article about 1 month ago

Article

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

May 24, 2023

• 84

upvoted a collection about 1 month ago

LLMs [UNCENSORED]

Collection

7 items • Updated May 16 • 3

upvoted a paper about 1 month ago

Seed-Music: A Unified Framework for High Quality and Controlled Music Generation

Paper • 2409.09214 • Published Sep 13 • 45

upvoted an article about 1 month ago

Article

Using 🤗 to Train a GPT-2 Model for Music Generation

By

•

Oct 5, 2023

• 7

upvoted 3 collections about 1 month ago

upvoted a paper about 1 month ago

SpeechVerse: A Large-scale Generalizable Audio Language Model

Paper • 2405.08295 • Published May 14 • 14

upvoted an article about 2 months ago

Article

quanto: a pytorch quantization toolkit

Mar 18

• 28

upvoted a collection about 2 months ago

Quantized-Mistral

Collection

Quantized Mistral models in 2,4, and 8 bit versions • 4 items • Updated Aug 31 • 4

upvoted an article about 2 months ago

Article

Introduction to Quantization cooked in 🤗 with 💗🧑‍🍳

By

•

Aug 25, 2023

• 18

upvoted 2 papers about 2 months ago

FastVoiceGrad: One-step Diffusion-Based Voice Conversion with Adversarial Conditional Diffusion Distillation

Paper • 2409.02245 • Published Sep 3 • 9

FLUX that Plays Music

Paper • 2409.00587 • Published Sep 1 • 31

upvoted 2 articles about 2 months ago

Article

makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch

By

•

May 7

• 38

Article

Introducing AuraFace: Open-Source Face Recognition and Identity Preservation Models

By

•

Aug 26

• 35

upvoted a collection about 2 months ago

aaliyah

Collection

personal collection of convnet models and paper implementations for different applications. • 2 items • Updated Aug 25 • 1

upvoted an article 2 months ago

Article

Introduction to 3D Gaussian Splatting

Sep 18, 2023

• 29

upvoted 3 papers 3 months ago

Diffusion Models as Data Mining Tools

Paper • 2408.02752 • Published Jul 20 • 13

LLaVA-OneVision: Easy Visual Task Transfer

Paper • 2408.03326 • Published Aug 6 • 59

MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models

Paper • 2408.02718 • Published Aug 5 • 60

upvoted an article 3 months ago

Article

Open-sourcing Knowledge Distillation Code and Weights of SD-Small and SD-Tiny

Aug 1, 2023

• 2

upvoted a paper 3 months ago

LLM-AD: Large Language Model based Audio Description System

Paper • 2405.00983 • Published May 2 • 16

upvoted an article 3 months ago

Article

So WTF is an Audio Embedding Model?

By

•

May 30

• 6

upvoted 2 papers 3 months ago

Streetscapes: Large-scale Consistent Street View Generation Using Autoregressive Video Diffusion

Paper • 2407.13759 • Published Jul 18 • 17

Masked Generative Video-to-Audio Transformers with Enhanced Synchronicity

Paper • 2407.10387 • Published Jul 15 • 6

upvoted 4 articles 4 months ago

Article

Image-based search engine

By

•

Jul 4

• 23

Article

Image search with 🤗 datasets

Mar 16, 2022

• 5

Article

Train custom AI models with the trainer API and adapt them to 🤗

By

•

Jun 29

• 33

Article

The Annotated Diffusion Model

Jun 7, 2022

• 90

upvoted a paper 4 months ago

Efficient-VQGAN: Towards High-Resolution Image Generation with Efficient Vision Transformers

Paper • 2310.05400 • Published Oct 9, 2023 • 1

upvoted an article 5 months ago

Article

Uncensor any LLM with abliteration

By

•

Jun 13

• 350

kelechic

AI & ML interests

Organizations

tensorkelechi's activity

Allegro: Advanced Video Generation Model

Stable Diffusion in JAX/Flax 🚀

Speech Synthesis, Recognition, and More With SpeechT5

Introducing Würstchen: Fast Diffusion for Image Generation

Understanding InstaFlow/Rectified Flow

Recoloring photos with diffusers

A Dive into Pretraining Strategies for Vision-Language Models

Fine-Tune Whisper with 🤗 Transformers

AudioLDM 2, but faster ⚡️

wHy DoNt YoU jUsT uSe ThE lLaMa ToKeNiZeR??

How to generate text: using different decoding methods for language generation with Transformers

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

Using 🤗 to Train a GPT-2 Model for Music Generation

quanto: a pytorch quantization toolkit

Introduction to Quantization cooked in 🤗 with 💗🧑‍🍳

makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch

Introducing AuraFace: Open-Source Face Recognition and Identity Preservation Models

Introduction to 3D Gaussian Splatting

Open-sourcing Knowledge Distillation Code and Weights of SD-Small and SD-Tiny

So WTF is an Audio Embedding Model?

Image-based search engine

Image search with 🤗 datasets

Train custom AI models with the trainer API and adapt them to 🤗

The Annotated Diffusion Model

Uncensor any LLM with abliteration