GPT007 (Marc Kovka)

upvoted an article 2 months ago

Article

Google releases Gemma 2 2B, ShieldGemma and Gemma Scope

Jul 31

• 59

upvoted an article 3 months ago

Article

XetHub is joining Hugging Face!

Aug 8

• 79

upvoted 4 collections 3 months ago

upvoted a paper 3 months ago

Qwen2-Audio Technical Report

Paper • 2407.10759 • Published Jul 15 • 54

upvoted a collection 3 months ago

Llama 3.1

Collection

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated 27 days ago • 597

upvoted a paper 3 months ago

Compact Language Models via Pruning and Knowledge Distillation

Paper • 2407.14679 • Published Jul 19 • 37

upvoted a collection 3 months ago

DynMoE Family

Collection

DynMoE model checkpoints and paper on huggingface • 4 items • Updated Aug 19 • 3

upvoted a paper 3 months ago

Scaling Diffusion Transformers to 16 Billion Parameters

Paper • 2407.11633 • Published Jul 16 • 25

upvoted a collection 3 months ago

DCLM

Collection

DCLM Models + Datasets • 6 items • Updated 18 days ago • 24

upvoted a paper 3 months ago

Training language models to follow instructions with human feedback

Paper • 2203.02155 • Published Mar 4, 2022 • 15

upvoted a collection 3 months ago

Qwen2

Collection

Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated Sep 18 • 343

upvoted a paper 3 months ago

Video Diffusion Alignment via Reward Gradients

Paper • 2407.08737 • Published Jul 11 • 47

upvoted an article 3 months ago

Article

Introducing Ghost 8B Beta: A Game-Changing Language Model

Jul 17

• 7

upvoted a paper 3 months ago

Self-Rewarding Language Models

Paper • 2401.10020 • Published Jan 18 • 143

upvoted an article 3 months ago

Article

Train a Llama model from scratch

Jul 29

• 44

upvoted 2 papers 3 months ago

Vision language models are blind

Paper • 2407.06581 • Published Jul 9 • 82

Graph-Based Captioning: Enhancing Visual Descriptions by Interconnecting Region Captions

Paper • 2407.06723 • Published Jul 9 • 10

upvoted an article 3 months ago

Article

MInference 1.0: 10x Faster Million Context Inference with a Single GPU

Jul 11

• 11

upvoted 3 collections 4 months ago

Collection Zero & Demo

Collection

Image Gen - Text-to-Image • 22 items • Updated Sep 8 • 10

Most influential papers in AI

Collection

4 items • Updated Nov 16, 2023 • 31

Transformers.js demos

Collection

A collection of my favorite WebML demos, built with Transformers.js! • 30 items • Updated Jul 11 • 84

upvoted a paper 4 months ago

No Training, No Problem: Rethinking Classifier-Free Guidance for Diffusion Models

Paper • 2407.02687 • Published Jul 2 • 22

upvoted 3 collections 4 months ago

Perturbed Attention Guidance pipelines

Collection

Pipelines for Perturbed Attention Guidance with 🧨 library • 8 items • Updated Jun 26 • 6

LLM Compiler

Collection

Meta LLM Compiler is a state-of-the-art LLM that builds upon Code Llama with improved performance for code optimization and compiler reasoning. • 4 items • Updated Jun 27 • 147

Gemma 2 Release

Collection

15 items • Updated Sep 9 • 187

upvoted 4 papers 4 months ago

The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale

Paper • 2406.17557 • Published Jun 25 • 85

Adam-mini: Use Fewer Learning Rates To Gain More

Paper • 2406.16793 • Published Jun 24 • 67

MUMU: Bootstrapping Multimodal Image Generation from Text-to-Image Data

Paper • 2406.18790 • Published Jun 26 • 33

DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation

Paper • 2406.16855 • Published Jun 24 • 54

upvoted an article 4 months ago

Article

Introducing Würstchen: Fast Diffusion for Image Generation

Sep 13, 2023

• 11

upvoted 2 papers 4 months ago

VideoTetris: Towards Compositional Text-to-Video Generation

Paper • 2406.04277 • Published Jun 6 • 22

TextGrad: Automatic "Differentiation" via Text

Paper • 2406.07496 • Published Jun 11 • 26

upvoted a collection 4 months ago

abliterated-v3

Collection

Latest gen of the abliterated models I've produced • 17 items • Updated Jun 3 • 95

upvoted an article 5 months ago

Article

Uncensor any LLM with abliteration

Jun 13

• 350

upvoted 2 collections 5 months ago

OpenMath

Collection

A collection of models and datasets introduced in "OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset" • 15 items • Updated 22 days ago • 37

🍷 FineWeb datasets

Collection

5 items • Updated Jun 26 • 19

upvoted a paper 5 months ago

Attention Is All You Need

Paper • 1706.03762 • Published Jun 12, 2017 • 43

upvoted 3 collections 5 months ago

WizardLM

Collection

0 items • Updated Jul 11 • 103

Universal token classification

Collection

Collection of universal token classification (UTC) models capable of solving many information extraction tasks in a prompt-tuned manner. • 11 items • Updated Sep 10 • 12

Anime Diffusion

Collection

6 items • Updated May 31 • 2

upvoted a paper 5 months ago

Executable Code Actions Elicit Better LLM Agents

Paper • 2402.01030 • Published Feb 1 • 27

upvoted an article 5 months ago

Article

Decoding GPT-4'o': In-Depth Exploration of Its Mechanisms and Creating Similar AI.

May 21

• 32
