Puffy Bird

Paper • 2409.13592 • Published Sep 20 • 46

#1 opened about 2 months ago by

TheBigBlockPC

commented a paper 29 days ago

YesBut: A High-Quality Annotated Multimodal Dataset for evaluating Satire Comprehension capability of Vision-Language Models

New activity in G-reen/gpt5o-reflexion-q-agi-llama-3.1-8b about 1 month ago

G-reen/gpt5o-reflexion-q-agi-llama-3.1-8b Just SHOCKED The Entire INDUSTRY with 12000 volts

#15 opened about 1 month ago by

nonetrix

New activity in deepseek-ai/DeepSeek-V2.5 about 2 months ago

DeepSeek-Coder-V2.5-Lite

#3 opened about 2 months ago by

smcleod

New activity in qihoo360/FancyVideo 2 months ago

Glad to see Qihoo Using HF!

#1 opened 2 months ago by

Paper • 2407.12665 • Published Jul 17 • 16

commented a paper 3 months ago

Patch-Level Training for Large Language Models

New activity in Tencent-Hunyuan/HunyuanDiT-v1.2-Diffusers-Distilled 4 months ago

Update from HunyuanDiT v1.1

#1 opened 4 months ago by

Paper • 2407.00320 • Published Jun 29 • 37

commented 2 papers 4 months ago

LiteSearch: Efficacious Tree Search for LLM

Scaling Laws for Linear Complexity Language Models

Paper • 2406.16690 • Published Jun 24 • 22

New activity in hpcai-tech/open-sora 4 months ago

🚩 Report: Not working

#4 opened 4 months ago by

uraniumcrystalsmaster

New activity in puffy310/ZeroGPU-DeepSeek-V2-LiteCoder 4 months ago

Apply for community grant: Academic project (gpu)

#1 opened 4 months ago by

Paper • 2406.11931 • Published Jun 17 • 57

commented 2 papers 4 months ago

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

Paper • 2406.11831 • Published Jun 17 • 19

Exploring the Role of Large Language Models in Prompt Encoding for Diffusion Models

New activity in IndexTeam/Index-1.9B-Chat 4 months ago

Model Scaling

#1 opened 4 months ago by

Paper • 2406.06282 • Published Jun 10 • 36

commented 2 papers 4 months ago

PowerInfer-2: Fast Large Language Model Inference on a Smartphone

Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models

Paper • 2406.06563 • Published Jun 3 • 17

New activity in Skywork/Skywork-MoE-Base 4 months ago

Intermediate Checkpoints

#4 opened 4 months ago by

New activity in dalle-mini/dalle-mini 5 months ago

How to convert this model into safetensors format for use in comfyUI?

#46 opened 9 months ago by

hahaMOMO

New activity in ByteDance/Make-An-Audio-2 5 months ago

Comparison to AudioLDM 2

#2 opened 5 months ago by

Paper • 2405.12250 • Published May 19 • 150

commented a paper 5 months ago

Your Transformer is Secretly Linear

New activity in deepseek-ai/DeepSeek-V2 5 months ago

Smaller Models

#2 opened 6 months ago by

New activity in Tencent-Hunyuan/HunyuanDiT 5 months ago

Not "open source"

Paper • 2312.04916 • Published Dec 8, 2023 • 6

#4 opened 5 months ago by

ostris

New activity in deepseek-ai/DeepSeek-V2-Chat 6 months ago

GPTQ plz

#3 opened 6 months ago by

Parkerlambert123

commented 2 papers 7 months ago

#3 opened almost 2 years ago by

Crowyote

commented a paper 9 months ago

EE-LLM: Large-Scale Training and Inference of Early-Exit Large Language Models with 3D Parallelism

New activity in THUDM/chatglm-6b 10 months ago

🚩 Report

#102 opened 10 months ago by

Update tokenization_chatglm.py

#100 opened 11 months ago by

Paper • 2401.02415 • Published Jan 4 • 53

commented 4 papers 10 months ago

LLaMA Pro: Progressive LLaMA with Block Expansion

Paper • 2401.01055 • Published Jan 2 • 53

LLaMA Beyond English: An Empirical Study on Language Capability Transfer

Paper • 2312.11805 • Published Dec 19, 2023 • 45

Gemini: A Family of Highly Capable Multimodal Models

A Challenger to GPT-4V? Early Explorations of Gemini in Visual Expertise

Paper • 2312.12436 • Published Dec 19, 2023 • 13

New activity in ali-vilab/i2vgen-xl 10 months ago

Demo

#4 opened 10 months ago by

Paper • 2312.06550 • Published Dec 11, 2023 • 56

commented a paper 10 months ago

LLM360: Towards Fully Transparent Open-Source LLMs

New activity in Skywork/SkyPile-150B 11 months ago

Dataset Source

#3 opened 11 months ago by

New activity in playgroundai/playground-v2-512px-base 11 months ago

Latent Diffusion Scaling Laws

#3 opened 11 months ago by

Paper • 2312.02087 • Published Dec 4, 2023 • 20

commented a paper 11 months ago

VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence

New activity in 01-ai/Yi-34B-Chat 11 months ago

Chinese Answers are better than the English Answers

#4 opened 11 months ago by

New activity in DiscloseAI/ChatAnything 11 months ago

Apply for community grant: Academic project (gpu)

Paper • 2310.16795 • Published Oct 25, 2023 • 26

#1 opened 11 months ago by

ermu2001

commented a paper 12 months ago

QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models

New activity in LinkSoul/LLaSM-Audio-Instructions about 1 year ago

LLASM Audio Instructions

#1 opened about 1 year ago by

Paper • 2310.11453 • Published Oct 17, 2023 • 96

commented a paper about 1 year ago

BitNet: Scaling 1-bit Transformers for Large Language Models

New activity in lmsys/lmsys-chat-1m about 1 year ago

Dataset Access

#1 opened about 1 year ago by

New activity in artificialguybr/qwen-14b-chat-demo about 1 year ago

Chem

#7 opened about 1 year ago by

Chem

#6 opened about 1 year ago by

New activity in internlm/internlm-xcomposer-7b about 1 year ago

I just witnessed the birth of something incredible.

#1 opened about 1 year ago by

New activity in Qwen/Qwen-14B-Chat about 1 year ago

Alignment Details

#6 opened about 1 year ago by

New activity in TempoFunk/makeavid-sd-jax over 1 year ago

This thing is synthetic nightmares

#5 opened over 1 year ago by

synthetisoft

New activity in TheBirdLegacy/NGA_Art_SD-V1.5 over 1 year ago

Fix deprecation warning by changing `CLIPFeatureExtractor` to `CLIPImageProcessor`.

#6 opened over 1 year ago by

patrickvonplaten

New activity in TheBirdLegacy/OLM-GPT2-Yannic over 1 year ago

Adding `safetensors` variant of this model

#2 opened over 1 year ago by

SFconvertbot

New activity in TheBirdLegacy/NGA_Art_SD-V1.5 over 1 year ago

Librarian Bot: Update dataset YAML metadata for model

#5 opened over 1 year ago by

librarian-bot

New activity in TheBirdLegacy/OSD-Model over 1 year ago

Add `scale_factor` to vae config.

#2 opened over 1 year ago by

New activity in TheBirdLegacy/BlueGuy-V2.1 over 1 year ago

Add `scale_factor` to vae config.

#1 opened over 1 year ago by

New activity in TheBirdLegacy/PhotorealV0.5 over 1 year ago

Add `scale_factor` to vae config.

#1 opened over 1 year ago by

New activity in puffy310/JastaDreambooth over 1 year ago

Add `scale_factor` to vae config.

#1 opened over 1 year ago by

New activity in TheBirdLegacy/NGA_Art_SD-V1.5 over 1 year ago

Add `scale_factor` to vae config.

#4 opened over 1 year ago by

New activity in TheBirdLegacy/NGA_Art almost 2 years ago

This is one of the best models out there.

#1 opened almost 2 years ago by

Crowyote

New activity in TheBirdLegacy/OSD-Model almost 2 years ago

enable Hosted inference API