lulzx (Rishabh Singh)

upvoted an article 6 days ago

Article

Model2Vec: Distill a Small Fast Model from any Sentence Transformer

9 days ago

• 48

upvoted a paper 14 days ago

RevisEval: Improving LLM-as-a-Judge via Response-Adapted References

Paper • 2410.05193 • Published 15 days ago • 12

upvoted a collection about 2 months ago

Hermes 3

Collection

The Hermes 3 Series of Models • 8 items • Updated Aug 23 • 85

upvoted an article about 2 months ago

Article

Automatic Hallucination detection with SelfCheckGPT NLI

Nov 27, 2023

• 4

upvoted a collection 2 months ago

Zeroshot Classifiers

Collection

These are my current best zeroshot classifiers. Some of my older models are downloaded more often, but the models in this collection are newer/better. • 11 items • Updated Apr 3 • 109

upvoted an article 2 months ago

Article

Extractive Question Answering with AutoTrain

Aug 20

• 13

upvoted a paper 2 months ago

Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers

Paper • 2408.06195 • Published Aug 12 • 61

upvoted 4 papers 3 months ago

Gemma 2: Improving Open Language Models at a Practical Size

Paper • 2408.00118 • Published Jul 31 • 73

Repurposing Language Models into Embedding Models: Finding the Compute-Optimal Recipe

Paper • 2406.04165 • Published Jun 6 • 1

Improving Text Embeddings for Smaller Language Models Using Contrastive Fine-tuning

Paper • 2408.00690 • Published Aug 1 • 21

Towards Achieving Human Parity on End-to-end Simultaneous Speech Translation via LLM Agent

Paper • 2407.21646 • Published Jul 31 • 18

upvoted a collection 3 months ago

Gemma 2 2B Release

Collection

The 2.6B parameter version of Gemma 2. • 6 items • Updated Jul 31 • 76

upvoted 7 articles 3 months ago

Article

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

Jul 29

• 229

Article

Announcing New Dataset Search Features

Jul 8

• 22

Article

LAVE: Zero-shot VQA Evaluation on Docmatix with LLMs - Do We Still Need Fine-Tuning?

Jul 25

• 18

Article

SetFit: Efficient Few-Shot Learning Without Prompts

Sep 26, 2022

• 16

Article

WWDC 24: Running Mistral 7B with Core ML

Jul 22

• 55

Article

ZebraLogic: Benchmarking the Logical Reasoning Ability of Language Models

Jul 27

• 22

Article

Red-Teaming Large Language Models

Feb 24, 2023

• 13

upvoted 2 papers 3 months ago

LazyLLM: Dynamic Token Pruning for Efficient Long Context LLM Inference

Paper • 2407.14057 • Published Jul 19 • 44

Qwen2 Technical Report

Paper • 2407.10671 • Published Jul 15 • 155

upvoted an article 3 months ago

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16

• 248

upvoted a collection 3 months ago

H2O Danube3

Collection

6 items • Updated 5 days ago • 52

upvoted 2 papers 4 months ago

Direct Preference Knowledge Distillation for Large Language Models

Paper • 2406.19774 • Published Jun 28 • 21

LiteSearch: Efficacious Tree Search for LLM

Paper • 2407.00320 • Published Jun 29 • 37

upvoted a collection 4 months ago

Gemma 2 Release

Collection

15 items • Updated Sep 9 • 187

upvoted an article 4 months ago

Article

Welcome Gemma 2 - Google's new open LLM

Jun 27

• 120

upvoted a paper 4 months ago

VoCo-LLaMA: Towards Vision Compression with Large Language Models

Paper • 2406.12275 • Published Jun 18 • 29

upvoted 2 articles 5 months ago

Article

Let's talk about LLM evaluation

May 23

• 123

Article

TTS Arena: Benchmarking Text-to-Speech Models in the Wild

Feb 27

• 32

upvoted a paper 8 months ago

Simple linear attention language models balance the recall-throughput tradeoff

Paper • 2402.18668 • Published Feb 28 • 18

upvoted 2 collections 9 months ago

Information Extraction Datasets

Collection

Collection of datasets for various information extraction tasks. • 3 items • Updated Sep 10 • 5

Universal token classification

Collection

Collection of universal token classification (UTC) models that can solve many information extraction tasks in a prompt-tuned manner. • 11 items • Updated Sep 10 • 12

upvoted 3 papers 9 months ago

upvoted a collection 9 months ago

Model Merging

Collection

Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12 • 216

upvoted 2 papers 9 months ago

WaveCoder: Widespread And Versatile Enhanced Instruction Tuning with Refined Data Generation

Paper • 2312.14187 • Published Dec 20, 2023 • 49

FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness

Paper • 2205.14135 • Published May 27, 2022 • 11

