Streaming Diffusion Policy: Fast Policy Synthesis with Variable Noise Diffusion Models Paper • 2406.04806 • Published Jun 7 • 1 • 1
Self-Reflection in LLM Agents: Effects on Problem-Solving Performance Paper • 2405.06682 • Published May 5 • 2 • 1
Probabilistic Programming with Programmable Variational Inference Paper • 2406.15742 • Published Jun 22 • 2 • 1
Trace is the New AutoDiff -- Unlocking Efficient Optimization of Computational Workflows Paper • 2406.16218 • Published Jun 23 • 1 • 1
TaskGen: A Task-Based, Memory-Infused Agentic Framework using StrictJSON Paper • 2407.15734 • Published Jul 22 • 1 • 1
Grokfast: Accelerated Grokking by Amplifying Slow Gradients Paper • 2405.20233 • Published May 30 • 5 • 1
HyperZ$\cdot$Z$\cdot$W Operator Connects Slow-Fast Networks for Full Context Interaction Paper • 2401.17948 • Published Jan 31 • 2 • 1
Extreme Compression of Large Language Models via Additive Quantization Paper • 2401.06118 • Published Jan 11 • 12 • 1
OpenRLHF: An Easy-to-use, Scalable and High-performance RLHF Framework Paper • 2405.11143 • Published May 20 • 33 • 3
T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy Paper • 2403.14610 • Published Mar 21 • 3 • 2
EasyQuant: An Efficient Data-free Quantization Algorithm for LLMs Paper • 2403.02775 • Published Mar 5 • 11 • 3
MEGABYTE: Predicting Million-byte Sequences with Multiscale Transformers Paper • 2305.07185 • Published May 12, 2023 • 9 • 9
Efficient Training of Language Models to Fill in the Middle Paper • 2207.14255 • Published Jul 28, 2022 • 1 • 1
LoRA: Low-Rank Adaptation of Large Language Models Paper • 2106.09685 • Published Jun 17, 2021 • 29 • 4
Empirical Study of PEFT techniques for Winter Wheat Segmentation Paper • 2310.01825 • Published Oct 3, 2023 • 2 • 1
From PEFT to DEFT: Parameter Efficient Finetuning for Reducing Activation Density in Transformers Paper • 2402.01911 • Published Feb 2 • 2 • 1
The FinBen: An Holistic Financial Benchmark for Large Language Models Paper • 2402.12659 • Published Feb 20 • 16 • 5
FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models Paper • 2402.10986 • Published Feb 16 • 76 • 5
Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling Paper • 2401.16380 • Published Jan 29 • 47 • 7