neuralmagic/Llama-3.1-Nemotron-70B-Instruct-HF-FP8-dynamic • Text Generation • Updated 6 days ago • 2.18k • 6
neuralmagic/Meta-Llama-3.1-8B-Instruct-quantized.w8a16 • Text Generation • Updated 13 days ago • 2.25k • 8
neuralmagic/Meta-Llama-3.1-8B-Instruct-quantized.w8a8 • Text Generation • Updated 12 days ago • 5.7k • 10
neuralmagic/Meta-Llama-3.1-70B-Instruct-quantized.w8a8 • Text Generation • Updated 12 days ago • 26.1k • 7
neuralmagic/Llama-3.2-11B-Vision-Instruct-FP8-dynamic • Text Generation • Updated 20 days ago • 24.8k • 7
neuralmagic/Llama-3.2-90B-Vision-Instruct-FP8-dynamic • Text Generation • Updated 20 days ago • 12.4k • 3
nm-testing/tinyllama-one-shot-static-quant-test-compressed • Text Generation • Updated 13 days ago • 24