Models
Datasets
Spaces
Posts
Docs
Pricing
Log In
Sign Up

ModelCloud
/

Meta-Llama-3.1-8B-gptq-4bit

Text Generation

text-generation-inference

Inference Endpoints

4-bit precision

Model card Files Files and versions Community

Meta-Llama-3.1-8B-gptq-4bit

3 contributors

History: 6 commits

lrl-modelcloud's picture

cl-modelcloud's picture

Upload tokenizer.json (#2)

29f5fac verified 3 months ago

.gitattributes

1.52 kB

initial commit 3 months ago
README.md

1.14 kB

Update README.md 3 months ago
config.json

1.28 kB

Upload folder using huggingface_hub (#1) 3 months ago
model.safetensors

5.73 GB
LFS

Upload folder using huggingface_hub (#1) 3 months ago
quantize_config.json

340 Bytes

Upload folder using huggingface_hub (#1) 3 months ago
special_tokens_map.json

335 Bytes

Upload folder using huggingface_hub (#1) 3 months ago
tokenizer.json

9.09 MB

Upload tokenizer.json (#2) 3 months ago
tokenizer_config.json

50.5 kB

Upload folder using huggingface_hub (#1) 3 months ago