LnL-AI/TinyLlama-1.1B-Chat-v1.0-GPTQ-4bit

Text Generation

Inference Endpoints

4-bit precision

Qubitium committed on Mar 29

Commit edc9e65 (1 parent: 031ec0f)

Update README.md

Files changed (1)

README.md +16 -12

@@ -2,17 +2,21 @@
 license: unknown
 ---
-This is TinyLlama/TinyLlama-1.1B-Chat-v1.0 quantized with AutoGPTQ in 4-bit.
 **Quantize config:**
-"bits": 4,
-"group_size": 128,
-"damp_percent": 0.005,
-"desc_act": false,
-"static_groups": false,
-"sym": true,
-"true_sequential": true,
-"model_name_or_path": null,
-"model_file_base_name": null,
-"checkpoint_format": "gptq",
-"quant_method": "gptq"

 license: unknown
 ---
+This is TinyLlama/TinyLlama-1.1B-Chat-v1.0 quantized with AutoGPTQ in GPTQ 4-bit format.
 **Quantize config:**
+```
+{
+  "bits": 4,
+  "group_size": 128,
+  "damp_percent": 0.01,
+  "desc_act": false,
+  "static_groups": false,
+  "sym": true,
+  "true_sequential": true,
+  "model_name_or_path": null,
+  "model_file_base_name": null,
+  "quant_method": "gptq",
+  "checkpoint_format": "gptq"
+}
+```
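
To check what this commit changes in the quantize config beyond formatting, the old and new values can be compared programmatically. The sketch below is illustrative only: it transcribes both configs from the diff above, wraps the old one in braces so it parses as JSON (the original README lines had none), and uses a hypothetical helper named `changed_keys` that is not part of AutoGPTQ.

```python
import json

# Old config: the removed ("-") lines of the diff, wrapped in braces
# so that they form a valid JSON object.
OLD_CONFIG = json.loads("""
{
  "bits": 4,
  "group_size": 128,
  "damp_percent": 0.005,
  "desc_act": false,
  "static_groups": false,
  "sym": true,
  "true_sequential": true,
  "model_name_or_path": null,
  "model_file_base_name": null,
  "checkpoint_format": "gptq",
  "quant_method": "gptq"
}
""")

# New config: the added ("+") lines of the diff, copied as-is.
NEW_CONFIG = json.loads("""
{
  "bits": 4,
  "group_size": 128,
  "damp_percent": 0.01,
  "desc_act": false,
  "static_groups": false,
  "sym": true,
  "true_sequential": true,
  "model_name_or_path": null,
  "model_file_base_name": null,
  "quant_method": "gptq",
  "checkpoint_format": "gptq"
}
""")

# Sentinel that distinguishes "key absent" from an explicit null value.
_MISSING = object()


def changed_keys(old, new):
    """Return {key: (old_value, new_value)} for keys whose values differ.

    Hypothetical helper for illustration. Key order is ignored, so a pure
    reordering of keys is not reported as a change.
    """
    result = {}
    for key in sorted(set(old) | set(new)):
        before = old.get(key, _MISSING)
        after = new.get(key, _MISSING)
        if before != after:
            result[key] = (before, after)
    return result


if __name__ == "__main__":
    for key, (before, after) in changed_keys(OLD_CONFIG, NEW_CONFIG).items():
        print(f"{key}: {before} -> {after}")
```

Run as a script, this prints `damp_percent: 0.005 -> 0.01`. The only value that differs is `damp_percent`; the swap of `quant_method` and `checkpoint_format` is a reordering and does not affect the parsed config.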