bhenrym14 committed
Commit a6e30d7
1 Parent(s): 6d2b533

Update README.md

Files changed (1):
  1. README.md +9 -9
README.md CHANGED
@@ -5,7 +5,7 @@ datasets:
 
 Mostly untested!
 
-# RoPE Scaled QLoRA Fine-tune of Llama-13b on airoboros-gpt4-1.4.1 (GPTQ)
+# RoPE Scaled QLoRA Fine-tune of Llama-33b on airoboros-gpt4-1.4.1 (GPTQ)
 
 ## Overview
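The retitle reflects both the model size (33b) and the RoPE scaling used for extended context. As a rough illustration of the technique, here is a minimal, self-contained sketch of linear RoPE position interpolation; the scale factor and head dimension are illustrative and not taken from this checkpoint.

```python
# Minimal sketch of linear RoPE scaling ("position interpolation").
# Positions are divided by `scale` so a window `scale` times longer than the
# trained context reuses the trained positional range. Values are illustrative.
import torch

def scaled_rope_cos_sin(seq_len: int, dim: int, base: float = 10000.0,
                        scale: float = 1.0):
    # Standard rotary inverse frequencies for a head dimension `dim`.
    inv_freq = 1.0 / (base ** (torch.arange(0, dim, 2).float() / dim))
    # Linear interpolation: compress positions by the scale factor.
    t = torch.arange(seq_len).float() / scale
    freqs = torch.outer(t, inv_freq)
    emb = torch.cat((freqs, freqs), dim=-1)
    return emb.cos(), emb.sin()

# e.g. an 8192-token window mapped onto a 2048-token trained range:
cos, sin = scaled_rope_cos_sin(seq_len=8192, dim=128, scale=4.0)
```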
 
@@ -61,15 +61,14 @@ Quantized with AutoGPTQ (bits = 4, group_size = 128, desc_act = True).
 
 See original model card below.
 
 
-# Original model card: Jon Durbin's Airoboros 7B GPT4 1.4
-
-__mostly untested, use if you want, or wait for some validation__
+# Original model card: Jon Durbin's Airoboros 33B GPT4 1.4
+
+__not yet tested!__
 
 ## Overview
 
-This is a __full__ (not qlora) fine-tune of a 7b parameter LlaMa model, using completely synthetic training data created by gpt4 via https://github.com/jondurbin/airoboros
+This is a qlora fine-tune of a 33b parameter LlaMa model, using completely synthetic training data created by gpt4 via https://github.com/jondurbin/airoboros
 
 This is mostly an extension of the previous gpt-4 series, with a few extras:
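The hunk header above records the quantization settings: AutoGPTQ with bits = 4, group_size = 128, desc_act = True. A minimal sketch of how those settings map onto AutoGPTQ's API follows; the model path, output directory, and calibration text are placeholders.

```python
# Sketch of 4-bit GPTQ quantization with the settings named above.
# Paths and the calibration example are placeholders, not from this repo.
from transformers import AutoTokenizer
from auto_gptq import AutoGPTQForCausalLM, BaseQuantizeConfig

base = "path/to/merged-fp16-model"
tokenizer = AutoTokenizer.from_pretrained(base)

quantize_config = BaseQuantizeConfig(
    bits=4,          # 4-bit weights
    group_size=128,  # per-group quantization granularity
    desc_act=True,   # act-order: quantize columns by decreasing activation
)

model = AutoGPTQForCausalLM.from_pretrained(base, quantize_config)

# GPTQ calibrates against tokenized examples; a real run would use a few
# hundred samples from a representative corpus rather than one sentence.
examples = [tokenizer("The quick brown fox jumps over the lazy dog.",
                      return_tensors="pt")]
model.quantize(examples)
model.save_quantized("output-GPTQ")
```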
 
@@ -80,7 +79,7 @@ This is mostly an extension of the previous gpt-4 series, with a few extras:
 * riddles
 * all coding instructions have an equivalent " PLAINFORMAT" version now (and all rosettacode examples were trained with PLAINFORMAT)
 
-This model was fine-tuned with a fork of [FastChat](https://github.com/jondurbin/FastChat)
+This model was fine-tuned with a fork of [qlora](https://github.com/jondurbin/qlora)
 
 The prompt it was trained with was:
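The training harness changes here from FastChat to a qlora fork. For orientation, a generic QLoRA setup (4-bit NF4 base weights plus LoRA adapters via peft) looks roughly like the sketch below; the path, rank, and target modules are illustrative assumptions, not this repo's actual configuration.

```python
# Generic QLoRA sketch: 4-bit NF4 base model with trainable LoRA adapters.
# All hyperparameters and the model path are illustrative assumptions.
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
    bnb_4bit_use_double_quant=True,
)

model = AutoModelForCausalLM.from_pretrained(
    "path/to/llama-33b", quantization_config=bnb_config, device_map="auto"
)
model = prepare_model_for_kbit_training(model)

lora_config = LoraConfig(
    r=64, lora_alpha=16, lora_dropout=0.05, bias="none",
    task_type="CAUSAL_LM",
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
)
model = get_peft_model(model, lora_config)  # only adapter weights train
```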
 
@@ -103,7 +102,7 @@ Be sure you are pulling the latest branch!
 Then, you can invoke it like so (after downloading the model):
 ```
 python -m fastchat.serve.cli \
-  --model-path airoboros-7b-gpt4-1.4 \
+  --model-path airoboros-33b-gpt4-1.4 \
   --temperature 0.5 \
   --max-new-tokens 2048 \
   --no-history
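To load the GPTQ checkpoint programmatically rather than through FastChat's CLI, a sketch along these lines should work with AutoGPTQ; the local path and prompt here are placeholders, and the exact prompt format the model was trained with is given in the model card.

```python
# Sketch: load a GPTQ-quantized checkpoint for inference with AutoGPTQ.
# The directory name and the prompt are placeholders.
from transformers import AutoTokenizer
from auto_gptq import AutoGPTQForCausalLM

path = "path/to/airoboros-33b-gpt4-GPTQ"
tokenizer = AutoTokenizer.from_pretrained(path)
model = AutoGPTQForCausalLM.from_quantized(path, device="cuda:0",
                                           use_safetensors=True)

inputs = tokenizer("USER: Write a haiku about rivers. ASSISTANT:",
                   return_tensors="pt").to("cuda:0")
output = model.generate(**inputs, do_sample=True, temperature=0.5,
                        max_new_tokens=128)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```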
@@ -203,10 +202,11 @@ Or:
 Write a multi-threaded TCP server in C that accepts a "GET [key]" input and "SET [key] [value]" input, and uses a binary tree to get and store the input values.
 ```
 
-You can optionally add a single space and "PLAINFORMAT" at the end of your prompt to avoid backticks, explanations, etc. and just print the code, e.g.:
+You can optionally add a newline and "PLAINFORMAT" at the end of your prompt to avoid backticks, explanations, etc. and just print the code, e.g.:
 
 ```
-Write a websocket application in node.js. PLAINFORMAT
+Write a websocket application in node.js.
+PLAINFORMAT
 ```
 
 ### Word games / trivia