Gracoy
/

ingredients_compatibility_GPT2_S

Text Generation

generated_from_keras_callback

Inference Endpoints

Model card Files Files and versions Community

Gracoy commited on Aug 3, 2023

Commit

343c1fc

•

1 Parent(s): 6d45b80

Upload model

Files changed (3) hide show

README.md +5 -5
config.json +1 -1
tf_model.h5 +1 -1

README.md CHANGED Viewed

@@ -15,8 +15,8 @@ probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Train Loss: 1.0788
-- Validation Loss: 1.0687
 - Epoch: 2
 ## Model description
@@ -43,9 +43,9 @@ The following hyperparameters were used during training:
 | Train Loss | Validation Loss | Epoch |
 |:----------:|:---------------:|:-----:|
-| 1.4717     | 1.1456          | 0     |
-| 1.1271     | 1.0922          | 1     |
-| 1.0788     | 1.0687          | 2     |
 ### Framework versions

 This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Train Loss: 6.1346
+- Validation Loss: 5.9637
 - Epoch: 2
 ## Model description
 | Train Loss | Validation Loss | Epoch |
 |:----------:|:---------------:|:-----:|
+| 6.7263     | 6.7337          | 0     |
+| 6.6426     | 6.2494          | 1     |
+| 6.1346     | 5.9637          | 2     |
 ### Framework versions

config.json CHANGED Viewed

@@ -11,7 +11,7 @@
   "initializer_range": 0.02,
   "layer_norm_epsilon": 1e-05,
   "model_type": "gpt2",
-  "n_ctx": 1024,
   "n_embd": 768,
   "n_head": 12,
   "n_inner": null,

   "initializer_range": 0.02,
   "layer_norm_epsilon": 1e-05,
   "model_type": "gpt2",
+  "n_ctx": 256,
   "n_embd": 768,
   "n_head": 12,
   "n_inner": null,

tf_model.h5 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4b1e1814853540841f4e8617dd4824b6d0914723c7fbcef8aa79a360c666a5b5
 size 370269264

 version https://git-lfs.github.com/spec/v1
+oid sha256:a250943b5ecf26bf4f16b352d6c79ac23b16ae3991bbb7e57f7d7997c6adefa5
 size 370269264