Commit f1b6115 by nicholasKluge
1 Parent(s): 699b4d4

Update README.md

Files changed (1)
  1. README.md +11 -4
README.md CHANGED
@@ -2,6 +2,7 @@
  license: apache-2.0
  datasets:
  - nicholasKluge/instruct-aira-dataset
+ - nicholasKluge/reward-aira-dataset
  language:
  - en
  metrics:
@@ -16,11 +17,17 @@ tags:
  - dpo
  pipeline_tag: text-generation
  widget:
- - text: "<|startofinstruction|>Can you explain what is Machine Learning?<|endofinstruction|>"
+ - text: >-
+     <|startofinstruction|>Can you explain what is Machine
+     Learning?<|endofinstruction|>
    example_title: Machine Learning
- - text: "<|startofinstruction|>Do you know anything about virtue ethics?<|endofinstruction|>"
+ - text: >-
+     <|startofinstruction|>Do you know anything about virtue
+     ethics?<|endofinstruction|>
    example_title: Ethics
- - text: "<|startofinstruction|>How can I make my girlfriend happy?<|endofinstruction|>"
+ - text: >-
+     <|startofinstruction|>How can I make my girlfriend
+     happy?<|endofinstruction|>
    example_title: Advise
  inference:
    parameters:
@@ -139,4 +146,4 @@ The model will output something like:

  ## License

- The `Aira-2-124M-DPO` is licensed under the Apache License, Version 2.0. See the [LICENSE](LICENSE) file for more details.
+ The `Aira-2-124M-DPO` is licensed under the Apache License, Version 2.0. See the [LICENSE](LICENSE) file for more details.
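
Note on the widget change: the three `text` entries move from double-quoted strings to YAML folded block scalars (`>-`). In the simple case used here (no blank lines, no extra-indented lines), a folded scalar joins its lines with a single space and drops the trailing newline, so the prompt strings should be unchanged. The sketch below models that folding by hand rather than calling a YAML parser; `fold_strip`, `OLD_TEXT`, and `NEW_BLOCK` are hypothetical names introduced for illustration, and the 4-space indentation of the block lines is an assumption.

```python
# Simplified model of YAML's folded block scalar with strip chomping (`>-`).
# Assumption: the block has no blank lines and no more-indented lines,
# which holds for the three widget entries touched by this commit.
def fold_strip(block_lines):
    """Fold the lines of a simple `>-` block into a single string."""
    # Indentation is structural and dropped; each single line break
    # folds into one space; the final newline is stripped by `-`.
    return " ".join(line.strip() for line in block_lines)


# Old form: one double-quoted string (taken from the removed line).
OLD_TEXT = (
    "<|startofinstruction|>Can you explain what is Machine "
    "Learning?<|endofinstruction|>"
)

# New form: the two content lines of the `>-` block (from the added lines).
NEW_BLOCK = [
    "    <|startofinstruction|>Can you explain what is Machine",
    "    Learning?<|endofinstruction|>",
]

folded = fold_strip(NEW_BLOCK)
print(folded == OLD_TEXT)
```

The same check can be repeated for the "Ethics" and "Advise" entries. A real YAML parser handles more cases (blank lines, indentation indicators, other chomping modes), so this is only a sanity check for this particular diff.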