lewtun HF staff committed on
Commit
a93c799
1 Parent(s): 29fd236

Add evaluation results on the qqp config of glue

Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the qqp config of the [glue](https://huggingface.co/datasets/glue) dataset by @lewtun, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-staging-eval-glue-qqp-c973af-14676003).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=glue).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=glue).

Files changed (1)
  1. README.md +33 -0
README.md CHANGED
@@ -26,6 +26,39 @@ model-index:
     - name: F1
       type: f1
       value: 0.8670050100852366
+  - task:
+      type: natural-language-inference
+      name: Natural Language Inference
+    dataset:
+      name: glue
+      type: glue
+      config: qqp
+      split: validation
+    metrics:
+    - name: Accuracy
+      type: accuracy
+      value: 0.8989859015582489
+      verified: true
+    - name: Precision
+      type: precision
+      value: 0.8407470502870844
+      verified: true
+    - name: Recall
+      type: recall
+      value: 0.8951965065502183
+      verified: true
+    - name: AUC
+      type: auc
+      value: 0.9590670523994457
+      verified: true
+    - name: F1
+      type: f1
+      value: 0.8671178499381792
+      verified: true
+    - name: loss
+      type: loss
+      value: 0.2457672506570816
+      verified: true
 ---
 
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
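
As a quick sanity check on the verified metrics added in this commit, the reported F1 can be recomputed from the reported precision and recall. This is a minimal sketch that assumes the evaluator reports binary F1 as the harmonic mean of precision and recall; the helper name `f1_from` is hypothetical and is not part of the evaluator.

```python
import math

# Values copied from the metrics block added in this commit.
precision = 0.8407470502870844
recall = 0.8951965065502183
reported_f1 = 0.8671178499381792


def f1_from(precision: float, recall: float) -> float:
    """Return the harmonic mean of precision and recall (binary F1).

    Hypothetical helper for illustration only.
    """
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


computed_f1 = f1_from(precision, recall)

# Compare with a loose tolerance; the exact rounding used by the
# evaluator is not known from this commit.
f1_matches = math.isclose(computed_f1, reported_f1, rel_tol=1e-6)
print(f"computed F1 = {computed_f1:.6f}, matches reported: {f1_matches}")
```

If the two values agree, the precision, recall, and F1 entries are at least mutually consistent. The check says nothing about whether the underlying predictions are correct.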