Update README.md
README.md
CHANGED
@@ -31,6 +31,9 @@ We evaluate GRM on the [reward model benchmark](https://huggingface.co/spaces/al
 
 
 ## Usage
+**Note 1: Please download the `model.py` file from this repository to ensure the structure is loaded correctly and verify that the `v_head` is properly initialized.**
+
+**Note 2: loading llama3 model into 8 bit could lead to performance degradation.**
 ```
 import torch
 from transformers import AutoTokenizer, AutoModelForSequenceClassification
@@ -56,7 +59,6 @@ with torch.no_grad():
 reward = reward_tensor.cpu().detach().item()
 ```
 
-**Note: loading llama3 model into 8 bit could lead to performance degradation.**
 
 ## Citation
 If you find this model helpful for your research, please cite GRM
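The diff shows only the imports and the final reward extraction from the README's usage snippet. As a rough, self-contained sketch (not the repository's exact code), the scoring flow for a sequence-classification reward model typically looks like the following. The `score` helper, the chat-template formatting, and reading the reward from `.logits` are assumptions for illustration; the actual model depends on the custom `model.py` mentioned in Note 1, which may expose the value head differently.

```python
import torch


def score(model, tokenizer, prompt: str, response: str, device: str = "cpu") -> float:
    """Return a scalar reward for a (prompt, response) pair.

    Assumes a chat-style reward model loaded via
    AutoModelForSequenceClassification.from_pretrained(...) and a
    tokenizer with a chat template; both are hypothetical here.
    """
    messages = [
        {"role": "user", "content": prompt},
        {"role": "assistant", "content": response},
    ]
    # Render the conversation with the tokenizer's chat template.
    text = tokenizer.apply_chat_template(messages, tokenize=False)
    inputs = tokenizer(text, return_tensors="pt").to(device)
    with torch.no_grad():
        # Assumption: the reward is the single logit of the value head.
        reward_tensor = model(**inputs).logits  # shape (1, 1)
    return to_scalar_reward(reward_tensor)


def to_scalar_reward(reward_tensor: torch.Tensor) -> float:
    # Same extraction as the README snippet: move the tensor off the
    # device and convert the single element to a Python float.
    return reward_tensor.cpu().detach().item()
```

Per Note 2, the model would be loaded in full or half precision rather than 8-bit; per Note 1, `model.py` from the repository must be present so the `v_head` weights are initialized correctly.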