Wei Xiong

weqweasdas

AI & ML interests

Machine learning, RLHF

Organizations

weqweasdas's activity

New activity in RLHFlow/LLaMA3-SFT about 1 month ago

LLaMA3.1-SFT

3
#3 opened about 1 month ago by jackzhang
New activity in Qwen/Qwen2.5-Math-RM-72B about 1 month ago

example to service the RM

1
#2 opened about 1 month ago by weqweasdas
New activity in RLHFlow/LLaMA3-SFT 3 months ago
New activity in RLHF4MATH/Gemma-7B-it-SFT3epoch 3 months ago

Update README.md

#1 opened 3 months ago by weqweasdas
New activity in RLHFlow/ArmoRM-Llama3-8B-v0.1 3 months ago

Special tokens in the vocabulary?

4
#13 opened 3 months ago by nshen7
New activity in weqweasdas/RM-Mistral-7B 6 months ago

why vocab size is 32001

1
#3 opened 6 months ago by yechenzhi1
New activity in weqweasdas/RM-Mistral-7B 7 months ago

License

1
#2 opened 7 months ago by ravir123

Fix dataset link

#1 opened 7 months ago by ZennyKenny