2140.8 TFLOPS
Quentin Gallouédec
qgallouedec
34 followers · 29 following
https://gallouedec.com
QGallouedec
qgallouedec
AI & ML interests
None yet
Articles
Preference Optimization for Vision Language Models
Jul 10
•
40
Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent
Apr 22
•
78
Organizations
Papers
4
arxiv:2402.09844
arxiv:2402.03046
arxiv:2208.14928
arxiv:2106.13687
spaces
1
Sleeping
🏃
Bibtex Cleaner
models
663
qgallouedec/Qwen2-0.5B-SFT
Updated
1 day ago
qgallouedec/Qwen2.5-7B-DPO-main
Text Generation
•
Updated
1 day ago
•
37
qgallouedec/Qwen2.5-7B-DPO-2209
Text Generation
•
Updated
1 day ago
•
28
qgallouedec/gkd-model
Updated
13 days ago
qgallouedec/gpt2-zen
Updated
15 days ago
qgallouedec/Qwen2-0.5B-Instruct-SFT-Capybara
Text Generation
•
Updated
20 days ago
•
28
qgallouedec/Qwen2-0.5B-Instruct-Capybara
Updated
20 days ago
qgallouedec/xpo-qwen2
Text Generation
•
Updated
26 days ago
•
53
qgallouedec/online-dpo-qwen2-4
Text Generation
•
Updated
27 days ago
•
79
qgallouedec/online-dpo-qwen2-2
Text Generation
•
Updated
27 days ago
•
68
datasets
67
qgallouedec/trl-metrics
Viewer
•
Updated
1 day ago
•
50.9k
•
114
qgallouedec/prm800k
Viewer
•
Updated
21 days ago
•
41.2k
•
45
•
1
qgallouedec/ultrafeedback-prompt
Viewer
•
Updated
Sep 9
•
60.9k
•
34
qgallouedec/ultrafeedback-gpt-3.5-turbo-helpfulness
Viewer
•
Updated
Sep 9
•
16.6k
•
32
qgallouedec/lm-human-preferences-descriptiveness
Viewer
•
Updated
Sep 9
•
6.26k
•
34
qgallouedec/lm-human-preferences-sentiment
Viewer
•
Updated
Sep 9
•
6.26k
•
34
qgallouedec/tldr-preference
Viewer
•
Updated
Sep 9
•
179k
•
34
qgallouedec/tldr
Viewer
•
Updated
Sep 9
•
130k
•
42
qgallouedec/hh-rlhf-helpful-base
Viewer
•
Updated
Sep 5
•
46.2k
•
35
qgallouedec/hh-rlhf-helpful-base-trl-style
Viewer
•
Updated
Sep 5
•
46.2k
•
44