another model request please

#2
by sparsh35 - opened

zephyr-7b-gemma-v0.1 this one is dpo trained version of popular model gemma , showing good results , i need a awq version of it.
Much thanks
model repo is
HuggingFaceH4/zephyr-7b-gemma-v0.1
https://huggingface.co/HuggingFaceH4/zephyr-7b-gemma-v0.1/tree/main

SolidRusT Networks org

I seem to get an out of memory error, despite optimizations, only for the Gemma models.

Maybe try the people in this list, as they may have 24GB GPU for the Gemma quants: https://huggingface.co/models?search=gemma%20awq

Thank you

Sign up or log in to comment