Qwen/Qwen2-7B-Instruct-GPTQ-Int8
Text Generation
Transformers
Safetensors
English
qwen2
chat
conversational
text-generation-inference
Inference Endpoints
8-bit precision
gptq
arxiv:2309.00071
License: apache-2.0
Discussions
I tried with and without vLLM; "RuntimeError: probability tensor contains either `inf`, `nan` or element < 0" still occurs during inference with Transformers!
#2 opened 4 months ago by Zaiping (1 comment)
use_exllama?
#1 opened 4 months ago by DDDSSS