Edit model card

yi-9b-chat-ov

yi-9b-chat-ov is an OpenVino int4 quantized version of 01-ai Yi v1.5 9b Chat, providing a fast inference implementation, optimized for AI PCs using Intel GPU, CPU and NPU.

01-ai-1.5v-9b is a leading general purpose foundation model.

This is a very quality model, and one of the largest that runs effectively on a laptop.

Model Description

  • Developed by: 01-ai
  • Quantized by: llmware
  • Model type: yi-9b-v1.5
  • Parameters: 8.8 billion
  • Model Parent: 01-ai/yi-1.5v-9b
  • Language(s) (NLP): English
  • License: Apache 2.0
  • Uses: General use cases
  • RAG Benchmark Accuracy Score: NA
  • Quantization: int4

Model Card Contact

llmware on github

llmware on hf

llmware website

Downloads last month
17
Inference API
Inference API (serverless) has been turned off for this model.