npc0 commited on
Commit
bcf7256
1 Parent(s): dbaa40a

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -21,7 +21,7 @@ The weights are converted to GGML format using [baichuan13b.cpp](https://github.
21
  |ggml-model-q4_1.bin | q4_1 | 8.36 GB |
22
  |ggml-model-q5_0.bin | q5_0 | 9.17 GB |
23
  |ggml-model-q5_1.bin | q5_1 | 9.97 GB |
24
- <!-- |ggml-model-q8_0.bin | q8_0 | ?.?? GB | -->
25
 
26
  ## How to inference
27
  1. [Compile baichuan13b](https://github.com/ouwei2013/baichuan13b.cpp#build), a main executable `baichuan13b/build/bin/main` and a server `baichuan13b/build/bin/server` will be generated.
 
21
  |ggml-model-q4_1.bin | q4_1 | 8.36 GB |
22
  |ggml-model-q5_0.bin | q5_0 | 9.17 GB |
23
  |ggml-model-q5_1.bin | q5_1 | 9.97 GB |
24
+ |ggml-model-q8_0.bin | q8_0 | 14 GB |
25
 
26
  ## How to inference
27
  1. [Compile baichuan13b](https://github.com/ouwei2013/baichuan13b.cpp#build), a main executable `baichuan13b/build/bin/main` and a server `baichuan13b/build/bin/server` will be generated.