pmysl
/

c4ai-command-r-plus-GGUF

Text Generation

Inference Endpoints

Model card Files Files and versions Community

pmysl commited on Apr 12

Commit

5c12a1a

•

1 Parent(s): 73b98a9

Update README.md

Files changed (1) hide show

README.md +1 -0

README.md CHANGED Viewed

@@ -25,6 +25,7 @@ In the folder `imatrix`, you can find imatrix quants. The importance matrix was
 | Q2_K     | 5.7178    | +/- 0.03418        |
 | Q3_K_L   | 4.6214    | +/- 0.02629        |
 | Q4_K_M   | 4.4625    | +/- 0.02522        |
 ## Merging Weights
 After commit `8a28d12`, weights are split with `gguf-split`, which means that you don't have to merge weights. Simply pass the first split, as in the example above, and `llama.cpp` will automatically load all splits. If, for some reason, you want to merge splits, you can use the following command:

 | Q2_K     | 5.7178    | +/- 0.03418        |
 | Q3_K_L   | 4.6214    | +/- 0.02629        |
 | Q4_K_M   | 4.4625    | +/- 0.02522        |
+| f16      | 4.3845    | +/- 0.02468        |
 ## Merging Weights
 After commit `8a28d12`, weights are split with `gguf-split`, which means that you don't have to merge weights. Simply pass the first split, as in the example above, and `llama.cpp` will automatically load all splits. If, for some reason, you want to merge splits, you can use the following command: