Update README.md
Browse files
README.md
CHANGED
|
@@ -549,6 +549,8 @@ These quants can be used with Nvidia RTX GPU on Windows or RTX/AMD ROCm on Linux
|
|
| 549 |
Ensure they fit your GPU for optimal performance. For example on a 12GB GPU you might fit 5bpw quant with about 16k context.
|
| 550 |
With 16GB it might be possible to fit 32k context with 5bpw quant.
|
| 551 |
|
|
|
|
|
|
|
| 552 |
# Original model card
|
| 553 |
|
| 554 |
<div class="container">
|
|
|
|
| 549 |
Ensure they fit your GPU for optimal performance. For example on a 12GB GPU you might fit 5bpw quant with about 16k context.
|
| 550 |
With 16GB it might be possible to fit 32k context with 5bpw quant.
|
| 551 |
|
| 552 |
+
23.08.2025 Fixed corrupted generation_config.json file. If you downloaded the model earlier and were unable to load it, just redownload this file.
|
| 553 |
+
|
| 554 |
# Original model card
|
| 555 |
|
| 556 |
<div class="container">
|