| Quantization | VRAM Required | Min GPU |
|---|---|---|
| FP16 (Half Precision) | 4.0GB | RTX 3070 / 4060 |
| INT8 (8-bit Integer) | 2.0GB | RTX 3070 / 4060 |
| Q4_K_M (GGUF 4-bit) | 1.0GB | RTX 3070 / 4060 |
| q3_k_m | 0.8GB | RTX 3070 / 4060 |
24.0GB VRAM
24.0GB VRAM
24.0GB VRAM
24.0GB VRAM
24.0GB VRAM
24.0GB VRAM