# Can I run Qwen2.5 32B on Nvidia GeForce RTX 4080 (16GB)?

Updated: 2026-06-15

**No, not enough memory.** The model needs ~22.1 GB even at Q4_K_M, but only ~15 GB of the card's 16 GB is usable.

- Model: Qwen2.5 32B; Q4_K_M weights are 19.85 GB
- Device: 16 GB VRAM, ~15 GB usable for weights
- Needs ~22.1 GB at Q4_K_M; recommended quant: n/a
- Best tool on Windows: LM Studio
- Command (Ollama): `ollama run qwen2.5:32b`

This is an estimate. Method: weights + KV cache + ~0.8 GB overhead. Sources: https://ollama.com/library/qwen2.5, https://huggingface.co/bartowski/Qwen2.5-32B-Instruct-GGUF, https://qwenlm.github.io/blog/qwen2.5/.
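The arithmetic behind the verdict can be sketched in a few lines of Python. This is only an illustration using the figures quoted on this page. The page does not state the KV cache size, so the sketch backs it out as the remainder of the ~22.1 GB total, which is an assumption about how the total was built. The variable and function names are hypothetical.

```python
# Figures quoted on this page (all approximate).
WEIGHTS_GB = 19.85       # Q4_K_M weights
OVERHEAD_GB = 0.8        # fixed runtime overhead
TOTAL_NEEDED_GB = 22.1   # stated total requirement at Q4_K_M
USABLE_VRAM_GB = 15.0    # usable part of the 16 GB card


def implied_kv_cache_gb(total_gb: float, weights_gb: float, overhead_gb: float) -> float:
    """KV cache share implied by total = weights + KV cache + overhead.

    Assumption: the page does not list this value; it is derived here.
    """
    return total_gb - weights_gb - overhead_gb


def fits(needed_gb: float, usable_gb: float) -> bool:
    """True if the estimated requirement fits in usable VRAM."""
    return needed_gb <= usable_gb


kv_gb = implied_kv_cache_gb(TOTAL_NEEDED_GB, WEIGHTS_GB, OVERHEAD_GB)
shortfall_gb = TOTAL_NEEDED_GB - USABLE_VRAM_GB

print(f"Implied KV cache: ~{kv_gb:.2f} GB")
print(f"Fits in usable VRAM: {fits(TOTAL_NEEDED_GB, USABLE_VRAM_GB)}")
print(f"Shortfall: ~{shortfall_gb:.1f} GB")
print(f"Weights alone fit: {fits(WEIGHTS_GB, USABLE_VRAM_GB)}")
```

With these numbers the weights alone (19.85 GB) already exceed the ~15 GB of usable VRAM, so the verdict does not depend on the KV cache or overhead terms.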

More: https://localmodel.run/can-i-run/qwen2.5-32b/nvidia-rtx-4080-16gb