# Can I run Qwen3 14B on Nvidia GeForce RTX 4070 (12GB)?

Updated: 2026-06-15

**Yes, but tight.** Qwen3 14B fits at Q4_K_M (~10.7 GB of ~11 GB usable), but with little headroom. Close other apps before loading it.

- Model: 14B parameters; Q4_K_M weights ~9 GB
- Device: 12 GB VRAM, ~11 GB usable
- Needs ~10.7 GB at Q4_K_M; recommended quant: Q4_K_M
- Best tool on Windows: LM Studio
- Command (Ollama): `ollama run qwen3:14b`

These figures are estimates. Method: weights + KV cache + ~0.8 GB overhead. Sources: https://ollama.com/library/qwen3/tags, https://huggingface.co/Qwen/Qwen3-14B-GGUF, https://qwenlm.github.io/blog/qwen3/.
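The arithmetic behind the estimate can be sketched in a few lines of Python. This is a minimal illustration, not the page's actual calculator: the function names are hypothetical, and the KV cache figure (~0.9 GB) is not stated on this page but inferred as the remainder of 10.7 GB after subtracting the ~9 GB of weights and ~0.8 GB of overhead.

```python
def estimate_vram_gb(weights_gb: float, kv_cache_gb: float, overhead_gb: float = 0.8) -> float:
    """Sum the three components of the method: weights + KV cache + overhead."""
    return weights_gb + kv_cache_gb + overhead_gb


def fits(required_gb: float, usable_gb: float) -> bool:
    """True when the estimated requirement does not exceed usable VRAM."""
    return required_gb <= usable_gb


# Figures taken from this page.
WEIGHTS_GB = 9.0    # Q4_K_M weights
OVERHEAD_GB = 0.8   # fixed overhead used by the method
TOTAL_GB = 10.7     # stated requirement at Q4_K_M
USABLE_GB = 11.0    # usable VRAM on the 12 GB card

# Assumption: the KV cache share is inferred as the remainder, not a published number.
kv_cache_gb = round(TOTAL_GB - WEIGHTS_GB - OVERHEAD_GB, 1)

required_gb = round(estimate_vram_gb(WEIGHTS_GB, kv_cache_gb, OVERHEAD_GB), 1)
headroom_gb = round(USABLE_GB - required_gb, 1)

print(f"KV cache (inferred): {kv_cache_gb} GB")
print(f"Required: {required_gb} GB, headroom: {headroom_gb} GB, fits: {fits(required_gb, USABLE_GB)}")
```

With these inputs the headroom comes out to roughly 0.3 GB, which is why the verdict is "tight": a longer context (a larger KV cache) or other applications holding VRAM can push the total past the usable limit.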

More: https://localmodel.run/can-i-run/qwen3-14b/nvidia-rtx-4070-12gb