# Can I run Qwen2.5-VL 7B on Apple M3 Ultra (256GB)?

Updated: 2026-06-15

**Yes, it runs.** Runs at Q4_K_M using ~7.1 GB of ~192 GB usable. You have room for FP16 for higher quality.

- Model: 8.29B, Q4_K_M 5.62 GB
- Device: 256 GB unified, ~192 GB usable for weights
- Needs ~7.1 GB at Q4_K_M; recommended quant: Q4_K_M
- Best tool on macOS: LM Studio

Estimate. Method: weights + KV cache + ~0.8GB overhead. Sources: https://huggingface.co/ggml-org/Qwen2.5-VL-7B-Instruct-GGUF, https://huggingface.co/Qwen/Qwen2.5-VL-7B-Instruct.

More: https://localmodel.run/can-i-run/qwen2.5-vl-7b/apple-m3-ultra-256gb