# Can I run Llama 3.3 70B on Apple M4 Max (128GB)?

Updated: 2026-06-15

**Yes, it runs.** Runs at Q4_K_M using ~45.3 GB of ~96 GB usable. You have room for Q8_0 for higher quality.

- Model: 70B, Q4_K_M 42.52 GB
- Device: 128 GB unified, ~96 GB usable for weights
- Needs ~45.3 GB at Q4_K_M; recommended quant: Q4_K_M
- Best tool on macOS: LM Studio
- Command: `ollama run llama3.3:70b`

Estimate. Method: weights + KV cache + ~0.8GB overhead. Sources: https://ollama.com/library/llama3.3, https://ollama.com/library/llama3.3/tags, https://huggingface.co/bartowski/Llama-3.3-70B-Instruct-GGUF, https://lmarena.ai/leaderboard.

More: https://localmodel.run/can-i-run/llama-3.3-70b/apple-m4-max-128gb