Device profile · Windows

Best local LLMs for 32GB RAM Laptop (CPU/iGPU only)

32GB RAM Laptop (CPU/iGPU only) has ~28 GB usable for model weights and runs 58 of 67 popular models. Best tool: LM Studio.

Usable memory: ~28 GB
Models run: 58
Too large: 9
Top pick: 30.5B

Top pick Q4_K_M

Qwen3 30B-A3B Yes

Runs at Q4_K_M using ~20.7 GB of ~28 GB usable.

Runs on 32GB RAM Laptop (CPU/iGPU only)

Compatible models 58 total

Too large for this device

DeepSeek R1 DeepSeek V3 Qwen3 235B A22B gpt-oss 120B Llama 4 Scout Sarvam-105B Qwen2.5 72B Llama 3.3 70B Mixtral 8x7B

Best way to run models on Windows

Runtime guide Windows

Beginner: LM Studio, Best GUI on Windows, auto-detects CUDA/Vulkan backends.

Power user: Ollama (CUDA), Scriptable server; CUDA path is fastest on NVIDIA.

AMD GPUs run via Vulkan/ROCm at roughly half CUDA throughput. NVIDIA is the smooth path on Windows.

Full Windows tool guide →

FAQ

What is the best local LLM for 32GB RAM Laptop (CPU/iGPU only)?

Qwen3 30B-A3B is the strongest model that runs comfortably, using ~20.7 GB at Q4_K_M of the ~28 GB usable on 32GB RAM Laptop (CPU/iGPU only).

How much of 32GB RAM Laptop (CPU/iGPU only)'s memory can I use for a model?

About 28 GB. On a CPU-only machine, leave headroom for the OS and apps.

Which tool should I use on Windows?

LM Studio (Best GUI on Windows, auto-detects CUDA/Vulkan backends.) or Ollama (CUDA) for speed. AMD GPUs run via Vulkan/ROCm at roughly half CUDA throughput. NVIDIA is the smooth path on Windows.

Sources

Memory figures are estimates. See methodology.