Skip to content
localmodel.run

Device profile · Windows

Best local LLMs for 32GB RAM Laptop (CPU/iGPU only)

32GB RAM Laptop (CPU/iGPU only) has ~28 GB usable for model weights and runs 58 of 67 popular models. Best tool: LM Studio.

Usable memory
~28 GB
Models run
58
Too large
9
Top pick
30.5B
Top pick Q4_K_M

Runs at Q4_K_M using ~20.7 GB of ~28 GB usable.

Runs on 32GB RAM Laptop (CPU/iGPU only)

Compatible models 58 total

Too large for this device

Best way to run models on Windows

Runtime guide Windows

Beginner: LM Studio, Best GUI on Windows, auto-detects CUDA/Vulkan backends.

Power user: Ollama (CUDA), Scriptable server; CUDA path is fastest on NVIDIA.

AMD GPUs run via Vulkan/ROCm at roughly half CUDA throughput. NVIDIA is the smooth path on Windows.

Full Windows tool guide →

FAQ

What is the best local LLM for 32GB RAM Laptop (CPU/iGPU only)?

Qwen3 30B-A3B is the strongest model that runs comfortably, using ~20.7 GB at Q4_K_M of the ~28 GB usable on 32GB RAM Laptop (CPU/iGPU only).

How much of 32GB RAM Laptop (CPU/iGPU only)'s memory can I use for a model?

About 28 GB. On a CPU-only machine, leave headroom for the OS and apps.

Which tool should I use on Windows?

LM Studio (Best GUI on Windows, auto-detects CUDA/Vulkan backends.) or Ollama (CUDA) for speed. AMD GPUs run via Vulkan/ROCm at roughly half CUDA throughput. NVIDIA is the smooth path on Windows.

Sources

Memory figures are estimates. See methodology.