Skip to content
localmodel.run

Device profile · Windows

Best local LLMs for 8GB RAM Laptop (CPU/iGPU only)

8GB RAM Laptop (CPU/iGPU only) has ~5 GB usable for model weights and runs 24 of 67 popular models. Best tool: LM Studio.

Usable memory
~5 GB
Models run
24
Too large
43
Top pick
4B
Top pick Q4_K_M

Runs at Q4_K_M using ~3.8 GB of ~5 GB usable.

Runs on 8GB RAM Laptop (CPU/iGPU only)

Too large for this device

Best way to run models on Windows

Runtime guide Windows

Beginner: LM Studio, Best GUI on Windows, auto-detects CUDA/Vulkan backends.

Power user: Ollama (CUDA), Scriptable server; CUDA path is fastest on NVIDIA.

AMD GPUs run via Vulkan/ROCm at roughly half CUDA throughput. NVIDIA is the smooth path on Windows.

Full Windows tool guide →

FAQ

What is the best local LLM for 8GB RAM Laptop (CPU/iGPU only)?

Gemma 3 4B is the strongest model that runs comfortably, using ~3.8 GB at Q4_K_M of the ~5 GB usable on 8GB RAM Laptop (CPU/iGPU only).

How much of 8GB RAM Laptop (CPU/iGPU only)'s memory can I use for a model?

About 5 GB. On a CPU-only machine, leave headroom for the OS and apps.

Which tool should I use on Windows?

LM Studio (Best GUI on Windows, auto-detects CUDA/Vulkan backends.) or Ollama (CUDA) for speed. AMD GPUs run via Vulkan/ROCm at roughly half CUDA throughput. NVIDIA is the smooth path on Windows.

Sources

Memory figures are estimates. See methodology.