Device profile · Windows

Best local LLMs for 8GB RAM Laptop (CPU/iGPU only)

8GB RAM Laptop (CPU/iGPU only) has ~5 GB usable for model weights and runs 24 of 67 popular models. Best tool: LM Studio.

Usable memory: ~5 GB
Models run: 24
Too large: 43
Top pick: 4B

Top pick Q4_K_M

Gemma 3 4B Yes

Runs at Q4_K_M using ~3.8 GB of ~5 GB usable.

Runs on 8GB RAM Laptop (CPU/iGPU only)

Compatible models 24 total

Best way to run models on Windows

Runtime guide Windows

Beginner: LM Studio, Best GUI on Windows, auto-detects CUDA/Vulkan backends.

Power user: Ollama (CUDA), Scriptable server; CUDA path is fastest on NVIDIA.

AMD GPUs run via Vulkan/ROCm at roughly half CUDA throughput. NVIDIA is the smooth path on Windows.

Full Windows tool guide →

FAQ

What is the best local LLM for 8GB RAM Laptop (CPU/iGPU only)?

Gemma 3 4B is the strongest model that runs comfortably, using ~3.8 GB at Q4_K_M of the ~5 GB usable on 8GB RAM Laptop (CPU/iGPU only).

How much of 8GB RAM Laptop (CPU/iGPU only)'s memory can I use for a model?

About 5 GB. On a CPU-only machine, leave headroom for the OS and apps.

Which tool should I use on Windows?

LM Studio (Best GUI on Windows, auto-detects CUDA/Vulkan backends.) or Ollama (CUDA) for speed. AMD GPUs run via Vulkan/ROCm at roughly half CUDA throughput. NVIDIA is the smooth path on Windows.

Sources

Memory figures are estimates. See methodology.

Best local LLMs for 8GB RAM Laptop (CPU/iGPU only)

Runs on 8GB RAM Laptop (CPU/iGPU only)

Too large for this device

Best way to run models on Windows

FAQ

Sources