Text model · Command R
Command R 35B requirements
Command R family · 35B params · released Mar 2024 · 1.4M Ollama pulls. Minimum to run at Q4_K_M: NVIDIA GeForce RTX 4090 (24 GB).
Quantization sizes
| Quantization | Size on disk |
|---|---|
| Q2_K | 14.7 GB est |
| Q3_K_M | 17.1 GB est |
| Q4_K_M (default) | 20.05 GB |
| Q5_K_M | 24.9 GB est |
| Q6_K | 28.7 GB est |
| Q8_0 | 34.63 GB |
| FP16 | 70 GB est |
Lower-bit quantizations are smaller and faster, at some cost in output quality. Q4_K_M is the common default.
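To make the size/precision trade-off concrete, the sketch below converts each file size in the table into an approximate number of bits stored per weight. It assumes the headline 35B parameter count and treats 1 GB as 10^9 bytes; both are assumptions for illustration, and the names `SIZES_GB` and `bits_per_weight` are hypothetical.

```python
# Sizes on disk in GB, copied from the table above.
# Entries marked "est" in the table are estimates, so the results are too.
SIZES_GB = {
    "Q2_K": 14.7,
    "Q3_K_M": 17.1,
    "Q4_K_M": 20.05,
    "Q5_K_M": 24.9,
    "Q6_K": 28.7,
    "Q8_0": 34.63,
    "FP16": 70.0,
}

# Assumption: the headline "35B params" figure, in billions of weights.
PARAMS_BILLIONS = 35.0


def bits_per_weight(size_gb: float, params_billions: float = PARAMS_BILLIONS) -> float:
    """Average bits per weight implied by a file size (1 GB taken as 1e9 bytes)."""
    return size_gb * 8.0 / params_billions


if __name__ == "__main__":
    for name, size in SIZES_GB.items():
        print(f"{name:8s} {size:6.2f} GB  ~{bits_per_weight(size):.2f} bits/weight")
```

The result is an average over the whole file, so it also absorbs file metadata and any tensors stored at a different precision; treat it as a rough comparison between rows, not an exact property of each format.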
Run it
ollama run command-r:35b
llama-cli -hf bartowski/c4ai-command-r-v01-GGUF:Q4_K_M
lms get bartowski/c4ai-command-r-v01-GGUF
Which devices can run Command R 35B?
Apple Silicon Macs
RAM-only laptops
iPhone & iPad
Android
NVIDIA GPUs
FAQ
How much VRAM or RAM does Command R 35B need?
At Q4_K_M with a 4k context, Command R 35B needs about 22.3 GB: roughly 20.05 GB of weights plus KV cache and runtime overhead. At Q8_0, budget about 36.8 GB.
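As a rough way to apply these numbers, the sketch below picks the largest quantization that fits a given memory budget. It assumes a flat allowance of 2.25 GB for KV cache and overhead at a 4k context, which is simply the Q4_K_M total minus its weights (22.3 GB − 20.05 GB); that allowance, and the names `OVERHEAD_GB`, `required_gb`, and `largest_quant_that_fits`, are illustrative assumptions and not the page's actual methodology.

```python
from typing import Optional

# Sizes on disk in GB, copied from the quantization table.
SIZES_GB = {
    "Q2_K": 14.7,
    "Q3_K_M": 17.1,
    "Q4_K_M": 20.05,
    "Q5_K_M": 24.9,
    "Q6_K": 28.7,
    "Q8_0": 34.63,
    "FP16": 70.0,
}

# Assumption: flat KV cache + runtime overhead at a 4k context,
# derived from the Q4_K_M figure (22.3 GB total - 20.05 GB weights).
OVERHEAD_GB = 2.25


def required_gb(quant: str) -> float:
    """Estimated memory needed to run a quantization at a 4k context."""
    return SIZES_GB[quant] + OVERHEAD_GB


def largest_quant_that_fits(budget_gb: float) -> Optional[str]:
    """Return the largest quantization whose estimate fits the budget, or None."""
    best = None
    for quant, size in SIZES_GB.items():
        fits = size + OVERHEAD_GB <= budget_gb
        if fits and (best is None or size > SIZES_GB[best]):
            best = quant
    return best


if __name__ == "__main__":
    for budget in (16.0, 24.0, 48.0):
        print(budget, "GB ->", largest_quant_that_fits(budget))
```

With a 24 GB budget this selects Q4_K_M, matching the minimum GPU stated above. For Q8_0 the flat allowance gives about 36.9 GB, slightly above the ~36.8 GB quoted, so the real overhead evidently varies by quantization; longer contexts will also need a larger KV cache than this allowance covers.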
Can Command R 35B run on a laptop?
Only on a high-memory one. Command R 35B is large: at Q4_K_M you need a 24 GB+ GPU or a Mac with 32-48 GB of memory.
Can I use Command R 35B commercially?
No. The model is licensed under CC BY-NC 4.0, which permits non-commercial use only.
Command R 35B is Cohere's model tuned for RAG and tool use, with a 128K context window. The Q4_K_M and Q8_0 sizes come from the bartowski GGUF repo; sizes marked "est" are estimates.
Sources
Memory figures are estimates. See methodology.