Audio model · whisper
Whisper small requirements
Speech to text · 244M params · int8 / fp16 · released Sep 2022. Light enough to run on CPU, no GPU required.
Run it
Whisper small runs in whisper.cpp, faster-whisper or MacWhisper at fp16 (whisper.cpp). It runs CPU-only, and the smaller tiers are fast enough for real-time use on a laptop or phone.
Which devices can run Whisper small?
Apple Silicon Macs
- Apple M1 (8GB) Yes
- Apple M2 (16GB) Yes
- Apple M4 (16GB) Yes
- Apple M5 (16GB) Yes
- Apple M3 Pro (18GB) Yes
- Apple M4 (24GB) Yes
- Apple M4 Pro (24GB) Yes
- Apple M5 (32GB) Yes
- Apple M4 Pro (48GB) Yes
- Apple M5 Pro (48GB) Yes
- Apple M4 Max (64GB) Yes
- Apple M4 Max (128GB) Yes
- Apple M5 Max (128GB) Yes
- Apple M3 Ultra (256GB) Yes
RAM-only laptops
iPhone & iPad
Android
NVIDIA GPUs
AMD GPUs
FAQ
How much memory does Whisper small need?
At fp16 (whisper.cpp) it consumes ~0.85 GB. It runs on CPU, so a GPU is optional.
Can Whisper small run on a phone or CPU?
Yes for CPU. The smaller tiers are light enough for real-time use, and on-device phone runtimes are available.
Can I use Whisper small commercially?
Yes. Whisper small is licensed MIT, which permits commercial use.
The phone-friendly Whisper tier (244M). whisper.cpp runtime RAM is ~852MB; the ggml model is ~466MB. Runs comfortably CPU-only and on-device on modern phones. Good accuracy/speed balance when large-v3 is too heavy. License MIT. Sources: OpenAI Whisper repo, whisper.cpp README.
Sources
Memory is a sourced peak-usage anchor at fp16 (whisper.cpp), validated 2026-06-15. See methodology.