Running local models is good now

I’ve been working with local models since they came out, and they’re finally surprisingly good. I have a 2022 M2 Mac with 64 GB RAM and 1 TB storage, and I’ve used
- Mistral 7B
- Gemma 3
- OpenAI gpt-oss-20b
- Qwen 3 MoE, as well as a number of other Qwen variants like Qwen 2.5 Coder

across a lot of different system setups, like
- raw llama.cpp with Open WebUI
- llama-cpp-python
- Ollama
- llamafiles and
- LM Studio

Where are local models now?

Early on, models...


