Running local models is good now

Running local models is good now I’ve been working with local models since they came out, and finally, they’re surprisingly good now. I have a 2022 M2 Mac with 64 GB RAM and 1TB storage and I’ve used

Mistral 7B
Gemma 3
OpenAI OSS-20B
Qwen 3 MOE, as well as a number of other Qwen variants like Qwen 2.5 Coder across a lot of different system setups like
raw llama.cpp with Open WebUI
llama-cpp-python
Ollama
llamafiles and
LM Studio Where are local models now? Early on, models...