Right now, running Ollama via Docker on macOS doesn't use the GPU: Docker Desktop runs containers inside a Linux VM that has no access to the Apple Silicon GPU, so there's no Metal acceleration. That's a big performance hit, especially when working with larger models. This blog post lays it out clearly: https://chariotsolutions.com/blog/post/apple-silicon-gpus-docker-and-ollama-pick-two/
When you run Ollama natively, it detects and uses the Metal backend:
level=INFO msg="Dynamic LLM libraries [metal]"
But in Docker:
level=INFO msg="no GPU detected"
It’d be great if Ollama could run natively on macOS without Docker, so we can take advantage of Apple’s GPU acceleration. Performance and dev experience would be much better.
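In the meantime, one workaround is to run the Ollama server natively and point the rest of the stack at it. A minimal sketch assuming the Homebrew formula (the macOS app from ollama.com works too):

```sh
# Install Ollama natively so it can use the Metal backend directly
brew install ollama

# Run in the foreground (or `brew services start ollama` to run it in the background)
ollama serve
```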