I've been running local LLMs for quite some time now, and LM Studio is one of the best apps to enjoy the benefits of a local LLM on your machine. It's polished, has a nice model browser, and it makes downloading models from Hugging Face feel almost effortless—until it doesn't.

Model downloads can sometimes get stuck, and the frustrating ritual of manually unloading one model, reconfiguring the GPU layers, and reloading another is not an enjoyable process to go through. But LM Studio isn't the only local LLM app that's easy to use, and setting up Ollama might just save you precious hours.

Close up of Jan and LM studio open on a laptop.
I stopped using LM Studio once I found this open-source alternative

LM Studio had competition. I found it.

6

The simplest way to run local AI

What Ollama is and why it exploded in popularity

Ollama is a lightweight, open-source runtime for running LLLMs locally. While LM Studio gives you a full desktop GUI with model browsing, chat tabs, and server controls, Ollama strips everything down to a clean command-line workflow and a local HTTP API. It runs a background server the moment you install it, and everything else, from downloading models, switching between them, and querying them, happens via the terminal or through that API. There's also a minimalistic UI if that's what you prefer.

If you've used Docker before, the model is almost identical. You pull an image—or in this case a model—and run it. Ollama pull [model name] fetches the model, ollama run [model name] runs it, and drops you right into an interactive chat. It might seem restrictive, but the entire process from a fresh install to chatting with a 7B model takes under five minutes on a decent connection.

ollama-logo
OS
Windows, macOS, Linux
Developer
Ollama
Price model
Free, Open-source

A lightweight local runtime that lets you download and run large language models on your own machine with a single command.

I was up and running in minutes

A setup process that skips most of the usual friction

Ollama active and running
Afam Onyimadu / MUO

Installing Ollama is a single curl command on Linux. On Windows, you can use the standard installer from Ollama's website. Once the install is complete, Ollama starts a background service automatically, and you're ready to pull models.

The model library on Ollama's website covers everything you'd expect. Llama 3, Mistral, Gemma 3, Phi-4, DeepSeek, Qwen, and a growing list of others. You can copy the run command right from a model's page, paste it in your terminal, and Ollama handles the download and launch in one step. No navigating a model browser, no separate download queue, no waiting for an app to register the file in its internal catalog.

Switching models is equally frictionless. There's no manual unloading that you have to do, and no memory management sliders to fiddle with. You just run a different model name, Ollama handles the rest in the background.

The API is the real killer feature

Why developers build entire workflows around Ollama

Ollama website open in Zen browser
Yadullah Abidi / MakeUseOf

To me, the most important part is the API. Ollama exposes an OpenAI-compatible Chat Completions endpoint at http://localhost:11434/v1. That means any tool or script already built for the OpenAI API works out of the box with your local models. You point the URL to localhost, set the API key to a dummy string (since it's not validated locally), and you're done.

This is huge if you're building anything. I have a handful of Python scripts that call the OpenAI API for testing. Switching them to Ollama took about 30 seconds of the editing mentioned above. Change the base URL and model name, and no need to touch anything else in the code at all. By comparison, LM Studio does have a local server mode with similar compatibility, but getting it properly configured adds multiple steps and quite a bit of GUI navigation that Ollama simply doesn't require.

You do lose a few conveniences

The features and UI polish that LM Studio still does better

Honestly, Ollama isn't for everyone. If you genuinely prefer browsing models visually, reading their metadata, and playing with parameters via a UI, LM Studio's Discover tab is a much better option for you. Ollama also doesn't give you real-time token throughput stats or a built-in chat interface as detailed as LM Studio's.

LM Studio's catalog is also broader if you're looking at pure model management. It also handles pulling from Hugging Face directly and supports GPTQ formats that Ollama doesn't natively handle.

Local LLMs are for using, not configuring

If you're spending more time configuring AI than using it, try Ollama

So that's where I landed: a terminal window, a tiny background service, and models that just work when I call them. No spinning program wheels, no half-loaded models, no mystery settings buried three menus deep. I still think LM Studio is great for beginners and for people wanting a rich GUI. But if you're looking for speed and the least amount of hassle for running your LLMs locally, Ollama is the way to go. The fix for local LLMs was never a bigger model; it was efficiency with the smaller ones.

LM Studio running an AI chat.
I’ll never pay for AI again

AI doesn’t have to cost you a dime—local models are fast, private, and finally worth switching to.

7

The switch costs nothing and gives you back hours you could easily spend wrestling with LM Studio's loading behavior. For anyone primarily running local models to power scripts, tools, or integrations—rather than chatting through a built-in GUI—Ollama is the faster, leaner, and less frustrating path. The terminal isn't intimidating at all once you realize the entire workflow essentially boils down to two commands. Everything else follows naturally from there.