Ollama
Local LLM runner and model library with a simple CLI and HTTP API for on-workstation inference.
Why it is included
Lowers the friction of privacy-preserving inference and offline experimentation with open-weight models.
Best for
Developers and power users testing models without cloud API bills.
If you use Windows, macOS, or paid tools
A local-model alternative to the ChatGPT, Claude, and Gemini cloud APIs for on-machine experimentation.
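The CLI and API mentioned above are easy to exercise: `ollama run <model>` gives an interactive session, and the daemon exposes a JSON-over-HTTP endpoint. The sketch below assumes a running Ollama server on its default port (11434) and an already-pulled model; the model name `llama3.2` is illustrative, substitute whatever you have pulled.

```python
import json
import urllib.request

# Ollama's default local generation endpoint.
OLLAMA_URL = "http://localhost:11434/api/generate"

def build_payload(prompt: str, model: str = "llama3.2") -> dict:
    """Assemble the JSON body for a single non-streaming generation."""
    return {"model": model, "prompt": prompt, "stream": False}

def generate(prompt: str, model: str = "llama3.2") -> str:
    """POST to the local Ollama server and return the generated text."""
    req = urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(build_payload(prompt, model)).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["response"]

# Example (requires a running server and `ollama pull llama3.2` beforehand):
#   print(generate("Why is the sky blue? Answer in one sentence."))
```

Because the API is plain JSON over HTTP, the same request works from curl or any HTTP client; no SDK is required for quick experiments.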
Strengths
- Simple UX
- Model packaging
- Local-first
Limitations
- Constrained by local hardware (RAM/VRAM)
- Model licenses must be verified separately
Good alternatives
llama.cpp · vLLM
Related tools
AI & Machine Learning
PyTorch
Deep learning framework with strong research-to-production paths.
llama.cpp
Plain C/C++ inference for LLaMA-class models with broad community backends.
MLX LM
Apple MLX-based LLM inference and training on Apple silicon: efficient Metal-backed transformers and examples for local chat models.
llamafile
Single-file distributable LLM weights + llama.cpp runtime: run large models from one executable with broad OS CPU/GPU support.
ExLlamaV2
Memory-efficient CUDA inference kernels for quantized Llama-class models: popular in consumer GPU chat UIs.
vLLM
High-throughput LLM serving with PagedAttention, continuous batching, and OpenAI-compatible APIs for GPU clusters.
