Ollama
A local runtime for downloading and serving large language models on personal or private infrastructure.
Ollama is used to run models locally for development, privacy-sensitive experiments, offline workflows, and simple self-hosted inference, without requiring any cloud infrastructure.
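Once a model has been pulled, Ollama exposes it over a local REST API. The sketch below, which assumes a server running on the default port 11434 and a model named "llama3" already downloaded, shows one way to request a completion from it using only the Python standard library:

```python
# Minimal sketch of calling a locally running Ollama server over its REST API.
# Assumes Ollama is serving on the default http://localhost:11434 and that the
# model named here (e.g. "llama3") has already been pulled.
import json
import urllib.request


def build_generate_payload(model: str, prompt: str) -> bytes:
    # The /api/generate endpoint takes a JSON body with the model name and
    # prompt; stream=False asks for one complete JSON response instead of a
    # stream of partial chunks.
    return json.dumps(
        {"model": model, "prompt": prompt, "stream": False}
    ).encode("utf-8")


def generate(model: str, prompt: str,
             host: str = "http://localhost:11434") -> str:
    req = urllib.request.Request(
        f"{host}/api/generate",
        data=build_generate_payload(model, prompt),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        # The non-streaming response carries the generated text in "response".
        return json.loads(resp.read())["response"]


if __name__ == "__main__":
    # Requires a local Ollama server and the model to be available.
    print(generate("llama3", "Why run language models locally?"))
```

Because everything runs on localhost, no request or prompt data leaves the machine, which is the point for privacy-sensitive or offline use.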