How to Run Ollama with Docker and Docker Compose

A complete guide to running Ollama in Docker: CPU-only and NVIDIA GPU passthrough with the NVIDIA Container Toolkit, AMD ROCm setup, pulling models into the running container, a full Docker Compose stack combining Ollama with Open WebUI, environment variables for parallel requests and model keep-alive, persisting models across redeployments with named volumes, and when to use Docker versus native Ollama installation.

Msty: The Local LLM App That Lets You Compare Models Side by Side

A complete guide to Msty, the local LLM desktop app built for multi-model comparison: connecting to existing Ollama models, running the same prompt against multiple models simultaneously, attaching document Knowledge collections to conversations, conversation branching for exploring different approaches, the prompts library for reusable system prompts, integrating cloud providers like OpenAI and Anthropic alongside local models, and how Msty compares to Jan, LM Studio, and Open WebUI.

How to Set Up Continue in VS Code with Ollama for Local AI Coding

A complete setup guide for the Continue VS Code extension with Ollama: installing Continue and configuring config.json with chat and autocomplete models, enabling tab completions with fill-in-the-middle using Qwen2.5-Coder, indexing your codebase with nomic-embed-text embeddings for @codebase search, using built-in slash commands for editing, testing and commenting code, creating custom slash commands, switching between multiple local and cloud models, JetBrains setup, and troubleshooting connection issues.

How to Add OpenTelemetry Tracing to LLM Applications

A practical guide to instrumenting LLM applications with OpenTelemetry: setting up the SDK and OTLP exporter, creating spans around LLM calls with GenAI semantic conventions, tracing multi-step RAG pipelines with nested spans, what attributes to record, backend options from Grafana Tempo to Honeycomb and Langfuse, and when auto-instrumentation is enough vs when to add manual spans.

Best Open-Source LLMs in 2026: A Practical Guide by Use Case

A practical guide to the best open-source LLMs in 2026 organised by use case and hardware tier: Llama 3.3 70B for best overall quality, Llama 3.2 8B for most users, Qwen2.5-Coder for coding, Phi-4 Mini and Qwen2.5 3B for small/CPU setups, Gemma 3 27B for long context, DeepSeek-R1 for reasoning, Qwen2.5 for multilingual use, and a quick reference table matching models to hardware from 4GB to 48GB RAM.

Jan AI: The Open-Source Local LLM Desktop App Explained

A complete guide to Jan, the open-source local LLM desktop app: installing with no dependencies, browsing and downloading models from the built-in hub, importing local GGUF files, using the chat interface with file attachments, enabling the OpenAI-compatible API on port 1337, hardware detection and GPU layer configuration, how Jan compares to Ollama and LM Studio, and the extension system for adding remote API support and document Q&A.