fastapi Archives - ML Journey

How to Serve Local LLMs as an API (FastAPI + Ollama)

January 25, 2026 by Peter Song

Running large language models locally gives you privacy, control, and independence from cloud services. But to unlock the full potential of local LLMs, you need to expose them through a robust API that applications can consume reliably. Combining FastAPI (Python’s modern, high-performance web framework) with Ollama’s efficient LLM serving capabilities creates a production-ready API that rivals commercial …

Best Practices for Deploying ML Models with Docker + FastAPI in Production

November 23, 2025 by mljourney

Deploying machine learning models to production environments is the critical bridge between data science experimentation and real-world business value. While Jupyter notebooks and research codebases excel at model development, they fall far short of the reliability, security, and performance that production systems demand when serving predictions at scale. The gap between a trained model achieving …

Deploying Machine Learning Models Using FastAPI

October 22, 2025 by Peter Song

Moving machine learning models from Jupyter notebooks to production systems is a critical transition that many data scientists struggle with. While you might have a model that achieves impressive accuracy on test data, that model provides zero business value until it’s accessible to applications, users, or other systems. FastAPI has emerged as the go-to framework …

How to Deploy a PyTorch Model Using FastAPI and Docker

July 4, 2025 / March 5, 2025 by Peter Song

Deploying machine learning models into production is a critical step in the lifecycle of any AI project. Building and training models is essential, but their real value is realized only when they are deployed and made accessible to end users. In this article, we will walk through the process of deploying a PyTorch model using FastAPI and Docker. This combination …