Peter Song, Author at ML Journey

Virtualenv vs Conda vs Poetry for Machine Learning

January 28, 2026 by Peter Song

Environment management remains one of the most contentious topics in Python development, and machine learning amplifies the complexity. The choice between virtualenv, Conda, and Poetry profoundly impacts your workflow, dependency resolution, reproducibility, and deployment pipeline. While all three tools manage Python environments, their approaches differ fundamentally—especially for machine learning projects with complex dependencies like TensorFlow, … Read more

Using Local LLMs for Private Document Search

January 28, 2026 by Peter Song

Privacy concerns around sensitive documents have made local AI solutions increasingly attractive. Whether you’re managing confidential business documents, personal medical records, legal files, or proprietary research, sending this information to cloud-based AI services poses significant risks. Local large language models (LLMs) combined with vector databases offer a powerful alternative: private, secure document search that never … Read more

How to Reduce VRAM Usage When Running LLMs Locally

January 28, 2026 by Peter Song

Running large language models (LLMs) on your own hardware offers privacy, control, and cost savings compared to cloud-based solutions. However, the primary bottleneck most users face is VRAM (Video Random Access Memory) limitations. Modern LLMs can require anywhere from 4GB to 80GB of VRAM, making them inaccessible to users with consumer-grade GPUs. Fortunately, several proven … Read more

Best Local LLM for RAG (Retrieval-Augmented Generation)

January 27, 2026 by Peter Song

Retrieval-augmented generation has transformed how we build intelligent systems that work with knowledge bases. By combining document retrieval with language model generation, RAG enables AI to answer questions grounded in specific sources rather than relying solely on training data. When implementing RAG locally, choosing the right language model becomes critical—you need a model that follows … Read more

Ollama vs LM Studio vs GPT4All: Which Is Best for Local LLMs?

February 15, 2026January 26, 2026 by Peter Song

The explosion of accessible local LLM tools has created both opportunity and confusion. Three platforms—Ollama, LM Studio, and GPT4All—have emerged as the leading solutions for running large language models on your own hardware. Each takes a fundamentally different approach to the same goal: making AI accessible without cloud dependencies. Choosing between them isn’t about finding … Read more