Peter Song, Author at ML Journey - Page 47 of 165

Why Is Distillation Important in LLM & SLM?

October 16, 2025 by Peter Song

The AI landscape faces a fundamental tension: larger language models deliver better performance, yet their computational demands make deployment prohibitively expensive for many applications. Distillation—the process of transferring knowledge from large “teacher” models to smaller “student” models—has emerged as one of the most important techniques for resolving this tension. Understanding why distillation matters reveals not … Read more

How to Fine-Tune a Small LLM for Domain Tasks

October 15, 2025 by Peter Song

Fine-tuning small language models for specialized domain tasks has become one of the most practical and cost-effective approaches to deploying AI in production. While massive models like GPT-4 offer impressive general capabilities, a well-fine-tuned 7B parameter model can outperform them on specific tasks at a fraction of the inference cost. This guide walks through the … Read more

Using Google Gemini in Jupyter Notebooks

October 14, 2025 by Peter Song

Jupyter Notebooks have become the go-to environment for data scientists, researchers, and developers who need an interactive workspace for code, documentation, and visualization. With Google’s Gemini AI now offering powerful multimodal capabilities through a straightforward API, integrating it into your Jupyter workflow opens up extraordinary possibilities—from analyzing datasets to generating code, processing images, and creating … Read more

Data Transformation Techniques for ML Readiness

October 14, 2025 by Peter Song

Machine learning models are only as good as the data they’re trained on. While collecting vast amounts of data has become easier, ensuring that data is actually ready for machine learning remains one of the most challenging—and crucial—steps in any ML pipeline. Data transformation techniques bridge this gap, converting raw, messy data into clean, structured … Read more

Orchestrating ML Workflows Using Airflow or Dagster

October 14, 2025 by Peter Song

Machine learning workflows are complex beasts. They involve data extraction, validation, preprocessing, feature engineering, model training, evaluation, deployment, and monitoring—all of which need to run reliably, often on schedules, and with proper handling of failures and dependencies. This is where workflow orchestration tools become essential. Apache Airflow and Dagster have emerged as two leading solutions, … Read more

Data Engineering vs Data Science vs Machine Learning

October 14, 2025 by Peter Song

The data ecosystem has exploded over the past decade, creating distinct career paths that often confuse aspiring professionals and even established organizations. While data engineering, data science, and machine learning are deeply interconnected, they represent fundamentally different disciplines with unique skills, responsibilities, and outcomes. Understanding these differences is crucial whether you’re planning your career path, … Read more

Small LLM vs Large LLM Tradeoffs in Inference Cost

October 14, 2025 by Peter Song

The explosion of large language models has created a critical decision point for organizations: should you deploy massive models that deliver cutting-edge performance, or opt for smaller, more efficient alternatives? This isn’t just a technical question—it’s fundamentally about economics. Inference costs—the expenses incurred every time a model generates a response—can make or break the viability … Read more

How to Build End-to-End ML Pipelines with Airflow and DBT

October 14, 2025 by Peter Song

Building production-ready machine learning pipelines requires orchestrating complex workflows that transform raw data into model predictions. Apache Airflow and dbt (data build tool) have emerged as a powerful combination for this task—Airflow handles workflow orchestration and dependency management, while dbt brings software engineering best practices to data transformation. Together, they enable teams to build maintainable, … Read more

How Gemini Uses Deep Learning and Neural Networks

October 13, 2025 by Peter Song

Google’s Gemini represents a significant leap forward in artificial intelligence, built on sophisticated deep learning architectures and neural networks that enable it to understand and generate human-like responses across multiple modalities. Understanding how Gemini leverages these technologies reveals the intricate engineering behind one of the most advanced AI systems available today. The Foundation: Transformer Architecture … Read more

LLM Cost Reduction Strategies: Practical Techniques to Slash Your AI Spending

October 13, 2025 by Peter Song

Large language models have revolutionized how businesses operate, but their costs can quickly spiral out of control. Organizations frequently discover that their initial API bills of a few hundred dollars have ballooned into monthly expenses exceeding tens of thousands—sometimes even hundreds of thousands—of dollars. The good news? Most companies can dramatically reduce their LLM costs … Read more