Peter Song, Author at ML Journey

Transformer Neural Network Step by Step with Example

September 8, 2025September 6, 2025 by Peter Song

The transformer neural network architecture has fundamentally revolutionized the field of artificial intelligence, powering breakthrough models like GPT, BERT, and countless other state-of-the-art applications. Introduced in the groundbreaking paper “Attention Is All You Need” by Vaswani et al. in 2017, transformers have become the backbone of modern natural language processing and beyond. Understanding how these … Read more

Using Large Language Models for Data Extraction Tasks

September 8, 2025September 6, 2025 by Peter Song

Data extraction has long been one of the most time-consuming and labor-intensive processes in business operations, research, and analytics. Traditional methods often require extensive manual work, complex rule-based systems, or specialized tools that struggle with unstructured data. However, large language models (LLMs) are revolutionizing this landscape, offering unprecedented capabilities to extract, structure, and analyze information … Read more

How to Generate Synthetic Tabular Data with CTGAN

September 8, 2025August 29, 2025 by Peter Song

In today’s data-driven world, access to high-quality datasets is crucial for machine learning research, model development, and business analytics. However, obtaining real data often comes with significant challenges: privacy concerns, regulatory compliance issues, data scarcity, and expensive data collection processes. This is where synthetic data generation becomes invaluable, and CTGAN (Conditional Tabular Generative Adversarial Network) … Read more

What Are Vision Transformers and How Do They Work?

September 8, 2025August 28, 2025 by Peter Song

The landscape of computer vision has undergone a revolutionary transformation with the introduction of Vision Transformers (ViTs). These groundbreaking models have challenged the long-standing dominance of Convolutional Neural Networks (CNNs) in image processing tasks, offering a fresh perspective on how machines can understand and interpret visual information. Vision Transformers represent a paradigm shift in computer … Read more

Graph Neural Networks for Fraud Detection

September 8, 2025August 27, 2025 by Peter Song

Fraud detection has evolved from simple rule-based systems to sophisticated machine learning approaches, and now stands at the forefront of a new revolution: graph neural networks for fraud detection. As financial crimes become increasingly complex and interconnected, traditional detection methods struggle to capture the intricate relationships and patterns that fraudsters exploit. Graph neural networks (GNNs) … Read more

How Accurate is a DeepAR Model?

September 8, 2025August 26, 2025 by Peter Song

Time series forecasting has evolved dramatically with the introduction of deep learning methodologies, and Amazon’s DeepAR stands out as one of the most significant breakthroughs in this field. But how accurate is a DeepAR model compared to traditional forecasting methods? This comprehensive analysis explores the accuracy capabilities, performance benchmarks, and practical applications of DeepAR to … Read more

Best Practices for Using Embeddings in Recommender Systems

September 8, 2025August 25, 2025 by Peter Song

Recommender systems have evolved dramatically over the past decade, transitioning from simple collaborative filtering approaches to sophisticated deep learning architectures that leverage embeddings to capture complex user-item relationships. Embeddings have become the cornerstone of modern recommendation engines, enabling systems to understand nuanced patterns in user behavior and item characteristics that traditional methods often miss. At … Read more

What is SMOTE in Data Augmentation?

September 8, 2025August 24, 2025 by Peter Song

In the world of machine learning and data science, one of the most persistent challenges practitioners face is dealing with imbalanced datasets. When certain classes in your dataset are significantly underrepresented compared to others, traditional machine learning algorithms often struggle to learn meaningful patterns from the minority classes. This is where SMOTE (Synthetic Minority Oversampling … Read more

How to Evaluate Clustering Models Without Ground Truth

September 8, 2025August 23, 2025 by Peter Song

In the world of unsupervised machine learning, clustering stands as one of the most fundamental and widely-used techniques. From customer segmentation to gene expression analysis, clustering algorithms help us discover hidden patterns and structures in data. However, unlike supervised learning where we have labeled data to validate our models, clustering presents a unique challenge: how … Read more

Understanding Gradient Clipping in Deep Learning

September 8, 2025August 22, 2025 by Peter Song

Deep learning has revolutionized artificial intelligence, but training neural networks remains a delicate balancing act. One of the most persistent challenges practitioners face is the dreaded exploding gradient problem, where gradients grow exponentially during backpropagation, causing training to become unstable or fail entirely. This is where gradient clipping emerges as an essential technique, acting as … Read more