Connecting AWS Glue and SageMaker for ML Pipelines

Production machine learning pipelines require more than model training. By a commonly cited estimate, data scientists spend roughly 80% of their time on data preparation, transformation, and feature engineering before they can begin training models. This is where combining AWS Glue with Amazon SageMaker pays off. While SageMaker excels at machine …

CDC Pipeline Architecture on AWS Using Firehose and Glue

Change Data Capture (CDC) has become essential for modern data architectures, enabling real-time data synchronization, analytics, and event-driven workflows. When building CDC pipelines on AWS, combining Kinesis Firehose with AWS Glue yields a serverless architecture that scales automatically and requires minimal operational overhead. This approach uses AWS-managed services to capture database changes, stream them …