Hybrid Batch and Streaming Architectures for Feature Engineering
Machine learning models in production face a fundamental tension: they need features computed from both historical patterns and real-time events. A fraud detection model benefits from a user’s transaction history over months (batch) while also requiring instant analysis of the current transaction’s characteristics (streaming). A recommendation system needs deep collaborative filtering computed across all users … Read more