Every breath you take-from a deep sigh after a long day to the steady rhythm that carries you through a morning walk-traces ...
Implemented pandas-based cleaning rules in data_preprocessing.py, transformations for salesorder.csv → clean_salesorder.csv, pipeline testing via multiple DAG runs.
This project is a reference implementation of a batch-oriented analytics pipeline designed to reflect how data transformations evolve from simple scripts to structured data platform workflows. In many ...