Processing 4 Million NYC Taxi Trips with Apache Spark — DE Zoomcamp Week 6
Week 6 of the Data Engineering Zoomcamp by DataTalksClub pushed me into distributed batch processing with Apache Spark. This post walks through every question, every line of code, and every lesson I t
data-engineering-ahm.hashnode.dev10 min read