© 2026 LinearBytes Inc.
Shahida R. Khan
Staff Data Engineer | PySpark, Databricks, Delta Lake, AWS
Start with a relatable junior struggle: "Ever stared at a Spark UI wondering why your 'simple' PySpark job is shuffling 100GB for a 1GB dataset? Or why Delta reads take forever? I did – for years. As