Broadcast Joins vs Sort-Merge Joins in Spark
5d ago · 23 min read · 📖 The 45-Minute Join Stage That Became 90 Seconds A data engineering team at a retail company was running a nightly Spark job that joined their 500 GB transaction fact table against a 50 MB product dimension table. The job had been in production for...
Join discussion

























