KBKumar Bhaskar Prakashinspark-opt-tech-adv.hashnode.dev·Apr 5 · 15 min readAdvanced Spark Optimization TechniquesFinding a solution — isn’t that enough? If this question has ever crossed your mind, let me tell you—you’re on the right path. But let me ask you something: Imagine you want to buy a watch from brand 00
TTobiasMuthoniincybersec-tobias.hashnode.dev·Feb 16, 2025 · 2 min readHow to Fix Data Skew in Apache Spark with the Salting TechniqueThe Data Skew Problem Apache Spark struggles when a few keys dominate your dataset during:✔ Join operations✔ GroupBy aggregations✔ Window functions Symptoms you'll notice:⚠️ 80% of tasks finish quickly while 20% take forever⚠️ Frequent "executor lost...00