Shuffles in Spark: Why groupBy Kills Performance
5d ago · 29 min read · TLDR: A Spark shuffle is the most expensive operation in a Spark job: it moves every matching key across the network, writes temporary sorted spill files to disk, and forces a hard synchronization barrier between every upstream and downstream stage.
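To make the cost concrete, here is a minimal sketch in Scala (the object name and sample data are illustrative, not from the article) contrasting `groupByKey`, which ships every raw value across the shuffle, with `reduceByKey`, which combines values map-side first so far less data crosses the network:

```scala
import org.apache.spark.sql.SparkSession

object ShuffleDemo {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("shuffle-demo")
      .master("local[*]")
      .getOrCreate()
    val sc = spark.sparkContext

    // Hypothetical (key, value) pairs; imagine millions of rows in practice.
    val pairs = sc.parallelize(Seq(("a", 1), ("b", 2), ("a", 3), ("b", 4)))

    // groupByKey shuffles EVERY value across the network before any
    // aggregation happens, then buffers the full value list per key.
    val grouped = pairs.groupByKey().mapValues(_.sum)

    // reduceByKey applies the same sum map-side first (a combiner), so only
    // one partial sum per key per partition crosses the shuffle boundary.
    val reduced = pairs.reduceByKey(_ + _)

    println(grouped.collect().toSeq) // e.g. ArrayBuffer((a,4), (b,6))
    println(reduced.collect().toSeq) // same result, far less shuffle I/O

    spark.stop()
  }
}
```

Both jobs produce identical sums, but on a large dataset the `groupByKey` variant shuffles every record while the `reduceByKey` variant shuffles at most one partial sum per key per partition.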