Feed
Pro
Search

Author

Write
Drafts

Bug0 - The AI-native e2e QA regression testing Passmark - The open-source AI framework for regression testing Hackathons Changelog Brand Hashnode gql skill - let your AI agent publish to your Hashnode blog The Foreword by Hashnode - official blog from the Hashnode team @hashnode on X Hashnode on LinkedIn Support - hello+support@hashnode.com Code of Conduct Terms Privacy Sitemap
Sign in

Search Hashnode

Search posts, tags, users, and pages

FeedDiscussion

Trung Thành

May 6

I spent 8 hours learning Spark partitioning and bucketing. Here's what I discovered

s one thing I've noticed: most Spark pipelines waste 30-60% of their compute time reading data they don't need or shuffling data that could have been pre-organized. During my recent deep-dive, I spent 8 hours learning two important optimization techn...

thanh-de.hashnode.dev4 min read

#apache #data #performance #software

Responses

No responses yet.