Feed
Pro
Search

Author

Write
Drafts

Bug0 - The AI-native e2e QA regression testing Bug0 Browsers - Cloud Chromium on demand, per-minute, live preview Passmark - The open-source AI framework for regression testing Hackathons Changelog Brand @hashnode on X Hashnode on LinkedIn Support - hello+support@hashnode.com Code of Conduct Terms Privacy Sitemap
Sign in

Search Hashnode

Search posts, tags, users, and pages

Discussion on "Bucketing in Spark" | Hashnode

FeedDiscussion

Deepankar Yadav

Making data sexy, one query at a time: Data's ultimate wingman.

Jan 6, 2024

Bucketing in Spark

Bucketing 🪣 Bucketing is a way to assign rows of a dataset to specific buckets and collocate them on disk. Explicit bucket counts (clustering columns) can be provided to partition the data based on the number of buckets. Ideal situation to use �...

bytesofdeepankar.hashnode.dev3 min read

Responses

No responses yet.