Bucketing in Spark
Bucketing 🪣
Bucketing is a way to assign rows of a dataset to specific buckets and collocate them on disk.
Explicit bucket counts (clustering columns) can be provided to partition the data based on the number of buckets.
Ideal situation to use �...
bytesofdeepankar.hashnode.dev3 min read