Spark Performance optimisation - Broadcast join
Broadcast join is an optimization technique used in the Spark SQL engine. It is utilized when one of the DataFrames is small enough to be stored in the memory of all executor nodes. This technique greatly improves the performance of join operations b...
mpmartydata.hashnode.dev4 min read