© 2026 LinearBytes Inc.
Mickael Saada
Hi everyone! I want to use Spark to run some computations in parallel, but I know Spark is hard to configure when your Python code depends on external libraries. What is the easiest way to launch Spark jobs on a cluster using Python? Thanks!
Paul
Code & Security @ sqreen.io
Hi Mickael,
my colleague had the exact same question and just wrote an article about it.
We decided to use Amazon EMR.
I hope it will help.
blog.sqreen.io/amazon-emr-spark
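For anyone who wants a concrete starting point before reading the article, here is a minimal sketch of one common way to ship pure-Python dependencies with a PySpark job: passing them to `spark-submit` through `--py-files`. It is not necessarily what the linked article does. The file names `job.py` and `deps.zip` and the helper `build_spark_submit` are made up for illustration; `yarn` is used as the default master on the assumption that the job runs on an EMR cluster, where Spark runs on YARN.

```python
import shlex


def build_spark_submit(script, py_files=(), master="yarn",
                       deploy_mode="cluster", app_args=()):
    """Build the argument list for a spark-submit call (hypothetical helper)."""
    cmd = ["spark-submit", "--master", master, "--deploy-mode", deploy_mode]
    if py_files:
        # --py-files expects one comma-separated list of .zip, .egg or .py files
        cmd += ["--py-files", ",".join(py_files)]
    cmd.append(script)       # the PySpark entry point
    cmd.extend(app_args)     # arguments forwarded to the script itself
    return cmd


if __name__ == "__main__":
    # job.py and deps.zip are placeholder names, not files from the article.
    cmd = build_spark_submit("job.py", py_files=["deps.zip"])
    print(shlex.join(cmd))
```

You would run the printed command on the cluster's master node. Note that `--py-files` only helps with pure-Python code; libraries with compiled extensions generally have to be installed on every node, for example with an EMR bootstrap action.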