Data Manipulation with PySpark
Spark has become the de facto tool for processing large amounts of data. It is a distributed, in-memory engine with interfaces for numerous data stores which makes it scalable, fast and flexible. Platforms like Databricks and Snowflake (SnowPark) use...
jaeycyril.com6 min read