Raghuveer Sriraman · raghuveer.me · Oct 3, 2023

Getting answers from data using PySpark

This post attempts to document a small part of a Data Engineer's workflow, along with some techniques that help answer data questions from a dataset. On the technical side, we will deal with nested JSON data, touch upon data cleaning and data explo...

66 reads · spark