Raghuveer Sriramanraghuveer.me·Oct 3, 2023Getting answers from data using PySparkThis post attempts to document a small part of a Data Engineer's workflow along with some techniques that help answering data questions from a dataset. On the technical side, we will deal with nested JSON data, touch upon data cleaning and data explo...66 readsspark
Piyush Kumar Sinhadatasciencewonders.hashnode.dev·Sep 16, 2023The Importance of Data Cleaning and Preprocessing in Data ScienceWelcome back, fellow data wranglers! If you’ve been following this series, you know we’ve already dived into the exciting world of data collection. Today, we’re rolling up our sleeves and getting our hands dirty with the nitty-gritty of Data Cleaning...Data Science
Jon Taylorblog.jontaylor.dev·Jul 9, 2023Command Line for Data AnalystsUnderstanding your data is a crucial part of data analysis and using the right tools can make your job much simpler. By the end of this post, you'll see one way I approach problems with the command line and hopefully introduce you to something new. T...data
FoodDataScrapefooddatascrapeservices.hashnode.dev·Feb 8, 2023How To Scrape TikTok Indonesia Food Recipe Data For Using Data Extraction, Exploration And Data Visualization?During the Covid-19 pandemic, we have seen changes in people’s story updates and Instagram posts. From posts like hangouts, parties, and travel, it’s shifting to the home activities about gardening, cooking, and Netflix binge-watching! We have seen ...63 readsData exploration