pinkdatahub.hashnode.devCustomer Insights with Quantium: Task One ReflectionI tried The Forage Job simulation, and I started with Quantium Task 1 Here is a review of the Task 1. Task One: Data preparation and customer analytics Conducting analysis on client's transaction dataset and identifying customer purchasing behavior...Jul 16, 2025·3 min read
pinkdatahub.hashnode.devSocial Media Sentiment (Data cleaning)Hi everyone, I resumed publishing on my blogs my career growth. I literally took a year break, but I am fully ready to focus on my career. I decided to start learning my Data Analysis skills. I chose to start with data cleaning exercises using Python...Jun 18, 2025·1 min read
pinkdatahub.hashnode.devMarket Basket AnalysisProject Description Market Basket Analysis using PySpark to identify associations between different products. To determine which items are often purchased together and derive insights to optimize product placements and promotions. Dataset: Sample Sup...Aug 14, 2023·3 min read
pinkdatahub.hashnode.devCassandra DB with Docker, Python On Windows Operating SystemA friend of mine just starting his data engineering career journey reached out to me because he had issues with setting Cassandra DB on his system. So I decided to document the whole process so other data engineering newbies can have access to it. Ba...Jul 19, 2023·2 min read
pinkdatahub.hashnode.devLoading Data from MongoDB Database with PySparkThe last database we will connect with PySpark is MongoDB. MongoDB is a NoSQL Database that usually outputs data in a JSON File Format. We start by installing the MongoDB driver for python pip install pymongo To set up MongoDB you can download the c...Jul 16, 2023·4 min read