Mumtaz Fatima in mumtazfatima.hashnode.dev · Jul 5, 2025 · 3 min read
Day 3 of 50 Days of ML: Data Cleaning Overview
Often, when we train ML models on data, the dataset we use is messy, filled with NULLs, or has an absurd range of values in a particular column. The dataset needs to be preprocessed and cleaned to prevent noise from affecting the accuracy of the mode...

Mumtaz Fatima in mumtazfatima.hashnode.dev · Jun 27, 2025 · 2 min read
Why Leetcode?
Something about opening the leetcode.com website and staring at the problem description annoys me. I find it quite a waste of time trying to solve questions that don’t directly map 1:1 to the work I do. So, I often end up procrastinating (just like h...

Mumtaz Fatima in mumtazfatima.hashnode.dev · Jun 20, 2025 · 2 min read
Day 2 of 50: Diving into Vector Databases
What are vector databases? In simple words, vector databases are similar to traditional databases, as in they store data. However, the difference is that they store vectors and can be leveraged to do quick similarity searches for vectors. For example...
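
To make the idea of a similarity search over stored vectors concrete, here is a minimal sketch in plain Python. It is not taken from the post: the `store` contents, the 3-dimensional vectors, and the helper names `cosine_similarity` and `top_k` are all made up for illustration, and it scans every vector by brute force, whereas real vector databases typically rely on specialized indexes to stay fast at scale.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def top_k(query, store, k=2):
    """Return the keys of the k stored vectors most similar to the query."""
    scored = [(cosine_similarity(query, vec), key) for key, vec in store.items()]
    scored.sort(reverse=True)  # highest similarity first
    return [key for _, key in scored[:k]]


# Hypothetical toy "database": keys mapped to made-up 3-dimensional vectors.
store = {
    "cat": [0.9, 0.1, 0.0],
    "kitten": [0.8, 0.2, 0.1],
    "car": [0.1, 0.9, 0.3],
}

print(top_k([1.0, 0.0, 0.0], store))  # prints ['cat', 'kitten']
```

The query vector points almost in the same direction as the "cat" and "kitten" vectors, so those two rank first, while "car" points elsewhere and is left out.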

Mumtaz Fatima in mumtazfatima.hashnode.dev · Jun 18, 2025 · 2 min read
Day 1 of 50: Data Preprocessing
Hello again! I’m following Avik’s Day 1 template for my 50 Days of ML Challenge. This module covers the basics of data importing, preprocessing, and feature scaling. For the dataset used in my code, I reused a dataset that I created for one of my cou...

Mumtaz Fatima in mumtazfatima.hashnode.dev · Jun 17, 2025 · 1 min read
Day 0: 50 Days of ML Challenge
Hello again! I’m starting the 50 Days of ML Challenge to further my knowledge of machine learning. Follow along if you would like to learn together. I’m following Avik Jain’s roadmap. Find the link to the repository here: https://github.com/Avik-Jain...