PKParth krishan Goswamiinparthkg.hashnode.dev·Feb 28 · 5 min readDay 14: setup_env.sh to human-learn — Building Leak-Proof Pipelines on M1Subtitle: Ditching messy source installations and manual scaling for clean, conda-powered Scikit-Learn pipelines and human-in-the-loop logic. setup_env.sh: Installing Scikit-Learn on Apple Silicon Be00
PKParth krishan Goswamiinparthkg.hashnode.dev·Feb 25 · 5 min readData Wrangling — Cleaning, Merging, and GroupingI thought I just had to load a CSV and press "Train." I was wrong. My data was scattered across two different files. It had empty rows. It had text where numbers should be. If I feed this into a model00
PKParth krishan Goswamiinparthkg.hashnode.dev·Feb 22 · 11 min readVisualizing Data — Matplotlib and SeabornWhy Stats Aren't Enough I thought .describe() was enough. It gave me the mean, the max, and the median. But then I learned about Anscombe's Quartet. This is a famous dataset where four different group00
PKParth krishan Goswamiinparthkg.hashnode.dev·Feb 21 · 4 min readPandas: A Deep Dive1. Pandas Series A Series is a one-dimensional array holding data of any type. It’s essentially a single column. Creating Labels By default, if I make a list, Pandas labels it 0, 1, 2... just like a n00
PKParth krishan Goswamiinparthkg.hashnode.dev·Feb 20 · 6 min readThe NumPy Survival GuideTo build Machine Learning models, I need to master the ndarray (N-dimensional array). It’s not just about storing numbers; it’s about generating patterns, reshaping matrices, and cleaning data without00