Ali Vijdaan · vijdaancoding.hashnode.dev · Jun 22, 2024
Never Struggle with Skewed Data Again
The article will give a thorough explanation of what skewed data is, why it's bad for your models, and how you can identify and fix such data. What is Skewed Data: Data which is normally distributed would look like this. As you can see the mean, median a...
Tags: skewed datasets
Ajay Veerabomma · ajayveerabomma.hashnode.dev · Feb 20, 2024
Understanding Salting Technique in Spark
Apache Spark is a powerful open-source distributed computing system known for its speed, ease of use, and sophisticated analytics capabilities. It is widely used for big data processing and analytics due to its ability to handle large-scale data acro...
2 likes · 261 reads
Tags: spark
Saurabh Naik · saurabhz.hashnode.dev · Nov 21, 2023
Data Asymmetry: A Guide to Skewness
Introduction: In the intricate world of data analysis, skewness emerges as a critical metric, unraveling the hidden tales of data distribution. Understanding its nuances is paramount for deciphering the symphony of statistics. What is Skewness?: Skew...
Tags: Statistics, skew