Dharshini Sankar Raj · dharshinisankarraj.hashnode.dev · Oct 8, 2024
Sampling on Datasets
Once you’ve created a dataset, you want to explore the values inside. Exploring very large datasets can be difficult, as even simple operations can be expensive, both in terms of computational resources and time. The same sampling principle applies t...
Data Science
Juan Carlos Olamendy · juancolamendy.hashnode.dev · May 24, 2024
Real World ML - Dealing with Selection Bias
Recently, I was helping to improve the accuracy of an ML model in production and faced a difficult challenge. No matter what robust ML algorithm we selected or how we fine-tuned the hyperparameters, the accuracy against real-world data just wouldn't...
Artificial Intelligence
Juan Carlos Olamendy · juancolamendy.hashnode.dev · May 22, 2024
Real world ML - Determine the Optimal Sample Size
Have you ever wondered how much data you really need for training your ML model? Determining the optimal sample size is a crucial step that can make or break your model's performance, computational efficiency and your project's time and costs. In thi...
Machine Learning
Juan Carlos Olamendy · juancolamendy.hashnode.dev · May 17, 2024
Effective Data Collection Strategies for Machine Learning
Data is the lifeblood of machine learning models. The right data collection strategy can make all the difference. But how do you ensure that your data is representative, diverse, and unbiased? As an ML practitioner, it's crucial to understand the vari...
Machine Learning
Siddharth Rai · raisid369.hashnode.dev · Dec 25, 2022
Sampling Techniques!
In statistics, different methods or techniques are used to obtain sample data from a given population. These are known as sampling techniques. Broadly, they can be classified into four types. They are: Random Sampling: As the name su...
sampling techniques