BSBerkan Seseninsesenai.hashnode.dev·1d ago · 13 min readAIC and BIC: Choosing the Right Model Without OverfittingImagine you're fitting a curve to noisy data. A straight line misses the shape entirely, so you try a quadratic, then a cubic, then keep going. By degree 10 the curve passes through nearly every point00
ASAnton Sarokainanton-saroka.hashnode.dev·Jun 20 · 9 min readCADE — An Interesting Approach to Finding Anomalies in Multidimensional DataThis is a translation of my original article on habr.com. Introduction One way to search for anomalies in a dataset is to use the probability density function corresponding to the data as a measure of00
ASAnton Sarokainanton-saroka.hashnode.dev·Jun 20 · 7 min readWhat Is the Distribution of Sample Quantiles?This is a translation of my original article on habr.com. Sample means, sample variances, sample quantiles, and other statistics are random variables by nature. Knowing their distributions helps us bu00
Mmayaanderssoninllmasajudge.hashnode.dev·Jun 16 · 3 min readStratified sampling for LLM eval sets: why your aggregate pass rate hides the regressions that matterTL;DR: A headline eval pass rate is an average over every kind of input your system sees, and averages hide the thing you most need to catch: a sharp regression in a small but important slice. If refu00
BSBerkan Seseninsesenai.hashnode.dev·Jun 12 · 13 min readLDA vs PCA: Supervised Meets Unsupervised Dimensionality ReductionYou have a high-dimensional dataset and you need to squeeze it down to two or three dimensions for visualisation or downstream modelling. The go-to move is PCA, and most of the time it works. But cons00
AAdarshinaasteriskz.hashnode.dev·Jun 11 · 11 min readBuilding an Autonomous Monte Carlo Engine to Predict the 2026 World CupToday is the start of the 2026 FIFA World Cup, the largest sporting competition every four years. As a fun project, I decided to build a model to predict the tournament. With sports, you never really 00
Mmayaanderssoninllmasajudge.hashnode.dev·Jun 11 · 2 min readWe put confidence intervals on our LLM-judge scores. The error bars ate three weeks of "trend"We track weekly agreement between an LLM judge and human labels (Cohen's kappa) on a sample of production traces. For three weeks the point estimates told a story: 0.55, then 0.49, then 0.44. The team00
ABAman Beherainbeingamanforever.hashnode.dev·Jun 11 · 8 min readGSoC 2026 / Week 2: Wiring the fitHi again. Week 2 of my GSoC with GNU Octave is done. If Week 1 was the structural shell, Week 2 is when that shell starts doing real work, the anova class can now actually fit a model and populate res10
BSBerkan Seseninsesenai.hashnode.dev·Jun 9 · 14 min readChangepoint Detection: Finding Regime Shifts in Financial DataMarkets do not stay in one regime. The S&P 500 can cruise at 10% annualised volatility for months, then a crisis hits and volatility doubles overnight. Any model trained on the calm period is useless 00
ABAman Beherainbeingamanforever.hashnode.dev·Jun 4 · 6 min readGSoC 2026 / Week 1: The skeleton and the selectorHi again. Week 1 of my GSoC with GNU Octave is done. This week was deliberately small in surface area and large in decisions, the kind of week where 280 lines of code take five hours of staring at exi10