Berkan Sesen in sesenai.hashnode.dev · Solving CartPole Without Gradients: Simulated Annealing · 1d ago · 17 min read · In the previous post, we solved CartPole using the Cross-Entropy Method: sample 200 candidate policies, keep the best 40, refit a Gaussian, repeat. It worked beautifully, reaching a perfect score of 5…
Berkan Sesen in sesenai.hashnode.dev · The Cross-Entropy Method: Solving RL Without Gradients · 3d ago · 14 min read · Reinforcement learning has accumulated layers of complexity over the years: value functions, policy gradients, replay buffers, target networks. The Cross-Entropy Method predates all of it. Rubinstein…
Berkan Sesen in sesenai.hashnode.dev · PCR vs PLS: When Fewer Features Beat More · 4d ago · 14 min read · How much should a baseball team pay its players? The 1986 Major League season gives us 263 hitters with 19 statistics each: at-bats, hits, home runs, years played, and more. Predicting salary from per…
Berkan Sesen in sesenai.hashnode.dev · Text Classification from Scratch: TF-IDF and Naive Bayes · 6d ago · 16 min read · Every morning, your inbox separates spam from real email. News apps sort articles into sports, tech, and politics. Customer support systems route tickets to the right team. Behind all of these is text…
Berkan Sesen in sesenai.hashnode.dev · AI Experts Are Dead. Long Live the AI Experts. · Apr 15 · 16 min read · Last month, my eight-year-old built a Flappy Bird clone from scratch. He can't really type yet. He certainly can't write Python. What he can do is talk to Claude while I whisper in his ear what to say…