Discussion on "Solving CartPole Without Gradients: Simulated Annealing" | Hashnode

Berkan Sesen

I think about AI. A lot.

Apr 23

Solving CartPole Without Gradients: Simulated Annealing

In the previous post, we solved CartPole using the Cross-Entropy Method: sample 200 candidate policies, keep the best 40, refit a Gaussian, repeat. It worked beautifully, reaching a perfect score of 5

sesenai.hashnode.dev · 17 min read

#reinforcement-learning #optimisation

Responses

No responses yet.