Feed
Pro
Search

Author

Write
Drafts

Bug0 - The AI-native e2e QA regression testing Passmark - The open-source AI framework for regression testing Hackathons Changelog Brand Hashnode gql skill - let your AI agent publish to your Hashnode blog The Foreword by Hashnode - official blog from the Hashnode team @hashnode on X Hashnode on LinkedIn Support - hello+support@hashnode.com Code of Conduct Terms Privacy Sitemap
Sign in

Search Hashnode

Search posts, tags, users, and pages

Discussion on "The Cross-Entropy Method: Solving RL Without Gradients" | Hashnode

FeedDiscussion

Berkan Sesen

I think about AI. A lot.

Apr 21

The Cross-Entropy Method: Solving RL Without Gradients

Reinforcement learning has accumulated layers of complexity over the years: value functions, policy gradients, replay buffers, target networks. The Cross-Entropy Method predates all of it. Rubinstein

sesenai.hashnode.dev14 min read

#reinforcement-learning #optimisation

Responses

No responses yet.