Discussion on "Why Reward Shaping Sucks and What You Can Do About It" | Hashnode

Duane Nielsen

Obsessed with RL

Sep 18, 2025

Why Reward Shaping Sucks and What You Can Do About It

In my previous article on reward shaping, I walked through four hard-learned lessons about balancing collision penalties in a navigation task. I eventually found the "Goldilocks solution" - a -0.1 penalty that let my agent learn to navigate obstacles...

proximal.hashnode.dev · 6 min read

#ai #bellman-equation #deep-learning #machine-learning #reinforcement-learning #reward-shaping

Responses (1)

Revan wjy

Relax here and make some money

jo777.help