Comprehensive Guide to the ReLU Activation Function in Neural Networks: Definition, Role, and Type Explained
data-intelligence.hashnode.dev
In Table 3, the formula for Swish should use sigmoid(beta * x), not sigmoid(x), unless beta is identically 1. As I understand it, beta is a trainable parameter in Swish, and that is precisely what distinguishes it from SiLU, where beta is fixed at 1.

The derivative should therefore be:

f'(x) = sigmoid(beta * x) + beta * x * sigmoid'(beta * x)

where the derivative of the sigmoid is sigmoid'(z) = sigmoid(z) * (1 - sigmoid(z)). Equivalently, writing y = sigmoid(beta * x), we have dy/dx = beta * y * (1 - y), so f'(x) = y + beta * x * y * (1 - y).
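As a quick sanity check of the derivative above, here is a minimal sketch (plain Python, no ML framework assumed) that compares the closed-form gradient against a central finite difference; the function names `swish` and `swish_grad` are mine, not from the article:

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def swish(x, beta=1.0):
    # Swish: f(x) = x * sigmoid(beta * x).
    # beta is trainable in Swish; SiLU is the special case beta = 1.
    return x * sigmoid(beta * x)

def swish_grad(x, beta=1.0):
    # f'(x) = y + beta * x * y * (1 - y), with y = sigmoid(beta * x)
    y = sigmoid(beta * x)
    return y + beta * x * y * (1.0 - y)

# Verify against a central finite difference at several points.
for x in (-2.0, 0.0, 1.5):
    for beta in (0.5, 1.0, 2.0):
        h = 1e-6
        numeric = (swish(x + h, beta) - swish(x - h, beta)) / (2.0 * h)
        assert abs(numeric - swish_grad(x, beta)) < 1e-5
```

Note that at x = 0 the gradient reduces to sigmoid(0) = 0.5 for any beta, which the check above confirms.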