Data Science Asked by Tarun Gupta on January 17, 2021
I was reading this article where I came across the following statement in the context of “Why do we use sigmoid activation function in Neural Nets?”:
The assumption of a dependent variable to follow a sigmoid function inherently assumes a Gaussian distribution for the independent variable which is a general distribution we see for a lot of randomly occurring events and this is a good generic distribution to start with.
Could someone elaborate on this relationship between the two?
Yes sure, the core idea is that the sigmoid function approximate very nicely the cumulative density function of a Gaussian distribution. Therefore, if the input of a neuron is a continuous Gaussian distribution, the probability of the output being inferior to a certain value will follow (almost) the shape of the sigmoid function.
A detailed explanation can be found p16 of the following article:
Answered by Jeanba on January 17, 2021
Get help from others!
Recent Answers
Recent Questions
© 2024 TransWikia.com. All rights reserved. Sites we Love: PCI Database, UKBizDB, Menu Kuliner, Sharing RPP