How do we assign weights in deep learning?
We already know that in a neural network, weights are usually initialized randomly, and that kind of initialization takes a significant number of iterations to converge to the minimum loss and reach the ideal weight matrix.
The problem is that this kind of initialization is prone to vanishing or exploding gradients.
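To see the issue concretely, here is a minimal NumPy sketch (the layer sizes n_in and n_out are hypothetical, chosen only for illustration): with unscaled weights drawn from N(0, 1), the spread of a layer's output grows with the number of inputs, which is how activations and gradients can explode as depth increases.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical layer sizes, chosen only for illustration.
n_in, n_out = 512, 512

# Naive initialization: every weight drawn from N(0, 1), no scaling.
W = rng.standard_normal((n_in, n_out))

# Push a standard-normal input through one linear layer.
x = rng.standard_normal(n_in)
z = W.T @ x

# The output's spread grows by roughly sqrt(n_in) relative to the input's.
print(x.std(), z.std())
```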
General ways to initialize better weights:
A) If you're using the ReLU activation function in deep nets (sketched below):
Generate a random sample of weights from a Gaussian distribution having a mean of 0 and a standard deviation of 1.
Multiply the sample by the square root of (2/n_i), where n_i is the number of input units for that layer.
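This recipe is commonly known as He initialization. A minimal NumPy sketch, assuming a hypothetical helper he_init and illustrative layer sizes:

```python
import numpy as np

rng = np.random.default_rng(0)

def he_init(n_in, n_out):
    """He initialization for ReLU layers: N(0, 1) scaled by sqrt(2 / n_in)."""
    return rng.standard_normal((n_in, n_out)) * np.sqrt(2.0 / n_in)

W = he_init(512, 256)
print(W.std())  # close to sqrt(2 / 512) ≈ 0.0625
```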
B) Likewise, if you're using the tanh activation function (sketched below):
Generate a random sample of weights from a Gaussian distribution having a mean of 0 and a standard deviation of 1.
Multiply the sample by the square root of (1/n_i), where n_i is the number of input units for that layer.
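This recipe is commonly known as Xavier (Glorot) initialization. A minimal NumPy sketch along the same lines, again with a hypothetical helper and illustrative layer sizes:

```python
import numpy as np

rng = np.random.default_rng(0)

def xavier_init(n_in, n_out):
    """Xavier initialization for tanh layers: N(0, 1) scaled by sqrt(1 / n_in)."""
    return rng.standard_normal((n_in, n_out)) * np.sqrt(1.0 / n_in)

W = xavier_init(512, 256)
print(W.std())  # close to sqrt(1 / 512) ≈ 0.0442
```

In both cases the scaling keeps the variance of each layer's output close to the variance of its input, which is what lets deep networks start training without vanishing or exploding gradients.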