nn reading
elimelt committed Jan 14, 2025
1 parent 4fd0ffd commit 332cfe4
Showing 1 changed file with 10 additions and 1 deletion.
11 changes: 10 additions & 1 deletion natural-language-processing/neural-networks.md
@@ -217,4 +217,13 @@ $$
\frac{\partial L}{\partial a} &= \frac{\partial L}{\partial e} \cdot \frac{\partial e}{\partial a} = c \\
\frac{\partial L}{\partial b} &= \frac{\partial L}{\partial e} \cdot \frac{\partial e}{\partial d} \cdot \frac{\partial d}{\partial b} = 2c
\end{align*}
$$
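
The two gradients above can be checked numerically. The sketch below assumes the computation graph is $d = 2b$, $e = a + d$, $L = c \cdot e$; those definitions are not visible in this hunk and are only inferred from the derivatives shown. The names `loss` and `numeric_grad` are hypothetical helpers for illustration.

```python
def loss(a, b, c):
    # Assumed graph (not shown in this hunk): d = 2b, e = a + d, L = c * e
    d = 2 * b
    e = a + d
    return c * e


def numeric_grad(f, args, i, h=1e-6):
    # Central finite difference of f with respect to its i-th argument
    hi = list(args)
    lo = list(args)
    hi[i] += h
    lo[i] -= h
    return (f(*hi) - f(*lo)) / (2 * h)


a, b, c = 3.0, 1.0, -2.0
dL_da = numeric_grad(loss, (a, b, c), 0)  # chain rule predicts c
dL_db = numeric_grad(loss, (a, b, c), 1)  # chain rule predicts 2c
```

Under that assumed graph, `dL_da` should agree with $c$ and `dL_db` with $2c$ up to floating-point error.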

### Learning details

Training a neural network is a non-convex optimization problem, so a few techniques help it work well in practice:

- Initialize weights and biases to small random values instead of all zeros
- Normalize input values to $\mu = 0, \sigma = 1$
- Dropout: to reduce overfitting, randomly set each hidden unit to 0 with probability $p$ during training, then rescale the remaining units
- Hyperparameters to tune: learning rate, mini-batch size, number of hidden units, number of layers, choice of activation function, etc.
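
The first three techniques can be sketched with NumPy. This is a minimal illustration, not the notes' own implementation: the function names, the `scale=0.01` default, the ReLU activation, and the choice of "inverted" dropout (dividing the kept units by $1 - p$) are all assumptions made for the example.

```python
import numpy as np

rng = np.random.default_rng(0)


def init_layer(n_in, n_out, scale=0.01):
    # Small random weights and biases instead of all zeros (scale is an assumed value)
    W = rng.normal(0.0, scale, size=(n_in, n_out))
    b = rng.normal(0.0, scale, size=n_out)
    return W, b


def standardize(X, eps=1e-8):
    # Shift and scale each input feature to mean 0 and standard deviation 1
    mu = X.mean(axis=0)
    sigma = X.std(axis=0)
    return (X - mu) / (sigma + eps)


def dropout(h, p, training=True):
    # Zero each unit with probability p, then rescale the kept units by 1 / (1 - p)
    # (inverted dropout, one common way to do the rescaling; requires p < 1)
    if not training or p == 0.0:
        return h
    mask = rng.random(h.shape) >= p
    return h * mask / (1.0 - p)


X = rng.normal(5.0, 3.0, size=(200, 4))  # synthetic inputs, illustration only
Xn = standardize(X)
W, b = init_layer(4, 8)
h = np.maximum(0.0, Xn @ W + b)          # one hidden layer with ReLU (assumed)
h_drop = dropout(h, p=0.5)
```

With `training=False` the `dropout` helper returns its input unchanged, so the same forward pass can be reused at evaluation time.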
