contents
It’s an optional section
previous
KL-divergence and cross-entropy
next
Optimization and gradient descent method