How Uniform Random Weights Induce Non-uniform Bias: Typical...
Background. A main theoretical puzzle is why over-parameterized Neural Networks (NNs) generalize well when trained to zero loss (i.e., so they interpolate the data). Usually, the NN is trained...
arxiv.org