Search
Now showing items 1-2 of 2
For interpolating kernel machines, the minimum norm ERM solution is the most stable
(Center for Brains, Minds and Machines (CBMM), 2020-06-22)
We study the average CVloo stability of kernel ridge-less regression and derive corresponding risk bounds. We show that the interpolating solution with minimum norm has the best CVloo stability, which in turn is controlled ...
The Janus effects of SGD vs GD: high noise and low rank
(2023-12-21)
It was always obvious that SGD has higher fluctuations at convergence than GD. It has also been often reported that SGD in deep RELU networks has a low-rank bias in the weight matrices. A recent theoretical analysis linked ...