On Generalization Bounds for Neural Networks with Low Rank Layers

Pinto, Andrea; Rangamani, Akshay; Poggio, Tomaso

dc.contributor.author	Pinto, Andrea
dc.contributor.author	Rangamani, Akshay
dc.contributor.author	Poggio, Tomaso
dc.date.accessioned	2024-10-11T13:51:12Z
dc.date.available	2024-10-11T13:51:12Z
dc.date.issued	2024-10-11
dc.identifier.uri	https://hdl.handle.net/1721.1/157263
dc.description.abstract	While previous optimization results have suggested that deep neural networks tend to favour low-rank weight matrices, the implications of this inductive bias on generalization bounds remain under-explored. In this paper, we apply a chain rule for Gaussian complexity (Maurer, 2016a) to analyze how low-rank layers in deep networks can prevent the accumulation of rank and dimensionality factors that typically multiply across layers. This approach yields generalization bounds for rank and spectral norm constrained networks. We compare our results to prior generalization bounds for deep networks, highlighting how deep networks with low-rank layers can achieve better generalization than those with full-rank layers. Additionally, we discuss how this framework provides new perspectives on the generalization capabilities of deep nets exhibiting neural collapse.	en_US
dc.description.sponsorship	This work was supported by the Center for Brains, Minds and Machines (CBMM), funded by NSF STC award CCF-1231216.	en_US
dc.publisher	Center for Brains, Minds and Machines (CBMM)	en_US
dc.relation.ispartofseries	CBMM Memo;151
dc.subject	Gaussian Complexity, Generalization Bounds, Low Rank Layers, Neural Collapse	en_US
dc.title	On Generalization Bounds for Neural Networks with Low Rank Layers	en_US
dc.type	Article	en_US
dc.type	Technical Report	en_US
dc.type	Working Paper	en_US

Files in this item

Name:: CBMM-Memo-151.pdf
Size:: 697.3Kb
Format:: PDF

View/Open

This item appears in the following Collection(s)

CBMM Memo Series

Show simple item record