ModelDiff: A Framework for Comparing Learning Algorithms

Shah, Harshay

dc.contributor.advisor	Mądry, Aleksander
dc.contributor.author	Shah, Harshay
dc.date.accessioned	2025-11-17T19:06:38Z
dc.date.available	2025-11-17T19:06:38Z
dc.date.issued	2025-05
dc.date.submitted	2025-08-14T19:33:18.790Z
dc.identifier.uri	https://hdl.handle.net/1721.1/163675
dc.description.abstract	We study the problem of (learning) algorithm comparison, where the goal is to find differences between models trained with two different learning algorithms. We begin by formalizing this goal as one of finding distinguishing feature transformations, i.e., input transformations that change the predictions of models trained with one learning algorithm but not the other. We then present ModelDiff, a method that leverages the datamodels framework (Ilyas et al., 2022) to compare learning algorithms based on how they use their training data. We demonstrate ModelDiff through three case studies, comparing models trained with/without data augmentation, with/without pre-training, and with different SGD hyperparameters. Our code is available at https://github.com/MadryLab/modeldiff.
dc.publisher	Massachusetts Institute of Technology
dc.rights	Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0)
dc.rights	Copyright retained by author(s)
dc.rights.uri	https://creativecommons.org/licenses/by-nc-nd/4.0/
dc.title	ModelDiff: A Framework for Comparing Learning Algorithms
dc.type	Thesis
dc.description.degree	S.M.
dc.contributor.department	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
mit.thesis.degree	Master
thesis.degree.name	Master of Science in Electrical Engineering and Computer Science

Files in this item

Name: shah-harshay-sm-eecs-2025-thes ...
Size: 6.303Mb
Format: PDF
Description: Thesis PDF

This item appears in the following Collection(s)

Graduate Theses
