| dc.contributor.advisor | Mądry, Aleksander | |
| dc.contributor.author | Shah, Harshay | |
| dc.date.accessioned | 2025-11-17T19:06:38Z | |
| dc.date.available | 2025-11-17T19:06:38Z | |
| dc.date.issued | 2025-05 | |
| dc.date.submitted | 2025-08-14T19:33:18.790Z | |
| dc.identifier.uri | https://hdl.handle.net/1721.1/163675 | |
| dc.description.abstract | We study the problem of (learning) algorithm comparison, where the goal is to find differences between models trained with two different learning algorithms. We begin by formalizing this goal as one of finding distinguishing feature transformations, i.e., input transformations that change the predictions of models trained with one learning algorithm but not the other. We then present ModelDiff, a method that leverages the datamodels framework (Ilyas et al., 2022) to compare learning algorithms based on how they use their training data. We demonstrate ModelDiff through three case studies, comparing models trained with/without data augmentation, with/without pre-training, and with different SGD hyperparameters. Our code is available at https://github.com/MadryLab/modeldiff. | |
| dc.publisher | Massachusetts Institute of Technology | |
| dc.rights | Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0) | |
| dc.rights | Copyright retained by author(s) | |
| dc.rights.uri | https://creativecommons.org/licenses/by-nc-nd/4.0/ | |
| dc.title | ModelDiff: A Framework for Comparing Learning Algorithms | |
| dc.type | Thesis | |
| dc.description.degree | S.M. | |
| dc.contributor.department | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science | |
| mit.thesis.degree | Master | |
| thesis.degree.name | Master of Science in Electrical Engineering and Computer Science | |