Show simple item record

dc.contributor.advisorMatusik, Wojciech
dc.contributor.advisorDaptardar, Ajay
dc.contributor.authorWang, Jialan
dc.date.accessioned2024-01-16T21:52:12Z
dc.date.available2024-01-16T21:52:12Z
dc.date.issued2023-09
dc.date.submitted2023-12-05T17:30:48.141Z
dc.identifier.urihttps://hdl.handle.net/1721.1/153338
dc.description.abstractMercari is an online two-sided marketplace that allows users to both sell and purchase items. To create the most efficient item listing process for the sellers and bring the most relevant items to the buyers, Mercari utilizes a pre-trained model called Contrastive Language-Image Pre-training (CLIP), famed for its exceptional zero-shot performances, to support the auto-filling feature for item listing and similar items recommendation. As this model is pre-trained on a general dataset gathered from the Internet, which likely does not have the same data distribution as Mercari’s data and results in non-optimal performance, we would like to explore the possibility of pre-training or fine-tuning CLIP with Mercari’s data to improve its performance within Mercari’s data domain. We explore various training strategies to understand the effects of each and determine the most effective strategy. Our best-performing and most space-efficient model achieves a brand prediction top-1 accuracy of 89.34% with 49.89% coverage and a category prediction accuracy of 78.02% with 69.62% coverage, significantly outperforming the current zero-shot CLIP in brand prediction and marginally in category prediction. Moreover, it achieves this with an embedding size that is half of that of the original CLIP.
dc.publisherMassachusetts Institute of Technology
dc.rightsIn Copyright - Educational Use Permitted
dc.rightsCopyright retained by author(s)
dc.rights.urihttps://rightsstatements.org/page/InC-EDU/1.0/
dc.titleThe Effects of Pre-Training and Fine-Tuning CLIP with Domain-Specific Data
dc.typeThesis
dc.description.degreeM.Eng.
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
mit.thesis.degreeMaster
thesis.degree.nameMaster of Engineering in Electrical Engineering and Computer Science


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record