| dc.contributor.author | Li, Linday Skylar | en_US |
| dc.date.accessioned | 2025-10-10T12:37:05Z | |
| dc.date.available | 2025-10-10T12:37:05Z | |
| dc.date.issued | 2025-07 | |
| dc.identifier.uri | https://hdl.handle.net/1721.1/163147 | |
| dc.description.abstract | While current results show potential in LMM-based diagnosis, it is unclear if the output of them are backed by strong spatial reasoning capabilities. To evaluate this, I provided GPT-4o with chest X-rays and asked it to return diagnoses and the coordinates of bounding boxes that surrounded any identified abnormalities on the NIH chest X-ray dataset. I find variable performance across different images in the dataset, suggesting the need for further development of the spatial reasoning capabilities of LMMs. | en_US |
| dc.title | Evaluating the Spatial Reasoning Capabilities of Large Multimodal Models on Chest X-Ray Anomaly Detection | en_US |
| dc.type | Article | en_US |
| dc.relation.journal | 2025 MIT AI and Education Summit | |
| dc.type.uri | http://purl.org/eprint/type/ConferencePaper | en_US |