Show simple item record

dc.contributor.authorLi, Linday Skylaren_US
dc.date.accessioned2025-10-10T12:37:05Z
dc.date.available2025-10-10T12:37:05Z
dc.date.issued2025-07
dc.identifier.urihttps://hdl.handle.net/1721.1/163147
dc.description.abstractWhile current results show potential in LMM-based diagnosis, it is unclear if the output of them are backed by strong spatial reasoning capabilities. To evaluate this, I provided GPT-4o with chest X-rays and asked it to return diagnoses and the coordinates of bounding boxes that surrounded any identified abnormalities on the NIH chest X-ray dataset. I find variable performance across different images in the dataset, suggesting the need for further development of the spatial reasoning capabilities of LMMs.en_US
dc.titleEvaluating the Spatial Reasoning Capabilities of Large Multimodal Models on Chest X-Ray Anomaly Detectionen_US
dc.typeArticleen_US
dc.relation.journal2025 MIT AI and Education Summit
dc.type.urihttp://purl.org/eprint/type/ConferencePaperen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record