Show simple item record

dc.contributor.authorSchofield, Alexandra
dc.contributor.authorWu, Siqi
dc.contributor.authorBayard de Volo, Theo
dc.contributor.authorKuze, Tatsuki
dc.contributor.authorGomez, Alfredo
dc.contributor.authorSultana, Sharifa
dc.date.accessioned2025-02-11T17:25:36Z
dc.date.available2025-02-11T17:25:36Z
dc.date.issued2025-01-10
dc.identifier.issn2573-0142
dc.identifier.urihttps://hdl.handle.net/1721.1/158190
dc.description.abstractPractitioners dealing with large text collections frequently use topic models such as Latent Dirichlet Allocation (LDA) and Non-negative Matrix Factorization (NMF) in their projects to explore trends. Despite twenty years of accrued advancement in natural language processing tools, these models are found to be slow and challenging to apply to text exploration projects. In our work, we engaged with practitioners (n=15) who use topic modeling to explore trends in large text collections to understand their project workflows and investigate which factors often slow down the processes and how they deal with such errors and interruptions in automated topic modeling. Our findings show that practitioners are required to diagnose and resolve context-specific problems with preparing data and models and need control for these steps, especially for data cleaning and parameter selection. Our major findings resonate with existing work across CSCW, computational social science, machine learning, data science, and digital humanities. They also leave us questioning whether automation is actually a useful goal for tools designed for topic models and text exploration.en_US
dc.publisherAssociation for Computing Machineryen_US
dc.relation.isversionofhttps://doi.org/10.1145/3701201en_US
dc.rightsArticle is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.en_US
dc.sourceAssociation for Computing Machineryen_US
dc.title"My Very Subjective Human Interpretation": Domain Expert Perspectives on Navigating the Text Analysis Loop for Topic Modelsen_US
dc.typeArticleen_US
dc.identifier.citationSchofield, Alexandra, Wu, Siqi, Bayard de Volo, Theo, Kuze, Tatsuki, Gomez, Alfredo et al. 2025. ""My Very Subjective Human Interpretation": Domain Expert Perspectives on Navigating the Text Analysis Loop for Topic Models." Proceedings of the ACM on Human-Computer Interaction, 9 (GROUP).
dc.relation.journalProceedings of the ACM on Human-Computer Interactionen_US
dc.identifier.mitlicensePUBLISHER_CC
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dc.date.updated2025-02-01T08:57:07Z
dc.language.rfc3066en
dc.rights.holderThe author(s)
dspace.date.submission2025-02-01T08:57:08Z
mit.journal.volume9en_US
mit.journal.issueGROUPen_US
mit.licensePUBLISHER_POLICY
mit.metadata.statusAuthority Work and Publication Information Neededen_US


Files in this item

Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record