
dc.contributor.advisor: Boris Katz
dc.contributor.author: Marton, Gregory
dc.contributor.other: Infolab
dc.date.accessioned: 2006-01-10T18:47:00Z
dc.date.available: 2006-01-10T18:47:00Z
dc.date.issued: 2006-01-09
dc.identifier.uri: http://hdl.handle.net/1721.1/30604
dc.description.abstract: TREC Definition and Relationship questions are evaluated on the basis of information nuggets that may be contained in system responses. Human evaluators provide informal descriptions of each nugget, and judgements (assignments of nuggets to responses) for each response submitted by participants. The best present automatic evaluation for these kinds of questions is Pourpre. Pourpre uses a stemmed unigram similarity of responses with nugget descriptions, yielding an aggregate result that is difficult to interpret, but is useful for relative comparison. Nuggeteer, by contrast, uses both the human descriptions and the human judgements, and makes binary decisions about each response, so that the end result is as interpretable as the official score. I explore n-gram length, use of judgements, stemming, and term weighting, and provide a new algorithm quantitatively comparable to, and qualitatively better than, the state of the art.
dc.format.extent: 15 p.
dc.format.extent: 236402 bytes
dc.format.mimetype: application/pdf
dc.language.iso: en_US
dc.relation.ispartofseries: Massachusetts Institute of Technology Computer Science and Artificial Intelligence Laboratory
dc.subject: natural language
dc.subject: question answering
dc.title: Nuggeteer: Automatic Nugget-Based Evaluation Using Descriptions and Judgements
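
The abstract describes two scoring styles: Pourpre's stemmed unigram similarity against nugget descriptions, and Nuggeteer's binary per-response containment decisions. The Python fragment below is a minimal sketch of that general recipe, not the actual Pourpre or Nuggeteer implementation; the crude_stem helper and the 0.5 threshold are hypothetical stand-ins for a real stemmer (e.g., Porter) and a tuned decision boundary.

# Illustrative sketch only: not the actual Pourpre or Nuggeteer code.
# Scores a system response against a nugget description using stemmed
# unigram overlap, then makes a binary containment judgement by
# comparing recall of the nugget's stemmed unigrams to a threshold.

def crude_stem(token: str) -> str:
    """Stand-in for a real stemmer: strip a few common suffixes."""
    for suffix in ("ing", "ed", "es", "s"):
        if token.endswith(suffix) and len(token) > len(suffix) + 2:
            return token[: -len(suffix)]
    return token

def stemmed_unigrams(text: str) -> set[str]:
    """Lowercase, tokenize on whitespace, and stem each token."""
    return {crude_stem(word) for word in text.lower().split()}

def contains_nugget(response: str, nugget_description: str,
                    threshold: float = 0.5) -> bool:
    """Binary judgement: does the response appear to cover the nugget?"""
    nugget = stemmed_unigrams(nugget_description)
    overlap = nugget & stemmed_unigrams(response)
    return len(overlap) / max(len(nugget), 1) >= threshold

# Example usage:
print(contains_nugget(
    "Hale-Bopp was discovered independently by two amateur astronomers",
    "discovered by amateur astronomers"))  # True

Aggregating such similarity scores directly gives a Pourpre-style relative measure; thresholding each response into a yes/no judgement, as above, is what makes a Nuggeteer-style score interpretable against the official scoring.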

