Sherman defends dissertation

Garrick Sherman successfully defended his PhD dissertation, "Document Expansion and Language Model Re-estimation for Information Retrieval," on August 22.

His committee included Associate Professor Jana Diesner, chair and director of research; Professor J. Stephen Downie; Professor Ted Underwood; and Associate Professor Jaime Arguello of the University of North Carolina at Chapel Hill.

From the abstract: Document expansion is the process of augmenting the text of a document with text drawn from one or more other documents. The purpose of this expansion is to increase the size of the term sample from which document representations, such as language models, may be estimated. While document expansion has been shown to improve the effectiveness of ad-hoc document retrieval, our work differs from previous work in a variety of ways. We propose a consistent language modeling approach to document expansion of full length documents. We also explore the use of one or more external document collections as sources of data during the expansion process. Our proposed methods prove successful in improving retrieval effectiveness over baselines. We also acknowledge that existing document expansion work, including our own, has relied on intuitive assumptions about the mechanisms by which it achieves its effects. In this thesis, we quantify aspects of document language model change resulting from expansion . . . Recognizing the potential for further retrieval effectiveness improvement by means of selective application of our model, we investigate methods for automatically predicting whether or not to expand individual documents and, if so, which expansion collection may yield the optimal document representation. We find that, although the document expansion retrieval model has proven effective overall, accurate prediction concerning the expansion of a given document depends too heavily on predicting the document's relevance.

Updated on
Backto the news archive

Related News

Han defends dissertation

Doctoral candidate Kanyao Han successfully defended his dissertation, "Natural Language Processing for Supporting Impact Assessment of Funded Projects," on January 7, 2025.

Kanyao Han

Pettigrew finds balance as a student-athlete

Isiah Pettigrew started wrestling in his junior year of high school in Palatine, Illinois. He advanced in the sport quickly, placing fourth in his weight class at the state wrestling tournament in his senior year. He signed on with the Illini Wrestling team in 2020 as a freshman and has been wrestling throughout his academic career, which includes earning a bachelor's degree and beginning a master's degree at the iSchool.

Isiah Pettigrew

Get to know Cadence Cordell, MSLIS student

Cadence Cordell was inspired by her undergraduate work experience to pursue a degree in library and information science. She followed in her mother’s footsteps by selecting the iSchool for her MSLIS. After completing a recent research poster presentation, she combined her scholarly pursuit with her hobby by sewing her fabric poster into a squirrel plushie.

Cadence Cordell

Recent graduate committed to making libraries accessible and inclusive

Joshua Short knows firsthand the barriers to public library access that patrons living on modest wages experience. Having grown up in a self-professed "low-income environment," Short has made it his mission to reduce these barriers, such as library fines, inadequate transportation, and limited computer literacy.

Joshua Short

Spectrum Scholar Spotlight: Leslie Lopez

Twelve iSchool master's students were named 2024–2025 Spectrum Scholars by the American Library Association (ALA) Office for Diversity, Literacy, and Outreach Services. This “Spectrum Scholar Spotlight” series highlights the School’s scholars. MSLIS student Leslie Lopez graduated from the University of North Texas with a BA in psychology.

Leslie Lopez headshot