Sherman defends dissertation

Garrick Sherman successfully defended his PhD dissertation, "Document Expansion and Language Model Re-estimation for Information Retrieval," on August 22.

His committee included Associate Professor Jana Diesner, chair and director of research; Professor J. Stephen Downie; Professor Ted Underwood; and Associate Professor Jaime Arguello of the University of North Carolina at Chapel Hill.

From the abstract: Document expansion is the process of augmenting the text of a document with text drawn from one or more other documents. The purpose of this expansion is to increase the size of the term sample from which document representations, such as language models, may be estimated. While document expansion has been shown to improve the effectiveness of ad-hoc document retrieval, our work differs from previous work in a variety of ways. We propose a consistent language modeling approach to document expansion of full length documents. We also explore the use of one or more external document collections as sources of data during the expansion process. Our proposed methods prove successful in improving retrieval effectiveness over baselines. We also acknowledge that existing document expansion work, including our own, has relied on intuitive assumptions about the mechanisms by which it achieves its effects. In this thesis, we quantify aspects of document language model change resulting from expansion . . . Recognizing the potential for further retrieval effectiveness improvement by means of selective application of our model, we investigate methods for automatically predicting whether or not to expand individual documents and, if so, which expansion collection may yield the optimal document representation. We find that, although the document expansion retrieval model has proven effective overall, accurate prediction concerning the expansion of a given document depends too heavily on predicting the document's relevance.

Related News

iSchool alumni and student named 2025 Movers & Shakers

Two iSchool alumni and an MSLIS student are included in Library Journal's 2025 class of Movers & Shakers, an annual list that recognizes 50 professionals who are moving the library field forward as a profession. Leah Gregory (MSLIS '04) was honored in the Advocates category, Billy Tringali (MSLIS '19) was honored in the Innovators category, and University Library Assistant Professor and Digital Humanities Librarian Mary Ton (current MSLIS student) was honored in the Educators category.

Spectrum Scholar Spotlight: Dalia Ortiz Pon

Twelve iSchool master's students were named 2024–2025 Spectrum Scholars by the American Library Association (ALA) Office for Diversity, Literacy, and Outreach Services. This "Spectrum Scholar Spotlight" series highlights the School's scholars. MSLIS student Dalia Ortiz Pon earned her bachelor's degree in Latina/Latino studies from San Francisco State University. 

Debnath datafies "The Bulletin"

MSIM student Tan Debnath, whose interests span data mining, statistical modeling, text mining, and digital humanities, joined the Center for Children's Books as a research assistant. He was tasked with building curation processes that would datafy seventy-five years' worth of archival issues of The Bulletin of the Center for Children's Books, one of the nation's leading children's book review journals.

He receives Amazon Research Award to improve monitoring of Earth’s ecosystem

A new project led by Professor Jingrui He aims to help scientists monitor disruptions to the Earth’s ecosystem, such as climate change. She recently received support for her work through an Amazon Research Award, which includes $60,000 in cash and an additional $40,000 in Amazon Web Services (AWS) credits.
