Sherman defends dissertation

Garrick Sherman successfully defended his PhD dissertation, "Document Expansion and Language Model Re-estimation for Information Retrieval," on August 22.

His committee included Associate Professor Jana Diesner, chair and director of research; Professor J. Stephen Downie; Professor Ted Underwood; and Associate Professor Jaime Arguello of the University of North Carolina at Chapel Hill.

From the abstract: Document expansion is the process of augmenting the text of a document with text drawn from one or more other documents. The purpose of this expansion is to increase the size of the term sample from which document representations, such as language models, may be estimated. While document expansion has been shown to improve the effectiveness of ad-hoc document retrieval, our work differs from previous work in a variety of ways. We propose a consistent language modeling approach to document expansion of full length documents. We also explore the use of one or more external document collections as sources of data during the expansion process. Our proposed methods prove successful in improving retrieval effectiveness over baselines. We also acknowledge that existing document expansion work, including our own, has relied on intuitive assumptions about the mechanisms by which it achieves its effects. In this thesis, we quantify aspects of document language model change resulting from expansion . . . Recognizing the potential for further retrieval effectiveness improvement by means of selective application of our model, we investigate methods for automatically predicting whether or not to expand individual documents and, if so, which expansion collection may yield the optimal document representation. We find that, although the document expansion retrieval model has proven effective overall, accurate prediction concerning the expansion of a given document depends too heavily on predicting the document's relevance.

Updated on
Backto the news archive

Related News

Get to Know Fi Keane, MSLIS Student

MSLIS student Fi Keane found their way to the iSchool after working in the children’s section of a bookstore. For the past academic year, they have served as the graduate assistant for Research on Diversity in Youth Literature, a peer-reviewed, open access, academic journal hosted by the iSchool’s Center for Children’s Books.

Fi Keane

Kaushik defends dissertation

Doctoral candidate Smirity Kaushik successfully defended her dissertation, "Digital Trust, Safety, and Privacy in the Age of Emerging Technologies," on June 16. 

Smirity Kaushik

New book explores how AI is reshaping cultural heritage

Glen Layne-Worthey, associate director for research support services for the HathiTrust Research Center (HTRC), and J. Stephen Downie, professor and HTRC co-director, have edited a new book, Navigating Artificial Intelligence for Cultural Heritage Organisations, which was recently released by UCL Press. 

Han defends dissertation

Doctoral candidate Yingying Han successfully defended her dissertation, "Community Archives as Agency: Documenting Chinese American Experiences in the U.S.,” on May 28.

Yingying Han

Student award recipients announced

The School of Information Sciences recognized student award recipients at the iSchool Convocation on May 18. Awards are based on academic achievements as well as attributes that contribute to professional success. For more information about each award, including past recipients, visit the Student Awards page. Congratulations to this year's honorees!

Award recipients Mahir Thakkar, Delia Kerr-Dennhardt, Katie Skoufes, Audrey Bentch, and Adam Beaty.