Sherman defends dissertation

Garrick Sherman
Garrick Sherman

Garrick Sherman successfully defended his PhD dissertation, "Document Expansion and Language Model Re-estimation for Information Retrieval," on August 22.

His committee included Associate Professor Jana Diesner, chair and director of research; Professor J. Stephen Downie; Professor Ted Underwood; and Associate Professor Jaime Arguello of the University of North Carolina at Chapel Hill.

From the abstract: Document expansion is the process of augmenting the text of a document with text drawn from one or more other documents. The purpose of this expansion is to increase the size of the term sample from which document representations, such as language models, may be estimated. While document expansion has been shown to improve the effectiveness of ad-hoc document retrieval, our work differs from previous work in a variety of ways. We propose a consistent language modeling approach to document expansion of full length documents. We also explore the use of one or more external document collections as sources of data during the expansion process. Our proposed methods prove successful in improving retrieval effectiveness over baselines. We also acknowledge that existing document expansion work, including our own, has relied on intuitive assumptions about the mechanisms by which it achieves its effects. In this thesis, we quantify aspects of document language model change resulting from expansion . . . Recognizing the potential for further retrieval effectiveness improvement by means of selective application of our model, we investigate methods for automatically predicting whether or not to expand individual documents and, if so, which expansion collection may yield the optimal document representation. We find that, although the document expansion retrieval model has proven effective overall, accurate prediction concerning the expansion of a given document depends too heavily on predicting the document's relevance.

Updated on
Backto the news archive

Related News

Diesner to present research at conference on scientific data quality

Associate Professor and PhD Program Director Jana Diesner will give an invited talk at the conference "The Data Quality Challenge: Research during the Digital Transformation," which will be hosted by the German Council for Scientific Information Infrastructures on February 27-28 in Hanover, Germany. The conference will examine topics such as research integrity and trust, data quality as a political issue, criteria for the scientific quality of data, the data lifecycle, and data quality standards.

Jana Diesner

Tilley to give keynote at C2E2

Associate Professor Carol Tilley will be a keynote speaker at the Chicago Comic & Entertainment Expo (C2E2), which will be held on February 28-March 1. C2E2 brings together the best of pop culture, including comics, graphic novels, and manga, as well as movies, TV, video games, and more.

Carol Tilley

Researcher Spotlight: Jessie Chin

Jessie Chin is an assistant professor in the iSchool and the principal investigator of The Adaptive Cognition and Interaction Design (ACTION) Lab. Her research aims to advance knowledge in cognitive sciences regarding evolving human interaction with the contemporary information technologies and translating theories in social and behavioral sciences to the design of technologies and interaction experience to promote health communication and behavior across the lifespan. 

Jessie Chin

iSchool researchers organize provenance workshop in Ireland

PhD students Michael Gryk and Jessica Cheng and alumna Rhiannon Bettivia (PhD '16) organized a provenance workshop, which was held on February 17 in conjunction with the 15th International Digital Curation Conference (IDCC) in Dublin, Ireland. 

Worthey joins HathiTrust Research Center

Glen Worthey is the new associate director for research support services in the HathiTrust Research Center (HTRC), a collaboration between the University of Illinois, Indiana University, and the HathiTrust to enable advanced computational access to text found in the HathiTrust Digital Library. He will be based at the iSchool at Illinois.

Glen Worthey