School of Information Sciences

Sherman defends dissertation

Garrick Sherman successfully defended his PhD dissertation, "Document Expansion and Language Model Re-estimation for Information Retrieval," on August 22.

His committee included Associate Professor Jana Diesner, chair and director of research; Professor J. Stephen Downie; Professor Ted Underwood; and Associate Professor Jaime Arguello of the University of North Carolina at Chapel Hill.

From the abstract: Document expansion is the process of augmenting the text of a document with text drawn from one or more other documents. The purpose of this expansion is to increase the size of the term sample from which document representations, such as language models, may be estimated. While document expansion has been shown to improve the effectiveness of ad-hoc document retrieval, our work differs from previous work in a variety of ways. We propose a consistent language modeling approach to document expansion of full length documents. We also explore the use of one or more external document collections as sources of data during the expansion process. Our proposed methods prove successful in improving retrieval effectiveness over baselines. We also acknowledge that existing document expansion work, including our own, has relied on intuitive assumptions about the mechanisms by which it achieves its effects. In this thesis, we quantify aspects of document language model change resulting from expansion . . . Recognizing the potential for further retrieval effectiveness improvement by means of selective application of our model, we investigate methods for automatically predicting whether or not to expand individual documents and, if so, which expansion collection may yield the optimal document representation. We find that, although the document expansion retrieval model has proven effective overall, accurate prediction concerning the expansion of a given document depends too heavily on predicting the document's relevance.


Related News

Downie presents TORCHLITE in Germany

This week, Professor and Executive Associate Dean J. Stephen Downie was a guest speaker at the Herder Institute in Marburg and the University of Göttingen. Downie, who serves as co-director of the HathiTrust Research Center (HTRC), lectured on the HTRC's "Tools for Open Research and Computation with HathiTrust: Leveraging Intelligent Text Extraction" (TORCHLITE) project.

Internship Spotlight: San Francisco Public Library

PhD student Adebola Obayemi discusses her internship with the San Francisco Public Library, where she worked on the Expanding Information Access for Incarcerated People initiative. She has been invited to present her proposal on digital literacy for incarcerated populations at the Expanding Information Access for Incarcerated People Convening, which will be held in June in Chicago.

Undergraduate Research Symposium features iSchool researchers

The iSchool is well represented in the 19th annual Undergraduate Research Symposium, which will be held on April 30 from 9:00 a.m.-5:00 p.m. in the Illini Union. The iSchool is a Gold Sponsor of the symposium, which spotlights undergraduate research through oral and poster presentations, creative performances, and art exhibits.

Vaez Afshar selected as 2026 APT Student Scholar

The Association for Preservation Technology (APT) International has named Informatics PhD student Sepehr Vaez Afshar as a 2026 Student Scholar. Established in 1985, the APT Student Scholarship annually recognizes ten students worldwide whose work advances preservation technology through innovative and impactful approaches.

Nguyen receives Critical Language Scholarship

MSLIS student Christine Nguyen has been awarded a U.S. Department of State Critical Language Scholarship (CLS) to study Japanese this summer. She is one of four University of Illinois Urbana-Champaign students who received full scholarships to spend 8-10 weeks abroad and study one of 14 critical languages. The program is part of an initiative to expand the number of Americans studying and mastering critical foreign languages and cultural skills to enable them to contribute to U.S. economic competitiveness and national security.

