School of Information Sciences

Sherman defends dissertation

Garrick Sherman successfully defended his PhD dissertation, "Document Expansion and Language Model Re-estimation for Information Retrieval," on August 22.

His committee included Associate Professor Jana Diesner, chair and director of research; Professor J. Stephen Downie; Professor Ted Underwood; and Associate Professor Jaime Arguello of the University of North Carolina at Chapel Hill.

From the abstract: Document expansion is the process of augmenting the text of a document with text drawn from one or more other documents. The purpose of this expansion is to increase the size of the term sample from which document representations, such as language models, may be estimated. While document expansion has been shown to improve the effectiveness of ad-hoc document retrieval, our work differs from previous work in a variety of ways. We propose a consistent language modeling approach to document expansion of full length documents. We also explore the use of one or more external document collections as sources of data during the expansion process. Our proposed methods prove successful in improving retrieval effectiveness over baselines. We also acknowledge that existing document expansion work, including our own, has relied on intuitive assumptions about the mechanisms by which it achieves its effects. In this thesis, we quantify aspects of document language model change resulting from expansion . . . Recognizing the potential for further retrieval effectiveness improvement by means of selective application of our model, we investigate methods for automatically predicting whether or not to expand individual documents and, if so, which expansion collection may yield the optimal document representation. We find that, although the document expansion retrieval model has proven effective overall, accurate prediction concerning the expansion of a given document depends too heavily on predicting the document's relevance.

Updated on
Backto the news archive

Related News

PhD student Meng Li wins iSchool T-shirt design contest

PhD student Meng Li's research focuses on neuro-symbolic AI, with an emphasis on using syntactic analysis and large language models (LLMs) to understand Python notebooks. This cutting-edge research keeps Li "super busy" for much of the term, but in August, she took a brief break from her work and shifted her focus to designing the winning entry for the iSchool T-shirt contest.

While the idea of the design "just popped into my mind," Li has been thinking about the contest for years.

Meng Li wears the T-shirt with her winning design. The shirt is dark blue, with a hand-sketched wave in white, while the figure and surf board are in Illini Orange.

Jiang defends dissertation

PhD candidate Xiaoliang Jiang successfully defended his dissertation, "Identifying Place Names in Scientific Writing Based on Language Models, Linked Data, and Metadata," on November 10. 

Xiaoliang Jiang

Vaez Afshar named APT Student Scholar

Informatics PhD student Sepehr Vaez Afshar has been named a Student Scholar by the Association for Preservation Technology (APT). Each year, around ten students are selected worldwide for the scholarship program based on the quality and innovation of their research abstracts, as well as their contribution to the field of preservation technology. Scholars are paired with mentors from the APT College of Fellows, prepare and present their research during the association's annual conference, and enjoy opportunities for long-term professional networking and mentorship within the preservation community.

Sepehr Vaez Afshar

iSchool well represented at ASIS&T 2025

iSchool faculty, staff, and students will participate in the 88th Annual Meeting of the Association for Information Science and Technology (ASIS&T), which will be held on November 14-18 in Arlington, Virginia. ASIS&T will also host a Virtual Satellite Meeting on December 11-12. 

Kang makes sense of too much information

As an MSIM student at the iSchool, Zhanchen Kang is passionate about helping people make sense of the overwhelming amount of information in their daily lives. Kang earned an undergraduate degree in information systems in China before coming to the University of Illinois to further explore how technology, data, and people intersect. 

Zhanchen Kang

School of Information Sciences

501 E. Daniel St.

MC-493

Champaign, IL

61820-6211

Voice: (217) 333-3280

Fax: (217) 244-3302

Email: ischool@illinois.edu

Back to top