School of Information Sciences

Underwood receives NEH grant to investigate consequences of error in digital libraries

Ted Underwood
Ted Underwood, Professor

Professor Ted Underwood has received a $73,122 grant from the National Endowment for the Humanities to investigate the consequences of error in digital libraries. While digital libraries represent an immense storehouse of knowledge, the texts are full of errors because of the imperfect process by which they are transcribed optically.

"It isn't unusual for five percent of the words in volumes to be mistranscribed, with the level of error much higher in some volumes," said Underwood. "Simply measuring the fraction of mistranscribed words is easy. It’s harder to know how much difference those errors make for the methods and questions that actually interest researchers. Some forms of analysis are undisturbed by high levels of error; others may be quite sensitive, especially when errors are distributed unevenly across different historical periods and genres."

Underwood will work with graduate students from the iSchool and English Department to construct parallel collections that pair each "clean" text with a realistically error-ridden version of the same book drawn from a digital library. The team will build collections of Chinese texts as well as English texts ranging from 1700 to the present, because different character sets and printing technologies produce different kinds of error. Then the team will apply a wide range of data-mining methods to both the clean and error-ridden collections and measure the distortion produced by transcription error and other common sources of noise. The project will provide tools that help other researchers estimate the level of uncertainty in their own conclusions.

"No data is perfect. There's always some kind of error. The question is whether the error is of a kind and magnitude likely to matter for a particular question," he said.

Underwood is a professor in the iSchool and also holds an appointment with the Department of English in the College of Liberal Arts and Sciences. He has authored three books about literary history, including Distant Horizons (The University of Chicago Press Books, 2019), Why Literary Periods Mattered: Historical Contrast and the Prestige of English Studies (Stanford University Press, 2013), and The Work of the Sun: Literature, Science and Political Economy 1760-1860 (New York: Palgrave, 2005). His articles have appeared in PMLA, Representations, MLQ, and Cultural Analytics. Underwood earned his PhD in English from Cornell University.

Updated on
Backto the news archive

Related News

Kraus wins 2026 Pulitzer Prize Award in Fiction

iSchool alumnus and New York Times bestselling author Daniel Kraus (MSLIS '05) has won the 2026 Pulitzer Prize in Fiction for Angel Down. Kraus, a prolific writer whose works span several genres—children's fiction, horror, science fiction, graphic novels, and comics—learned the good news last week.

Daniel Kraus 2026

Raji invited to join UN Working Expert Group

PhD student Mubarak Raji has been invited to join the Working Expert Group on AI Governance Interoperability. This group operates under the United Nations Office for Digital and Emerging Technologies' new AI Governance for Humanity Lab. It supports the Secretary-General's High-level Advisory Body on AI by providing evidence-based analysis for the Global Dialogue on AI Governance, which will be held in July 2026 in Geneva, Switzerland.

Mubarak Raji headshot

Faculty and staff recognized with inaugural iSchool awards

The iSchool recognized faculty and staff for their contributions to teaching and outstanding service to the School at a ceremony on May 6. Interim Dean Emily Knox presented plaques to the inaugural recipients of the Faculty Teaching Award, Adjunct Teaching Award, and Staff Excellence Award.

Paper by He's lab recognized at ICLR 2026 workshop

The iDEA-iSAIL Joint Laboratory at the University of Illinois received an Outstanding Paper Award at the International Conference on Learning Representations (ICLR) 2026 Logical Reasoning of Large Language Models Workshop for their paper, "RAG Over Tables: Hierarchical Memory Index, Multi-State Retrieval, and Benchmarking." Paper authors include lab members Jingrui He, professor and MSIM program director; Sirui Chen, Xinrui He, and Zihao Li, computer science PhD students; Jiaru Zou, computer science MS student; Dongqi Fu, alum; as well as Jiawei Han, professor of computer science, and Yada Zhu, IBM collaborator. Chen gave an oral presentation of the research at the workshop, which was held last month in Rio de Janeiro, Brazil. This award was selected out of 206 accepted papers at the workshop.

Jingrui He

School of Information Sciences

501 E. Daniel St.

MC-493

Champaign, IL

61820-6211

Voice: (217) 333-3280

Email: ischool@illinois.edu

Back to top