School of Information Sciences

Underwood receives NEH grant to investigate consequences of error in digital libraries

Ted Underwood, Professor

Professor Ted Underwood has received a $73,122 grant from the National Endowment for the Humanities to investigate the consequences of error in digital libraries. While digital libraries represent an immense storehouse of knowledge, the texts are full of errors because of the imperfect process by which they are transcribed optically.

"It isn't unusual for five percent of the words in volumes to be mistranscribed, with the level of error much higher in some volumes," said Underwood. "Simply measuring the fraction of mistranscribed words is easy. It’s harder to know how much difference those errors make for the methods and questions that actually interest researchers. Some forms of analysis are undisturbed by high levels of error; others may be quite sensitive, especially when errors are distributed unevenly across different historical periods and genres."

Underwood will work with graduate students from the iSchool and English Department to construct parallel collections that pair each "clean" text with a realistically error-ridden version of the same book drawn from a digital library. The team will build collections of Chinese texts as well as English texts ranging from 1700 to the present, because different character sets and printing technologies produce different kinds of error. Then the team will apply a wide range of data-mining methods to both the clean and error-ridden collections and measure the distortion produced by transcription error and other common sources of noise. The project will provide tools that help other researchers estimate the level of uncertainty in their own conclusions.
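The paired-collection comparison described above can be illustrated with a minimal sketch. It is not the project's actual tooling. The function names, the whitespace tokenizer, and the choice of word-frequency cosine similarity as the "downstream" measure are all assumptions made for illustration. The sketch computes the fraction of mistranscribed words in a clean/noisy pair, then checks how far one simple analysis drifts between the two versions.

```python
from collections import Counter
from difflib import SequenceMatcher
import math


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately crude tokenizer)."""
    return text.lower().split()


def word_error_fraction(clean_tokens, noisy_tokens):
    """Fraction of clean tokens with no aligned exact match in the noisy text."""
    if not clean_tokens:
        raise ValueError("clean text must contain at least one token")
    matcher = SequenceMatcher(None, clean_tokens, noisy_tokens, autojunk=False)
    matched = sum(block.size for block in matcher.get_matching_blocks())
    return 1.0 - matched / len(clean_tokens)


def frequency_cosine(clean_tokens, noisy_tokens):
    """Cosine similarity of word-count vectors: 1.0 means no distortion."""
    a, b = Counter(clean_tokens), Counter(noisy_tokens)
    dot = sum(count * b[word] for word, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


# Hypothetical paired sample: a clean sentence and an OCR-style corruption.
clean = tokenize("the quick brown fox jumps over the lazy dog")
noisy = tokenize("tbe quick brown fox jumps ouer the lazy dog")

print("word error fraction:", round(word_error_fraction(clean, noisy), 3))
print("frequency cosine:", round(frequency_cosine(clean, noisy), 3))
```

The two numbers answer different questions. The first is the raw error rate that Underwood calls easy to measure. The second stands in for the harder question of how much a given method's output changes when it runs on the error-ridden copy instead of the clean one.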

"No data is perfect. There's always some kind of error. The question is whether the error is of a kind and magnitude likely to matter for a particular question," he said.

Underwood is a professor in the iSchool and also holds an appointment with the Department of English in the College of Liberal Arts and Sciences. He has authored three books about literary history: Distant Horizons (University of Chicago Press, 2019), Why Literary Periods Mattered: Historical Contrast and the Prestige of English Studies (Stanford University Press, 2013), and The Work of the Sun: Literature, Science and Political Economy 1760-1860 (Palgrave, 2005). His articles have appeared in PMLA, Representations, MLQ, and Cultural Analytics. Underwood earned his PhD in English from Cornell University.


