Underwood receives NEH grant to investigate consequences of error in digital libraries

Ted Underwood, Professor

Professor Ted Underwood has received a $73,122 grant from the National Endowment for the Humanities to investigate the consequences of error in digital libraries. While digital libraries represent an immense storehouse of knowledge, the texts are full of errors because of the imperfect process by which they are transcribed optically.

"It isn't unusual for five percent of the words in volumes to be mistranscribed, with the level of error much higher in some volumes," said Underwood. "Simply measuring the fraction of mistranscribed words is easy. It’s harder to know how much difference those errors make for the methods and questions that actually interest researchers. Some forms of analysis are undisturbed by high levels of error; others may be quite sensitive, especially when errors are distributed unevenly across different historical periods and genres."

Underwood will work with graduate students from the iSchool and English Department to construct parallel collections that pair each "clean" text with a realistically error-ridden version of the same book drawn from a digital library. The team will build collections of Chinese texts as well as English texts ranging from 1700 to the present, because different character sets and printing technologies produce different kinds of error. Then the team will apply a wide range of data-mining methods to both the clean and error-ridden collections and measure the distortion produced by transcription error and other common sources of noise. The project will provide tools that help other researchers estimate the level of uncertainty in their own conclusions.
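The paired-collection idea can be illustrated with a minimal Python sketch. It is not the project's actual tooling: the function names, the whitespace tokenizer, and the toy sentences are all assumptions made for illustration. It aligns a "clean" text with a noisy transcription of the same passage, estimates the fraction of mistranscribed words, and then checks how far one simple statistic (the relative frequency of a word) shifts between the two versions.

```python
from collections import Counter
from difflib import SequenceMatcher


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately crude tokenizer)."""
    return text.lower().split()


def mistranscribed_fraction(clean_tokens, noisy_tokens):
    """Estimate the share of clean tokens with no aligned exact match in the noisy text."""
    matcher = SequenceMatcher(a=clean_tokens, b=noisy_tokens, autojunk=False)
    matched = sum(block.size for block in matcher.get_matching_blocks())
    return 1 - matched / len(clean_tokens)


def relative_frequency(tokens, word):
    """Occurrences of `word` divided by the total number of tokens."""
    return Counter(tokens)[word] / len(tokens)


# Hypothetical paired passage: the noisy version imitates common OCR confusions
# (long s read as "f", "h" read as "b").
clean = tokenize("the sun rose over the silent house and the sea")
noisy = tokenize("the fun rose over tbe silent houfe and the sea")

error_rate = mistranscribed_fraction(clean, noisy)
distortion = abs(relative_frequency(clean, "the") - relative_frequency(noisy, "the"))

print(f"estimated word error rate: {error_rate:.2f}")
print(f"shift in relative frequency of 'the': {distortion:.2f}")
```

A real study would replace the single word frequency with the methods researchers actually use, such as classifiers or topic models, and compare their results on the clean and error-ridden collections in the same way.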

"No data is perfect. There's always some kind of error. The question is whether the error is of a kind and magnitude likely to matter for a particular question," he said.

Underwood is a professor in the iSchool and also holds an appointment with the Department of English in the College of Liberal Arts and Sciences. He has authored three books about literary history: Distant Horizons (University of Chicago Press, 2019), Why Literary Periods Mattered: Historical Contrast and the Prestige of English Studies (Stanford University Press, 2013), and The Work of the Sun: Literature, Science and Political Economy 1760-1860 (Palgrave, 2005). His articles have appeared in PMLA, Representations, MLQ, and Cultural Analytics. Underwood earned his PhD in English from Cornell University.

