Underwood receives NEH grant to investigate consequences of error in digital libraries

Ted Underwood
Ted Underwood, Professor

Professor Ted Underwood has received a $73,122 grant from the National Endowment for the Humanities to investigate the consequences of error in digital libraries. While digital libraries represent an immense storehouse of knowledge, the texts are full of errors because of the imperfect process by which they are transcribed optically.

"It isn't unusual for five percent of the words in volumes to be mistranscribed, with the level of error much higher in some volumes," said Underwood. "Simply measuring the fraction of mistranscribed words is easy. It’s harder to know how much difference those errors make for the methods and questions that actually interest researchers. Some forms of analysis are undisturbed by high levels of error; others may be quite sensitive, especially when errors are distributed unevenly across different historical periods and genres."

Underwood will work with graduate students from the iSchool and English Department to construct parallel collections that pair each "clean" text with a realistically error-ridden version of the same book drawn from a digital library. The team will build collections of Chinese texts as well as English texts ranging from 1700 to the present, because different character sets and printing technologies produce different kinds of error. Then the team will apply a wide range of data-mining methods to both the clean and error-ridden collections and measure the distortion produced by transcription error and other common sources of noise. The project will provide tools that help other researchers estimate the level of uncertainty in their own conclusions.

"No data is perfect. There's always some kind of error. The question is whether the error is of a kind and magnitude likely to matter for a particular question," he said.

Underwood is a professor in the iSchool and also holds an appointment with the Department of English in the College of Liberal Arts and Sciences. He has authored three books about literary history, including Distant Horizons (The University of Chicago Press Books, 2019), Why Literary Periods Mattered: Historical Contrast and the Prestige of English Studies (Stanford University Press, 2013), and The Work of the Sun: Literature, Science and Political Economy 1760-1860 (New York: Palgrave, 2005). His articles have appeared in PMLA, Representations, MLQ, and Cultural Analytics. Underwood earned his PhD in English from Cornell University.

Updated on
Backto the news archive

Related News

iSchool alumni and student named 2025 Movers & Shakers

Two iSchool alumni and an MSLIS student are included in Library Journal's 2025 class of Movers & Shakers, an annual list that recognizes 50 professionals who are moving the library field forward as a profession. Leah Gregory (MSLIS '04) was honored in the Advocates category, Billy Tringali (MSLIS '19) was honored in the Innovators category, and University Library Assistant Professor and Digital Humanities Librarian Mary Ton (current MSLIS student) was honored in the Educators category.

Spectrum Scholar Spotlight: Dalia Ortiz Pon

Twelve iSchool master's students were named 2024–2025 Spectrum Scholars by the American Library Association (ALA) Office for Diversity, Literacy, and Outreach Services. This "Spectrum Scholar Spotlight" series highlights the School's scholars. MSLIS student Dalia Ortiz Pon earned her bachelor's degree in Latina/Latino studies from San Francisco State University. 

Dalia Ortiz Pon

Debnath datafies "The Bulletin"

MSIM student Tan Debnath, whose interests span data mining, statistical modeling, text mining, and digital humanities, joined the Center for Children's books as a research assistant. He was tasked with building curation processes that would datafy seventy-five years' worth of archival issues of The Bulletin of the Center for Children's Books, one of the nation's leading children's book review journals.

Tan Debnath stands casually with his hands in his pockets and smiles broadly at the camera. It's a sunny day

He receives Amazon Research Award to improve monitoring of Earth’s ecosystem

A new project led by Professor Jingrui He aims to help scientists monitor disruptions to the Earth’s ecosystem, such as climate change. She recently received support for her work through an Amazon Research Award, which includes $60,000 in cash and an additional $40,000 in Amazon Web Services (AWS) credits.

Jingrui He

iSchool undergraduates selected as 2025 Community-Academic Scholars

The Interdisciplinary Health Sciences Institute (IHSI) has selected BSIS student Dhanvi Puttur and BSIS+DS student Lara Terpetschnig as 2025 Community-Academic Scholars. Representing nineteen majors and nine minors in eight colleges and schools at the University of Illinois Urbana-Champaign and two additional universities, the eighteen scholars in this cohort encompass diverse fields of study, from community health to graphic design to statistics. 

BSIS+DS student Lara Terpetschnig and BSIS student Dhanvi Puttur