Underwood receives NEH grant to investigate consequences of error in digital libraries

Ted Underwood
Ted Underwood, Professor

Professor Ted Underwood has received a $73,122 grant from the National Endowment for the Humanities to investigate the consequences of error in digital libraries. While digital libraries represent an immense storehouse of knowledge, the texts are full of errors because of the imperfect process by which they are transcribed optically.

"It isn't unusual for five percent of the words in volumes to be mistranscribed, with the level of error much higher in some volumes," said Underwood. "Simply measuring the fraction of mistranscribed words is easy. It’s harder to know how much difference those errors make for the methods and questions that actually interest researchers. Some forms of analysis are undisturbed by high levels of error; others may be quite sensitive, especially when errors are distributed unevenly across different historical periods and genres."

Underwood will work with graduate students from the iSchool and English Department to construct parallel collections that pair each "clean" text with a realistically error-ridden version of the same book drawn from a digital library. The team will build collections of Chinese texts as well as English texts ranging from 1700 to the present, because different character sets and printing technologies produce different kinds of error. Then the team will apply a wide range of data-mining methods to both the clean and error-ridden collections and measure the distortion produced by transcription error and other common sources of noise. The project will provide tools that help other researchers estimate the level of uncertainty in their own conclusions.

"No data is perfect. There's always some kind of error. The question is whether the error is of a kind and magnitude likely to matter for a particular question," he said.

Underwood is a professor in the iSchool and also holds an appointment with the Department of English in the College of Liberal Arts and Sciences. He has authored three books about literary history, including Distant Horizons (The University of Chicago Press Books, 2019), Why Literary Periods Mattered: Historical Contrast and the Prestige of English Studies (Stanford University Press, 2013), and The Work of the Sun: Literature, Science and Political Economy 1760-1860 (New York: Palgrave, 2005). His articles have appeared in PMLA, Representations, MLQ, and Cultural Analytics. Underwood earned his PhD in English from Cornell University.

Updated on
Backto the news archive

Related News

iSchool researchers present at inaugural ASIS&T symposium

iSchool researchers will present their work at the Association for Information Science & Technology (ASIS&T) Midwest Chapter Spring Symposium on April 26. The inaugural symposium will include talks by seventeen researchers from ten institutions across the Midwest region.

New EU legislation has iSchool connection

Thanks to new European Union (EU) legislation, those who perform on-demand work through an app or website, such as DoorDash or Uber, will enjoy better working conditions. PhD student Zachary Kilhoffer, who spent four years working as a researcher for the Centre for European Policy Studies (CEPS) in Brussels prior to entering the iSchool's doctoral program, authored or co-authored several policy research pieces that informed the creation of the EU Platform Work Directive.

Zak Kilhoffer

Undergraduate Research Symposium features iSchool researchers

Several iSchool undergraduate students will participate in the 17th annual Undergraduate Research Symposium. During the event, visitors will learn about undergraduate research projects through oral and poster presentations, creative performances, and art exhibits. All are welcome to attend the symposium, which will be held on April 25 from 9:00 a.m.-5:00 p.m. in the Illini Rooms and South Lounge of the Illini Union. 

iSchool researchers present at iConference 2024

The following iSchool faculty and students participated in the virtual portion of iConference 2024 from April 15-18. The in-person portion of the conference will be held in Changchun, China, from April 22-26. The theme of this year’s conference is "Wisdom, Well-being, Win-win."

Wegrzyn awarded SMART Scholarship

PhD student Emily Wegrzyn has been selected for the prestigious Science, Mathematics, and Research for Transformation (SMART) Scholarship-for-Service Program, which is funded by the Department of Defense. The primary aim of this program is to increase the number of civilian engineers and scientists in the U.S. 

 Emily Wegrzyn