Downie to discuss HTRC findings at Harvard Library

Stephen Downie
J. Stephen Downie, Professor, Associate Dean for Research, and Co-Director of the HathiTrust Research Center

Professor and Associate Dean for Research J. Stephen Downie will present his recent work with the HathiTrust Research Center (HTRC) on April 30 at Harvard Library. Downie is codirector of HTRC, a collaboration between the University of Illinois, Indiana University, and the HathiTrust to enable advanced computational access to text found in the HathiTrust (HT) Digital Library.

His talk, "Creating Universal Open Access to Closed Textual Data at Scale: Use Cases from the HathiTrust Research Center," will discuss how the HTRC is creating a set of non-consumptive research services to make HT Digital Library volumes that are under copyright restrictions more open and useful to scholars.

"The creation and publication of the HTRC 'Extracted Features' (EF) dataset provides unigram counts and Part-of-Speech (POS) information for each of the 5.6 billion pages in the HT Digital Library," explained Downie. "In my talk, I will introduce two uses cases that leverage the EF dataset: the 'HathiTrust + Bookworm' visualization and analysis tool; and the Workset Building environment developed to provide researchers fine-grained access to the entire HT collection (both public domain and in-copyright) via the EF dataset."

Downie leads the HathiTrust + Bookworm text analysis project, which is creating tools to visualize the evolution of term usage over time. He also is the principal investigator on the Workset Creation for Scholarly Analysis + Data Capsules project, which integrates workset models and tools, and he represents the HTRC on the Novel(TM) text mining project as well as the Single Interface for Music Score Searching and Analysis project. All of these projects strive to provide large-scale analytic access to copyright-restricted cultural data.

Research Areas:
Updated on
Backto the news archive

Related News

New book explores how AI is reshaping cultural heritage

Glen Layne-Worthey, associate director for research support services for the HathiTrust Research Center (HTRC), and J. Stephen Downie, professor and HTRC co-director, have edited a new book, Navigating Artificial Intelligence for Cultural Heritage Organisations, which was recently released by UCL Press. 

Jung to join the faculty

The iSchool is pleased to announce that Yonghan Jung will join the faculty as an assistant professor in August 2025, pending approval by the University of Illinois Board of Trustees. 

Yonghan Jung

Aubin Le Quéré to join the faculty

The iSchool is pleased to announce that Marianne Aubin Le Quéré will join the faculty as an assistant professor in August 2026, pending approval by the University of Illinois Board of Trustees. Aubin Le Quéré is a PhD candidate in the Department of Information Science at Cornell University. For the 2025-2026 academic year, she will be a postdoctoral fellow at Princeton University's Center for Information Technology Policy.

Marianne Aubin Le Quere

New project improves accessibility of health information through AI

Assistant Professor Yue Guo has received a $30,000 Arnold O. Beckman Research Award from the U of I Campus Research Board for her project, "Optimizing Personalization in Plain Language Summaries: Comparing Predictive and Interactive Approaches for Tailored Health Information." 

Yue Guo