School of Information Sciences

Downie to discuss HTRC findings at Harvard Library

J. Stephen Downie, Professor, Executive Associate Dean, and Co-Director of the HathiTrust Research Center

Professor and Associate Dean for Research J. Stephen Downie will present his recent work with the HathiTrust Research Center (HTRC) on April 30 at Harvard Library. Downie is codirector of HTRC, a collaboration between the University of Illinois, Indiana University, and the HathiTrust to enable advanced computational access to text found in the HathiTrust (HT) Digital Library.

His talk, "Creating Universal Open Access to Closed Textual Data at Scale: Use Cases from the HathiTrust Research Center," will discuss how the HTRC is creating a set of non-consumptive research services to make HT Digital Library volumes that are under copyright restrictions more open and useful to scholars.

"The creation and publication of the HTRC 'Extracted Features' (EF) dataset provides unigram counts and Part-of-Speech (POS) information for each of the 5.6 billion pages in the HT Digital Library," explained Downie. "In my talk, I will introduce two use cases that leverage the EF dataset: the 'HathiTrust + Bookworm' visualization and analysis tool; and the Workset Building environment developed to provide researchers fine-grained access to the entire HT collection (both public domain and in-copyright) via the EF dataset."

Downie leads the HathiTrust + Bookworm text analysis project, which is creating tools to visualize the evolution of term usage over time. He is also the principal investigator on the Workset Creation for Scholarly Analysis + Data Capsules project, which integrates workset models and tools, and he represents the HTRC on the NovelTM text mining project as well as the Single Interface for Music Score Searching and Analysis project. All of these projects strive to provide large-scale analytic access to copyright-restricted cultural data.



