School of Information Sciences

Downie to discuss HTRC findings at Harvard Library

J. Stephen Downie, Professor, Executive Associate Dean, and Co-Director of the HathiTrust Research Center

Professor and Associate Dean for Research J. Stephen Downie will present his recent work with the HathiTrust Research Center (HTRC) on April 30 at Harvard Library. Downie is co-director of the HTRC, a collaboration between the University of Illinois, Indiana University, and the HathiTrust to enable advanced computational access to text found in the HathiTrust (HT) Digital Library.

His talk, "Creating Universal Open Access to Closed Textual Data at Scale: Use Cases from the HathiTrust Research Center," will discuss how the HTRC is creating a set of non-consumptive research services to make HT Digital Library volumes that are under copyright restrictions more open and useful to scholars.

"The creation and publication of the HTRC 'Extracted Features' (EF) dataset provides unigram counts and Part-of-Speech (POS) information for each of the 5.6 billion pages in the HT Digital Library," explained Downie. "In my talk, I will introduce two use cases that leverage the EF dataset: the 'HathiTrust + Bookworm' visualization and analysis tool; and the Workset Building environment developed to provide researchers fine-grained access to the entire HT collection (both public domain and in-copyright) via the EF dataset."

Downie leads the HathiTrust + Bookworm text analysis project, which is creating tools to visualize the evolution of term usage over time. He is also the principal investigator on the Workset Creation for Scholarly Analysis + Data Capsules project, which integrates workset models and tools, and he represents the HTRC on the NovelTM text mining project as well as the Single Interface for Music Score Searching and Analysis project. All of these projects strive to provide large-scale analytic access to copyright-restricted cultural data.


