School of Information Sciences

Downie to discuss HTRC findings at Harvard Library

Stephen Downie
J. Stephen Downie, Professor, Executive Associate Dean, and Co-Director of the HathiTrust Research Center

Professor and Associate Dean for Research J. Stephen Downie will present his recent work with the HathiTrust Research Center (HTRC) on April 30 at Harvard Library. Downie is codirector of HTRC, a collaboration between the University of Illinois, Indiana University, and the HathiTrust to enable advanced computational access to text found in the HathiTrust (HT) Digital Library.

His talk, "Creating Universal Open Access to Closed Textual Data at Scale: Use Cases from the HathiTrust Research Center," will discuss how the HTRC is creating a set of non-consumptive research services to make HT Digital Library volumes that are under copyright restrictions more open and useful to scholars.

"The creation and publication of the HTRC 'Extracted Features' (EF) dataset provides unigram counts and Part-of-Speech (POS) information for each of the 5.6 billion pages in the HT Digital Library," explained Downie. "In my talk, I will introduce two uses cases that leverage the EF dataset: the 'HathiTrust + Bookworm' visualization and analysis tool; and the Workset Building environment developed to provide researchers fine-grained access to the entire HT collection (both public domain and in-copyright) via the EF dataset."

Downie leads the HathiTrust + Bookworm text analysis project, which is creating tools to visualize the evolution of term usage over time. He also is the principal investigator on the Workset Creation for Scholarly Analysis + Data Capsules project, which integrates workset models and tools, and he represents the HTRC on the Novel(TM) text mining project as well as the Single Interface for Music Score Searching and Analysis project. All of these projects strive to provide large-scale analytic access to copyright-restricted cultural data.

Research Areas:
Updated on
Backto the news archive

Related News

Dahlen selected as juror for 2026 Kirkus Prize

Associate Professor Sarah Park Dahlen has been selected as one of six jurors for the 2026 Kirkus Prize, given annually in the categories of fiction, nonfiction, and young readers' literature. The prize is one of the richest in the literary world, with awards of $50,000 in each category.

Sarah Park Dahlen

Liu receives support for AI project through NVIDIA Academic Grant Program

Assistant Professor Yaoyao Liu has been awarded a grant through the NVIDIA Academic Grant Program. NVIDIA, a world leader in accelerated computing and AI, established the program to advance academic research by providing world-class computing access and resources to researchers. Liu has received 32,000 A100 GPU-hours on Brev, an AI and machine learning platform that empowers developers to run, build, train, deploy, and scale AI models with GPU in the cloud. 

Yaoyao Liu

New app designed to improve conference experience

A new app developed by Associate Professor Yun Huang aims to make navigating conferences less work and more fun, so that attendees can meet others, discover fresh ideas, and "experience academic life as an exciting adventure." The app, PapersClaw.fun, will debut at the ACM Conference on Human Factors in Computing Systems (CHI 2026), which will be held from April 13-17 in Barcelona, Spain.

Yun Huang

Seo selected as CAS Beckman Fellow

Assistant Professor JooYoung Seo has been selected as a Center for Advanced Study (CAS) Beckman Fellow for the 2026-2027 academic year. CAS is one of the most prestigious faculty recognition programs at the University of Illinois. Its primary mission is to identify and support the most productive and innovative faculty across all disciplines. CAS Fellows are nominated by their unit heads and selected by the Center's permanent faculty through a competitive review process, with final approval by the Board of Trustees. 

JooYoung Seo

iSchool participation in iConference 2026

The following iSchool faculty and students will participate in iConference 2026, which will be held virtually from March 23–26 and physically from March 29–April 2 in Edinburgh, Scotland. The theme of this year's conference is "Information Literacies, Authenticity and Use: The Move Towards a Digitally Enlightened Society."

School of Information Sciences

501 E. Daniel St.

MC-493

Champaign, IL

61820-6211

Voice: (217) 333-3280

Email: ischool@illinois.edu

Back to top