School of Information Sciences

Downie to discuss HTRC findings at Harvard Library

Stephen Downie
J. Stephen Downie, Professor, Executive Associate Dean, and Co-Director of the HathiTrust Research Center

Professor and Associate Dean for Research J. Stephen Downie will present his recent work with the HathiTrust Research Center (HTRC) on April 30 at Harvard Library. Downie is codirector of HTRC, a collaboration between the University of Illinois, Indiana University, and the HathiTrust to enable advanced computational access to text found in the HathiTrust (HT) Digital Library.

His talk, "Creating Universal Open Access to Closed Textual Data at Scale: Use Cases from the HathiTrust Research Center," will discuss how the HTRC is creating a set of non-consumptive research services to make HT Digital Library volumes that are under copyright restrictions more open and useful to scholars.

"The creation and publication of the HTRC 'Extracted Features' (EF) dataset provides unigram counts and Part-of-Speech (POS) information for each of the 5.6 billion pages in the HT Digital Library," explained Downie. "In my talk, I will introduce two uses cases that leverage the EF dataset: the 'HathiTrust + Bookworm' visualization and analysis tool; and the Workset Building environment developed to provide researchers fine-grained access to the entire HT collection (both public domain and in-copyright) via the EF dataset."

Downie leads the HathiTrust + Bookworm text analysis project, which is creating tools to visualize the evolution of term usage over time. He also is the principal investigator on the Workset Creation for Scholarly Analysis + Data Capsules project, which integrates workset models and tools, and he represents the HTRC on the Novel(TM) text mining project as well as the Single Interface for Music Score Searching and Analysis project. All of these projects strive to provide large-scale analytic access to copyright-restricted cultural data.

Research Areas:
Updated on
Backto the news archive

Related News

New multi-institutional project to use AI to represent past historical periods

A new project led by a team of researchers from four universities aims to create and evaluate language models that represent past historical periods. The project, "Artificial Intelligence for Cultural and Historical Reasoning," was recently selected for a 2025 Humanities and AI Virtual Institute (HAVI) award from Schmidt Sciences. The $800,000 grant will be split among four institutions: Cornell University, the University of Illinois Urbana-Champaign, The University of British Columbia, and McGill University. Professor Ted Underwood will serve as the principal investigator for the portion of the project at Illinois.

Ted Underwood

Wang group to present at WSDM26

Professor and Associate Dean for Research Dong Wang and PhD student Ruohan Zong will present their research at the 19th ACM International Conference on Web Search and Data Mining (WSDM 26), which will be held from February 22–26 in Boise, Idaho. WSDM is a premier international conference in web search, data mining, and AI, known for its highly selective acceptance rates. This year, the acceptance rate for the main track of the conference was only 16 percent. 

Dong Wang

New NSF award supports innovative role-playing game approach to strengthening research security in academia

A new National Science Foundation (NSF) award will support an innovative effort in the School of Information Sciences to strengthen research security by using structured role-playing games (RPG) to model the threats facing academic research environments. The project, titled "REDTEAM: Research Environment Defense Through Expert Attack Modeling," addresses a growing challenge: balancing the open, collaborative nature of academic research with increasing national security risks and sophisticated adversarial threats. 

Wang appointed associate dean for research

The iSchool is pleased to announce that Professor Dong Wang has been appointed associate dean for research. In this role, Wang will provide leadership in the support, integration, communication, and administration of the iSchool's research and scholarship endeavors. This includes supervising the iSchool's Research Services unit, supporting the research centers, and assisting faculty in the acquisition of research funding.

Dong Wang

Knox authors new edition of Book Banning

The second edition of Interim Dean and Professor Emily Knox's book, Book Banning in 21st Century America, was recently released by Bloomsbury. The first edition, published by Rowman & Littlefield (now Bloomsbury) in 2015, was the first monograph in the Beta Phi Mu Scholars' Series. The new edition examines 25 contemporary cases of book challenges in schools and public libraries across the United States and breaks down how and why reading practices can lead to censorship.

"Book Banning in 21st Century America" by Emily Knox

School of Information Sciences

501 E. Daniel St.

MC-493

Champaign, IL

61820-6211

Voice: (217) 333-3280

Email: ischool@illinois.edu

Back to top