Downie to discuss HTRC findings at Harvard Library

Stephen Downie
J. Stephen Downie, Professor, Associate Dean for Research, and Co-Director of the HathiTrust Research Center

Professor and Associate Dean for Research J. Stephen Downie will present his recent work with the HathiTrust Research Center (HTRC) on April 30 at Harvard Library. Downie is codirector of HTRC, a collaboration between the University of Illinois, Indiana University, and the HathiTrust to enable advanced computational access to text found in the HathiTrust (HT) Digital Library.

His talk, "Creating Universal Open Access to Closed Textual Data at Scale: Use Cases from the HathiTrust Research Center," will discuss how the HTRC is creating a set of non-consumptive research services to make HT Digital Library volumes that are under copyright restrictions more open and useful to scholars.

"The creation and publication of the HTRC 'Extracted Features' (EF) dataset provides unigram counts and Part-of-Speech (POS) information for each of the 5.6 billion pages in the HT Digital Library," explained Downie. "In my talk, I will introduce two uses cases that leverage the EF dataset: the 'HathiTrust + Bookworm' visualization and analysis tool; and the Workset Building environment developed to provide researchers fine-grained access to the entire HT collection (both public domain and in-copyright) via the EF dataset."

Downie leads the HathiTrust + Bookworm text analysis project, which is creating tools to visualize the evolution of term usage over time. He also is the principal investigator on the Workset Creation for Scholarly Analysis + Data Capsules project, which integrates workset models and tools, and he represents the HTRC on the Novel(TM) text mining project as well as the Single Interface for Music Score Searching and Analysis project. All of these projects strive to provide large-scale analytic access to copyright-restricted cultural data.

Research Areas:
Updated on
Backto the news archive

Related News

Cheng defends dissertation

Doctoral candidate Jessica Cheng successfully defended her dissertation, "Agreeing to Disagree: Applying a Logic-based Approach to Reconciling and Merging Multiple Taxonomies," on May 25. 

Jessica Cheng

Brooks presents keynote at West African conference

Ian Brooks, iSchool research scientist and director of the Center for Health Informatics (CHI), gave a keynote talk at the West Africa Conference on Digital Public Goods and Cybersecurity, which was held on May 9-10 in Freetown, Sierra Leone. The conference focused on bridging the gender gap in digital public goods and cybersecurity spaces in Africa.

Ian Brooks

New project to help identify and predict insider threats

Insider threats are one of the top security concerns facing large organizations. Current and former employees, business partners, contractors—anyone with the right level of access to a company’s data—can pose a threat. The incidence of insider threats has increased in recent years, at a significant cost to companies. Associate Professor Jingrui He is addressing this problem in a new project that seeks to detect and predict insider threats. She has been awarded a three-year, $200,000 grant from the C3.ai Digital Transformation Institute for her project, "Multi-Facet Rare Event Modeling of Adaptive Insider Threats."

Jingrui He

iSchool students present their research at Urbana City Council meeting

At the Urbana City Council meeting on May 9, students in the Community Data (IS 594) course presented their research on how communities are reducing gun violence. According to their instructor Chamee Yang, postdoctoral research associate with the iSchool, Community Data Clinic, and Just Infrastructures Initiative, the new course was designed as an experiential learning opportunity with a community engagement component, where students could gain research experience with real-world implications. Throughout the Spring 2022 semester, students worked in groups to explore community-driven approaches to prevent gun violence.

Chamee Yang, Sarah Unruh, and Gowri Balasubramaniam

Dinh defends dissertation

Doctoral candidate Ly Dinh successfully defended her dissertation, "Advances to Network Analysis Theories and Methods for the Understanding of Formal and Emergent Structures in Interpersonal, Corporate/Organizational, and Hazards Response Setting," on May 19.

Ly Dinh