Downie to discuss HTRC findings at Harvard Library

Stephen Downie
J. Stephen Downie, Professor, Associate Dean for Research, and Co-Director of the HathiTrust Research Center

Professor and Associate Dean for Research J. Stephen Downie will present his recent work with the HathiTrust Research Center (HTRC) on April 30 at Harvard Library. Downie is codirector of HTRC, a collaboration between the University of Illinois, Indiana University, and the HathiTrust to enable advanced computational access to text found in the HathiTrust (HT) Digital Library.

His talk, "Creating Universal Open Access to Closed Textual Data at Scale: Use Cases from the HathiTrust Research Center," will discuss how the HTRC is creating a set of non-consumptive research services to make HT Digital Library volumes that are under copyright restrictions more open and useful to scholars.

"The creation and publication of the HTRC 'Extracted Features' (EF) dataset provides unigram counts and Part-of-Speech (POS) information for each of the 5.6 billion pages in the HT Digital Library," explained Downie. "In my talk, I will introduce two uses cases that leverage the EF dataset: the 'HathiTrust + Bookworm' visualization and analysis tool; and the Workset Building environment developed to provide researchers fine-grained access to the entire HT collection (both public domain and in-copyright) via the EF dataset."

Downie leads the HathiTrust + Bookworm text analysis project, which is creating tools to visualize the evolution of term usage over time. He also is the principal investigator on the Workset Creation for Scholarly Analysis + Data Capsules project, which integrates workset models and tools, and he represents the HTRC on the Novel(TM) text mining project as well as the Single Interface for Music Score Searching and Analysis project. All of these projects strive to provide large-scale analytic access to copyright-restricted cultural data.

Research Areas:
Updated on
Backto the news archive

Related News

Tilley to serve on Lynd Ward Prize jury

Associate Professor Carol Tilley has been selected to serve as a judge for the 2022 Lynd Ward Graphic Novel Prize, which is presented to the best graphic novel, fiction or nonfiction, published in the previous year by a living U.S. or Canadian citizen or resident. The annual award is sponsored by Penn State University Libraries and administered by the Pennsylvania Center for the Book, an affiliate of the Center for the Book at the Library of Congress.

Carol Tilley

iSchool researchers receive funding for napari plugin project

A new project led by Assistant Professor Matthew Turk is among the napari plugin projects that have recently received support from the Chan Zuckerberg Initiative (CZI) in its effort to advance bioimaging technologies. Visiting Research Scientist Christopher Havlin will serve as co-principal investigator on the project, "Enabling Access To Multi-resolution Data."

Matthew Turk

New project focuses on rare categories

Associate Professor Jingrui He has been awarded a three-year, $500,000 grant from the National Science Foundation (NSF) to develop explainable techniques to detect and track rare categories. For her project, "RareXplain: A Computational Framework for Explainable Rare Category Analysis," she will focus on real-world problems where underrepresented, rare (abnormal) examples play critical roles, such as defective silicon wafers resulting from a new semiconductor manufacturing process and rare but severe complications (e.g., kidney failure) among diabetes patients.

Jingrui He

iSchool researchers present at virtual CIRN conference

iSchool researchers presented their work at the 19th annual Community Informatics Research Network (CIRN) Conference on November 8-12. The theme of this year's conference was "Communities, Technology and This Moment." CIRN 2021 explored how researchers and practitioners ethically collect information, including what happens when community information is deliberately not collected and how information systems can be designed "in harmony with communities."

Lueg to join iSchool faculty

The iSchool is pleased to announce that Christopher Lueg will join the faculty as a professor in January 2022. He is currently a professor of medical informatics at the Bern University of Applied Sciences in Biel/Bienne, Switzerland.

Christopher Lueg