School of Information Sciences

HathiTrust Research Center extends non-consumptive research tools to copyrighted materials

The HathiTrust Research Center (HTRC) has extended non-consumptive research tools to copyrighted materials, expanding research through fair use. HTRC is a collaboration between the University of Illinois, Indiana University, and the HathiTrust to enable advanced computational access to the HathiTrust Digital Library database.

Since 2011, HTRC has been developing services and tools that allow researchers to apply text and data mining methodologies to the HathiTrust collection. Until now, this service was available only for the portion of the collection that is out of copyright. With the development of a landmark HathiTrust policy and an updated release of HTRC Analytics, HTRC now provides access to the text of the complete 16.7-million-item HathiTrust corpus, including items protected by copyright, for non-consumptive research such as data mining and computational analysis.

This extraordinary opportunity to use copyrighted materials for non-consumptive research purposes expands research access to the entire HathiTrust digital collection, which is sustained by HathiTrust's 140+ member libraries. Researchers may access HTRC's easy-to-use computational tools, which are ideal for beginners, as well as more complex tools designed to meet advanced data analysis needs.

A primary goal of HathiTrust is to enable the widest possible lawful research and educational uses of the HathiTrust collection. In recent years, U.S. courts have recognized the solid legal basis for non-consumptive research on copyrighted materials. In 2016, HathiTrust established a working group to develop the Non-Consumptive Use Research Policy to ensure the responsible research use of copyrighted items.

The policy is now enacted in an updated release of HTRC Analytics, which allows researchers to conduct computational text analysis on copyrighted items as permitted under U.S. copyright law. Non-consumptive research use does not change the legal status of items protected under copyright.

"My HTRC colleagues at both Illinois and Indiana should be very proud of their great accomplishment," said HTRC Codirector J. Stephen Downie, iSchool professor and associate dean for research. "Providing non-consumptive access to all of HathiTrust's nearly 17 million volumes will help scholars and students to uncover the secrets buried within to the benefit of us all."

Read the HTRC press release.



