HTRC

IN THE NEWS

Feb. 16, 2018
Professor Downie gives an update on HTRC

Over 140 people attended the HathiTrust Research Center (HTRC) UnCamp, hosted by the University of California, Berkeley Libraries, on January 25 and 26. In addition to keynotes focused on methodologies of text and data mining, researchers from the fields of information science, digital libraries, literary history, digital pedagogy, and the history of social movements presented their work and its intersection with the HathiTrust Digital Library. Slides and notes from the presentations are available on the UnCamp website.

iSchool-affiliated presentations included:

"Consistency and Confidence in the Million-Book Library"
Keynote presented by...

Feb. 15, 2018

While the issue of gender equality is more prevalent in modern times than in the Victorian era, a new study shows that in literature, the number of women characters and women authors has declined rather than grown over the years. Professor Ted Underwood led the research, which used machine learning to analyze the presentation of gender in more than 100,000 novels from 1703 to 2009 in the HathiTrust Digital Library. 

According to Underwood, "By 1960, women had lost half the space they occupied in nineteenth-century fiction, even though gender roles had become more flexible."

He and his fellow researchers, David Bamman, assistant professor of information science at the University of California, Berkeley, and Sabrina Lee, a graduate student in English at Illinois, recently published their findings, "The Transformation of Gender in English-Language...

Nov. 21, 2017

Professor and Associate Dean for Research J. Stephen Downie was a keynote speaker for the 7th Rizal Library International Conference, held November 16-18 at Ateneo de Manila University in Quezon City, Philippines. The theme of the conference was "CLICK! Connecting Libraries, Information, and Community Knowledge."

Downie gave the presentation, "HathiTrust Research Center: Text mining the very big data of the HathiTrust Digital Library." HathiTrust Digital Library is a partnership of more than 100 university and public libraries, which has amassed a collection of over 15 million volumes and 5.5 billion pages. While researchers are applying data mining and text analysis techniques to reveal new knowledge buried within the collection, roughly 10 million volumes are under copyright restrictions and cannot be shared directly with researchers.

In his talk, Downie, codirector of the...

Oct. 25, 2017

The HathiTrust Research Center (HTRC) will host its 2018 UnCamp on January 25-26 at the University of California, Berkeley. The primary venue will be the newly renovated Moffitt Library with breakout events in nearby campus locations, including the Berkeley Institute for Data Science, Morrison Library, D-Lab in Barrows Hall, and Academic Innovation Studio. Registration is now open.

UnCamp brings together digital humanities researchers and tool developers as well as librarians and graduate students. It combines hands-on coding and demonstrations, inspirational use-case studies, lightning talks, and breakout sessions—all structured in the dynamic setting of a participant-driven, unconference programming format. This year's event will feature keynote presentations about the IMLS-funded project Aida (Image...

Aug. 4, 2017

The iSchool at Illinois is involved in a partnership that has received a research grant from the Institute of Museum and Library Services for an extension of the Data Capsule service, which enables remote access by the HathiTrust Digital Library to other collections managed by research libraries. The partnership is led by the School of Informatics and Computing at Indiana University. 
  
As the volume of digital content has expanded exponentially over the past several years, researchers and educators have recognized the potential of big data techniques to analyze, access, and organize digital scholarly collections. The Data Capsule service, which was developed for use in the HathiTrust Research Center (HTRC), creates virtual computers for users to access a restricted collection. Within HTRC, the Data Capsule service is used for non-consumptive analytics, which allow the computer to analyze the text but doesn’...

Jun. 12, 2017

The iSchool is co-organizing a workshop on digital scholarship with Beijing Institute of Technology (BIT) Library on June 14-16 in Beijing. The workshop, Digital Scholarship Centers: Building Library Services for Data-Driven Scholarship, will instruct participants in library service models for digital scholarship and discuss concepts in digital humanities and computational social science. Dean Allen Renear will give opening remarks. Other iSchool presenters include J. Stephen Downie, professor and codirector of the HathiTrust Research Center (HTRC); Peter Organisciak (PhD '15), postdoctoral research associate; Eleanor Dickson, visiting HTRC digital humanities specialist; and Nic Weber (PhD '15), assistant professor at the University of Washington.

Downie will give the talks:

  • "Text Mining Concepts and Methods: HTRC and Non-Consumptive Research"
  • "Quick and Painless Introduction to Machine Learning"
  • "WEKA Machine Learning Tools: A Friendly...

May 30, 2017

J. Stephen Downie, professor and associate dean for research, has been named a National Center for Supercomputing Applications (NCSA) Faculty Fellowship awardee for the 2017-18 academic year. Faculty Fellows work with NCSA on specific projects aimed at helping to solve grand challenges facing all people, including deep learning, the internet of things, data analysis, volcano activity, and more.

Downie’s project is titled, “Modeling the Massive HathiTrust Corpus: Creating Concept-Based Representations of 15 Million Volumes.” Through this research, he hopes to make the HathiTrust collection—15 million books spanning multiple centuries—available for large-scale research use through optimized,...
