Projects

Mapping Information Access

Time frame
2013-Present
Investigator
Emily Knox

Mapping Information Access is a collaborative academic research project to study and understand the landscape of information access and availability in public schools and libraries in the United States.

There are more than 18,000 public school districts and more than 9,000 public library systems in the US. Each of these institutions serves as a central node of information access for the…

Metadata Poems

Time frame
2017-Present
Investigator
Bonnie Mak

Although standardized vocabularies and languages are often invoked as a way to ensure interoperability in the management of informational resources, these conventions prioritize particular ways of representing the world. This project situates metadata as an infrastructure of information, and examines how such descriptive practices have configured the production of knowledge for centuries—from…

National Forum: Data Mining Research Using In-Copyright and Limited-Access Text Datasets: Shaping a Research and Implementation Agenda for Researchers, Libraries, and Content Providers

Time frame
2017-Present
Investigator
Bertram Ludäscher
Total funding to date
$99,536.00
Funding agency
Institute of Museum and Library Services

Copyright law and resource licensing complicate the application of text data mining for research. This project convened a National Forum on Text Data Mining with Use-Limited Data in April 2018 that brought together 25 leading stakeholders selected among researchers, librarians, content providers, legal experts, and representatives of scholarly societies to articulate an agenda that provides…

Pathtracker: A smartphone-based system for mobile infectious disease detection and epidemiology

Time frame
2015-Present
Investigator
Ian Brooks
Total funding to date
$1,005,692.00
Funding agency
National Science Foundation

This project will develop a mobile sensor technology for performing detection and identification of viral and bacterial pathogens. By means of a smartphone-based detection instrument, the results are shared with a cloud-based data management service that will enable physicians to rapidly visualize the geographical and temporal spread of infectious disease. When deployed by a community of…

Research Software Credit and Sustainability

Time frame
2014-Present
Investigator
Daniel Katz

An ever-increasing fraction of research depends on software, much of it developed in academia. But its developers are often not recognized or rewarded for their contributions within academic systems. In addition to recognition, resources are needed to sustain research software: to continue to make it available in the future, on new platforms, meeting new needs. This project examines both…

RIDIR: Collaborative Research: Developing and Deploying SKOPE - A Resource for Synthesizing Knowledge of Past Environments

Time frame
2016-Present
Investigator
Bertram Ludäscher
Total funding to date
$884,627.00
Funding agency
National Science Foundation

Recent research has demonstrated that investigations of contemporary societal problems can benefit from the use of long-term environmental data and from comparisons with cases in which the interactions of human societies with their environments are well documented over centuries. By providing easy access to time- and place-specific long-term environmental data, this project seeks to facilitate…

Single Interface for Music Score Searching and Analysis

Time frame
2015-Present
Investigator
J. Stephen Downie
Total funding to date
$15,000.00
Funding agency
Social Sciences and Humanities Research Council of Canada

Music prints and manuscripts created over the past thousand years sit on the shelves of libraries and museums around the globe. As these organizations digitize their collections, images of these scores are increasingly accessible online. However, the musical content remains difficult to search.

Google Books and HathiTrust have already made it possible to search the content of text…

Text Mining Pipeline to Accelerate Systematic Reviews in Evidence-Based Medicine

Time frame
2017-2020
Investigator
Jodi Schneider
Funding agency
National Institutes of Health

We hypothesize that a flexible, configurable suite of automated informatics tools can significantly reduce the effort needed to generate systematic reviews of medical literature while maintaining or even improving their quality. To test this hypothesis, we propose:
Aim 1. To extend our research on automated RCT tagging to include additional study types and provide public resources.…

Text Mining the Novel: Establishing the Foundations of a New Discipline

Time frame
2014-Present
Investigators
Ted Underwood, J. Stephen Downie
Total funding to date
$142,000.00
Funding agency
Social Sciences and Humanities Research Council of Canada

This HathiTrust Research Center (HTRC) project seeks to produce the first large-scale cross-cultural study of the novel according to quantitative methods. Ever since its putative rise in the eighteenth century, the novel has emerged as a central means of expressing what it means to be modern. And yet despite this cultural significance, we still lack a comprehensive study of the novel’s place…

TextTransfer: Corpus-Based Detection of Secondary Use of Scientific Publications

Time frame
2017-Present
Investigator
Jana Diesner
Total funding to date
$112,667.00
Funding agency
Federal Ministry of Education and Research and the Institute for German Language (IDS) in Germany

In this collaborative project, we are using Natural Language Processing and Machine Learning to identify secondary practical uses of research findings from final reports of grant-funded work. Such reports are often stored in specialized databases, where long-term archiving activities focus on standardization, interoperability, and information indexing and retrieval. However, secondary use of…
