Data Analytics

Using computational methods to transform both structured and unstructured data into actionable knowledge. 

Researchers Working in this Area

Related Research Projects

Analysis of Academic Activity Patterns in Academic Literature

Time frame
2017-2017
Investigators
Jana Diesner, Vetle Torvik
Total funding to date
$25,000.00
Funding agency
Korea Institute of Science and Technology Information

The project team will work on extracting key concepts from scholarly publications and explore techniques for building a taxonomy of extracted concepts by leveraging open knowledge bases (e.g., Wikipedia). The outcome of this process will be evaluated for various science and technology knowledge platform-based analysis services. The techniques, which reduce semantic ambiguity, will analyze…

Big Data-Theoretic Approach to Quantify Organizational Failure Mechanisms in Probabilistic Risk Assessment

Time frame
2015-Present
Investigators
Zahra Mohaghegh, Catherine Blake
Total funding to date
$899,663.00
Funding agency
National Science Foundation

Catastrophic events such as Fukushima and Katrina have made it clear that integrating physical and social causes of failure into a cohesive modeling framework is critical in order to prevent complex technological accidents and to maintain public safety and health. In this research, experts in Probabilistic Risk Assessment (PRA), Organizational Behavior and Information Science and Data…

Impact of Data Quality and Provenance

Time frame
2014-2017
Investigator
Jana Diesner
Total funding to date
$130,475.00
Funding agency
Korea Institute of Science and Technology Information

How do limitations and intransparencies in data quality and data provenance bias research outcomes, and how can we detect and mitigate these limitations? For example, we have been investigating the impact of entity resolution errors on network analysis results. We found that commonly reported network metrics and derived implications can strongly deviate from the truth—as established based on…

Improving Patient Outcomes by Listening to Their Social Media Communications

Time frame
2017-Present
Investigator
Ian Brooks
Total funding to date
$15,000.00
Funding agency
Homecare Education Advocacy & Resource Team Support

It is difficult to understand the effectiveness of various treatment options when a huge number of external factors such as lifestyle, diet, and environment affect the burden of a disease. A major barrier to understanding is the challenge of scale—sampling enough patients to separate the major, minor, and negligible factors. With access to a database of more than one trillion public social…

social media icons

INDICATOR: An Information System for Monitoring the Health of a Community

Time frame
2007-2011
Investigator
Ian Brooks
Total funding to date
$300,723.00
Funding agency
Centers for Disease Control and Prevention, U.S. Department of Agriculture, Carle Foundation

INDICATOR is a novel information system for collecting, integrating, and analyzing data from multiple sources to provide public health decision makers real-time data on the health of their community. Data comes from sources as varied as emergency department visits, school attendance, veterinary clinics, and social media postings and together have been used to change public policy in outbreak…

INDICATOR

National Forum Data Mining Research Using In Copyright and Limited Access Text Datasets Shaping a Research and Implementation Agenda for Researchers Libraries and Content Providers

Time frame
2017-Present
Investigator
Bertram Ludäscher
Total funding to date
$99,536.00
Funding agency
Institute of Museum and Library Services

Copyright law and resource licensing complicate the application of text data mining for research. This project convened a National Forum on Text Data Mining with Use-Limited Data in April 2018 that brought together 25 leading stakeholders selected among researchers, librarians, content providers, legal experts, and representatives of scholarly societies to articulate an agenda that provides…

text mining icon

Natural Language Processing for Building and Enhancing Graph Data and Theory

Investigator
Jana Diesner

How can we use user-generated content to construct, infer or refine network data? We have been tackling this problem by leveraging communication content produced and disseminated in social networks to enhance graph data. For example, we have used domain-adjusted sentiment analysis to label graphs with valence values in order to enable triadic balance assessment. The resulting method enables…

NCSA Faculty Fellowship: Predictive Modeling for Impact Assessment

Time frame
2015-2016
Investigator
Jana Diesner
Total funding to date
$24,323.00
Funding agency
National Center for Supercomputing Applications

Assistant Professor Jana Diesner a received an Faculty Fellowship and seed funding for her project, “Predictive Modeling for Impact Assessment,” from the National Center for Supercomputing Applications (NCSA). Diesner collaborates closely with NCSA scientists on the project, which builds on her work developing computational solutions…

Pathtracker: A smartphone-based system for mobile infectious disease detection and epidemiology

Time frame
2015-Present
Investigator
Ian Brooks
Total funding to date
$1,005,692.00
Funding agency
National Science Foundation

This project will develop a mobile sensor technology for performing detection and identification of viral and bacterial pathogens. By means of a smartphone-based detection instrument, the results are shared with a cloud-based data management service that will enable physicians to rapidly visualize the geographical and temporal spread of infectious disease. When deployed by a community of…

RIDIR: Collaborative Research: Developing and Deploying SKOPE - A Resource for Synthesizing Knowledge of Past Environments

Time frame
2016-Present
Investigator
Bertram Ludäscher
Total funding to date
$884,627.00
Funding agency
National Science Foundation

Recent research has demonstrated that investigations of contemporary societal problems can benefit from the use of long-term environmental data and from comparisons with cases in which the interactions of human societies with their environments is well-documented over centuries. By providing easy access to time- and place-specific long-term environmental data, this project seeks to facilitate…

Pueblo village

Related News Articles