Data Analytics and Human Centered Data

Using computational methods to transform both structured and unstructured data into actionable knowledge, and to understand and enable how humans explore and gain insight from vast data sets.

Related Research Projects

Analysis of Academic Activity Patterns in Academic Literature

Time frame
2017-2017
Investigators
Jana Diesner, Vetle Torvik
Total funding to date
$25,000.00
Funding agency
Korea Institute of Science and Technology Information

The project team will work on extracting key concepts from scholarly publications and explore techniques for building a taxonomy of extracted concepts by leveraging open knowledge bases (e.g., Wikipedia). The outcome of this process will be evaluated for various science and technology knowledge platform-based analysis services. The techniques, which reduce semantic ambiguity, will analyze…

BD Hubs: Collaborative Proposal: Midwest: Midwest Big Data Hub: Building Communities to Harness the Data Revolution

Time frame
2019-Present
Investigator
Catherine Blake
Total funding to date
$2,883,274.00
Funding agency
National Science Foundation

The Midwest Big Data Hub (MBDH) is a network of regional institutions created to facilitate collection, management, and use of complex information, creating active partnerships of experts and resources to address issues relevant to life in the Midwest.

MBDH aims to help member organizations working in Big Data coordinate…

Big Data-Theoretic Approach to Quantify Organizational Failure Mechanisms in Probabilistic Risk Assessment

Time frame
2015-Present
Investigators
Zahra Mohaghegh, Catherine Blake
Total funding to date
$899,663.00
Funding agency
National Science Foundation

Catastrophic events such as Fukushima and Katrina have made it clear that integrating physical and social causes of failure into a cohesive modeling framework is critical in order to prevent complex technological accidents and to maintain public safety and health. In this research, experts in Probabilistic Risk Assessment (PRA), Organizational Behavior and Information Science and Data…

CAREER: III: Modeling the Heterogeneity of Heterogeneity: Algorithms, Theories and Applications

Time frame
2019-Present
Investigator
Jingrui He
Total funding to date
$415,836.00
Funding agency
National Science Foundation

As an intrinsic property of big data, data heterogeneity can be seen in a variety of real-world applications, ranging from security to manufacturing and from healthcare to crowdsourcing. Many high-impact data mining applications exhibit the co-existence of multiple types of heterogeneity, such as different classification tasks, different data sources, and different labeling oracles.…

Collaborative Research: Changes in Molecular Gas and Galaxy Properties Over Time in the Era of Integral Field Unit Surveys

Time frame
2016-Present
Investigator
Matthew Turk
Total funding to date
$267,664.00
Funding agency
National Science Foundation

Changes of a galaxy's properties over time are driven by the quantity of its cold gas, the raw material from which stars form. Thus, understanding the properties of a galaxy's cold gas component will tell us both how the star formation process changes over time and how this affects galaxies. Using a recently completed survey of carbon monoxide gas in a sample of nearby galaxies, the project…

Collaborative Research: Accelerating Synthetic Biology Discovery & Exploration through Knowledge Integration

Time frame
2019-Present
Investigator
J. Stephen Downie
Total funding to date
$211,699.00
Funding agency
National Science Foundation

The scientific challenge for this project is to accelerate discovery and exploration of the synthetic biology design space. In particular, many parts used in synthetic biology come from or are initially tested in a simple bacterium, E. coli, but many potential applications in energy, agriculture, materials, and health require either different bacteria or higher-level organisms (yeast for…

Collaborative Research: SI2-SSI: Inquiry-Focused Volumetric Data Analysis Across Scientific Domains: Sustaining and Expanding the yt Community

Time frame
2017-Present
Investigator
Matthew Turk
Total funding to date
$1,061,721.00
Funding agency
National Science Foundation

Scientific discovery across the physical sciences is increasingly dependent on the analysis of volumetric (three-dimensional) data, which may come from a supercomputer simulation, direct measurement, or mathematical models. Researchers typically seek to extract meaningful insights from this data by visualizing and analyzing it in various ways. The ways in which scientists process…

Conservation Legacies: Understanding the Long-term Impacts of Private Foundation Investment in International Biodiversity Conservation

Time frame
2019-Present
Investigator
Jana Diesner
Total funding to date
$550,000.00
Funding agency
MacArthur Foundation

The overall goal of this project is to advance knowledge of the long-term impacts of international funding for biodiversity conservation from private foundations and the factors associated with sustainable conservation gains over time. The project will analyze funding from private foundations, with a focus on MacArthur Foundation investment in conservation as a case study. Our team will focus…

III: Small: Predictive Analysis of Diabetes Dedicated Social Networks

Time frame
2019-Present
Investigator
Jingrui He
Total funding to date
$448,049.00
Funding agency
National Science Foundation

This project will study diabetes-dedicated social networks. It aims to harness diabetes patients' online social behaviors from multiple networks to predict their biomarker measurements, such as glycated hemoglobin and fasting blood glucose. This project will provide a paradigm shift from exploration to prediction compared with state-of-the-art research on diabetes-dedicated social networks,…

Impact of Data Quality and Provenance

Time frame
2014-2017
Investigator
Jana Diesner
Total funding to date
$130,475.00
Funding agency
Korea Institute of Science and Technology Information

How do limitations and a lack of transparency in data quality and data provenance bias research outcomes, and how can we detect and mitigate these limitations? For example, we have been investigating the impact of entity resolution errors on network analysis results. We found that commonly reported network metrics and derived implications can strongly deviate from the truth, as established based on…

News Stories