Impact of Data Quality and Provenance

TIME FRAME
2014 – present
TOTAL FUNDING TO DATE
$130,475

How do limitations and intransparencies in data quality and data provenance bias research outcomes, and how can we detect and mitigate these limitations? For example, we have been investigating the impact of entity resolution errors on network analysis results. We found that commonly reported network metrics and derived implications can strongly deviate from the truth—as established based on gold standard data or approximations thereof—depending on the efforts dedicated to entity resolution.

Impact of Data Quality and Provenance

Personnel

Impact of Data Quality and Provenance
Jana Diesner
Principal Investigator (PI)

Funding Agencies

Korea Institute of Science and Technology Information — $130,475

Research Areas

Data Analytics, Data Curation