Parulian defends dissertation

Doctoral candidate Nikolaus Parulian successfully defended his dissertation, "A Conceptual Model for Transparent, Reusable, and Collaborative Data Cleaning," on June 29.

His committee included Professor Bertram Ludäscher (chair), Professor J. Stephen Downie, Associate Professor Jana Diesner, and Assistant Professor Nigel Bosch.

Abstract: Data cleaning is an essential component of data preparation in machine learning and other data science workflows. It is a time-consuming and error-prone task that can greatly affect the reliability of subsequent analyses. Tools must capture provenance information to ensure transparent and auditable data-cleaning processes. However, existing provenance models have limitations in tracing and querying changes at different levels of granularity. To address this, we proposed a new conceptual model that captures fine-grained retrospective provenance and extends it with prospective provenance to represent operations or workflows that change the datasets. This hybrid model allows powerful queries and supports advanced use cases like auditing data cleaning workflows. Additionally, we extended the model to present a conceptual model focusing on reusability and collaboration in data cleaning. It addresses scenarios where multiple users contribute to dataset changes and enables tracking of curator actions, identifying dependencies between cleaning operations, and facilitating collaboration. Through an experimental case study, we demonstrated the reusability of data-cleaning workflows, different users' contributions, and collaboration's effectiveness in improving data quality.


Related News

Wegrzyn awarded SMART Scholarship

PhD student Emily Wegrzyn has been selected for the prestigious Science, Mathematics, and Research for Transformation (SMART) Scholarship-for-Service Program, which is funded by the Department of Defense. The primary aim of this program is to increase the number of civilian engineers and scientists in the U.S. 


Senior Spotlight: Colton Keiser

After graduating with his BSIS degree in May, Colton Keiser will head to St. Louis to work as an internal audit and financial advisory consultant with Protiviti. He gained experience in auditing while working as an intern for the Montgomery County Public Defender in his hometown of Hillsboro, Illinois.


Winning exhibit features recipes from across the globe

MSLIS students Yung-hui Chou, Alice Tierney-Fife, and Elizabeth Workman are the winners of this year’s Graduate Student Exhibit Contest, sponsored by the University of Illinois Library. Their exhibit, "Culture and Cuisine in Diaspora: A Hidden Library Collection," displays items from seven campus libraries and highlights research and recreational material centered on traditional recipes from across the globe. The exhibit is on display in the library's Marshall Gallery through the end of April and also available online.


Seo coauthors chapter on data science and accessibility

Assistant Professor JooYoung Seo and Mine Dogucu, professor of statistics in the Donald Bren School of Information and Computer Sciences at the University of California, Irvine, have coauthored a chapter in the new book Teaching Accessible Computing. The book, edited by Alannah Oleson, Amy J. Ko, and Richard Ladner, aims to help educators feel confident introducing topics related to disability and accessible computing and integrating accessibility into their courses.
