Parulian defends dissertation

Doctoral candidate Nikolaus Parulian successfully defended his dissertation, "A Conceptual Model for Transparent, Reusable, and Collaborative Data Cleaning," on June 29.

His committee included Professor Bertram Ludäscher (chair), Professor J. Stephen Downie, Associate Professor Jana Diesner, and Assistant Professor Nigel Bosch.

Abstract: Data cleaning is an essential component of data preparation in machine learning and other data science workflows. It is a time-consuming and error-prone task that can greatly affect the reliability of subsequent analyses. Tools must capture provenance information to ensure transparent and auditable data-cleaning processes. However, existing provenance models have limitations in tracing and querying changes at different levels of granularity. To address this, we proposed a new conceptual model that captures fine-grained retrospective provenance and extends it with prospective provenance to represent operations or workflows that change the datasets. This hybrid model allows powerful queries and supports advanced use cases like auditing data cleaning workflows. Additionally, we extended the model to present a conceptual model focusing on reusability and collaboration in data cleaning. It addresses scenarios where multiple users contribute to dataset changes and enables tracking of curator actions, identifying dependencies between cleaning operations, and facilitating collaboration. Through an experimental case study, we demonstrated the reusability of data-cleaning workflows, different users' contributions, and collaboration's effectiveness in improving data quality.

Updated on
Backto the news archive

Related News

Wang wins grand prize at Research Live!

Informatics PhD student Olivia Wang won the Grand Prize at the 2025 Research Live! competition, which was held on April 8 in the Campus Instructional Facility Atrium. At the event, which is hosted by the Graduate College, thirteen finalists presented their graduate research in three minutes or less to a general audience. Wang received $500 as the Grand Prize winner.

Olivia Wang

Spectrum Scholar Spotlight: Katherine Mendoza Gonzalez

Twelve iSchool master's students were named 2024–2025 Spectrum Scholars by the American Library Association (ALA) Office for Diversity, Literacy, and Outreach Services. This "Spectrum Scholar Spotlight" series highlights the School's scholars. MSLIS Katherine Mendoza Gonzalez earned her BA in history from Aurora University in Aurora, Illinois.

Katherine Mendoza Gonzalez

Zhou defends dissertation

Doctoral candidate Kyrie Zhixuan Zhou successfully defended his dissertation, "A Pragmatic and Human-centered Approach to Promoting Software Accessibility: Design, Education, Governance," on April 3.

Zhixuan Zhou

Scholarship alleviates financial burden for returning student

During her time as an active-duty Naval Officer, Anna Hartman realized that she had a passion for helping others and building community. That passion, combined with a lifelong love of reading, led her to pursue an MSLIS degree at the University of Illinois. Hartman is receiving support for her studies through the Balz Endowment Fund, which was established by Nancy (BA LAS '70, MSLIS '72) and Dan (BS Media '68, MS Media '72) Balz to help make education more affordable for returning students.

Anna Hartman