Parulian defends dissertation

Doctoral candidate Nikolaus Parulian successfully defended his dissertation, "A Conceptual Model for Transparent, Reusable, and Collaborative Data Cleaning," on June 29.

His committee included Professor Bertram Ludäscher (chair), Professor J. Stephen Downie, Associate Professor Jana Diesner, and Assistant Professor Nigel Bosch.

Abstract: Data cleaning is an essential component of data preparation in machine learning and other data science workflows. It is a time-consuming and error-prone task that can greatly affect the reliability of subsequent analyses. Tools must capture provenance information to ensure transparent and auditable data-cleaning processes. However, existing provenance models have limitations in tracing and querying changes at different levels of granularity. To address this, we proposed a new conceptual model that captures fine-grained retrospective provenance and extends it with prospective provenance to represent operations or workflows that change the datasets. This hybrid model allows powerful queries and supports advanced use cases like auditing data cleaning workflows. Additionally, we extended the model to present a conceptual model focusing on reusability and collaboration in data cleaning. It addresses scenarios where multiple users contribute to dataset changes and enables tracking of curator actions, identifying dependencies between cleaning operations, and facilitating collaboration. Through an experimental case study, we demonstrated the reusability of data-cleaning workflows, different users' contributions, and collaboration's effectiveness in improving data quality.

Updated on
Backto the news archive

Related News

iSchool researchers work with diverse groups to improve user experience

iSchool faculty are studying ways to improve user experience, with a common goal of improving technology and applications for the needs of individual users. These researchers are working with diverse groups to gain feedback, and several current projects are focused on experiences for users with disabilities.

Das receives student membership award from ASIS&T

PhD student Puranjani Das has been selected as a recipient of the Association for Information Science and Technology (ASIS&T) SIG CMR Student Membership Award for the 2024-2025 academic year. She will receive a complimentary one-year membership in both ASIS&T and SIG CMR, a special interest group focused on classification and metadata research.

Puranjani Das

Kim defends dissertation

Doctoral candidate Jenna Kim successfully defended her dissertation, "Evaluating Pre-Trained Language Modeling Approaches for Author Name Disambiguation," on June 11, 2024.

Jenna Kim headshot

Desai defends dissertation

Doctoral candidate Smit Desai successfully defended his dissertation, "Designing Metaphor-fluid Voice User Interfaces," on June 10.

Smit Desai

Student says ‘thank you’ with a helicopter ride

Last month, Michael Ferrer showed appreciation for one of his MSIM instructors in a unique way—by inviting him for an insider’s look at his work as a reservist in the Illinois Army National Guard. For the ILARNG BOSS Lift, which took place on June 18 at Camp Atterbury, Indiana, Ferrer selected Michael Wonderlich, iSchool adjunct lecturer and senior associate director of business intelligence and enterprise architecture for Administrative Information Technology Services (AITS) at the University of Illinois.

Michael Wonderlich and Michael Ferrer hold a U of I flag in front of a military helicopter