Parulian defends dissertation

Doctoral candidate Nikolaus Parulian successfully defended his dissertation, "A Conceptual Model for Transparent, Reusable, and Collaborative Data Cleaning," on June 29.

His committee included Professor Bertram Ludäscher (chair), Professor J. Stephen Downie, Associate Professor Jana Diesner, and Assistant Professor Nigel Bosch.

Abstract: Data cleaning is an essential component of data preparation in machine learning and other data science workflows. It is a time-consuming and error-prone task that can greatly affect the reliability of subsequent analyses. Tools must capture provenance information to ensure transparent and auditable data-cleaning processes. However, existing provenance models have limitations in tracing and querying changes at different levels of granularity. To address this, we proposed a new conceptual model that captures fine-grained retrospective provenance and extends it with prospective provenance to represent operations or workflows that change the datasets. This hybrid model allows powerful queries and supports advanced use cases like auditing data cleaning workflows. Additionally, we extended the model to present a conceptual model focusing on reusability and collaboration in data cleaning. It addresses scenarios where multiple users contribute to dataset changes and enables tracking of curator actions, identifying dependencies between cleaning operations, and facilitating collaboration. Through an experimental case study, we demonstrated the reusability of data-cleaning workflows, different users' contributions, and collaboration's effectiveness in improving data quality.

Updated on
Backto the news archive

Related News

Spectrum Scholar Spotlight: Ted Farias

Seventeen iSchool master’s students have been named 2023-2024 Spectrum Scholars by the American Library Association (ALA) Office for Diversity, Literacy, and Outreach Services. This "Spectrum Scholar Spotlight" series highlights the School's scholars. MSLIS student Ted Farias earned his BA in psychology from California State University of Long Beach.

Ted Farias

iSchool researchers present at inaugural ASIS&T symposium

iSchool researchers will present their work at the Association for Information Science & Technology (ASIS&T) Midwest Chapter Spring Symposium on April 26. The inaugural symposium will include talks by seventeen researchers from ten institutions across the Midwest region.

New EU legislation has iSchool connection

Thanks to new European Union (EU) legislation, those who perform on-demand work through an app or website, such as DoorDash or Uber, will enjoy better working conditions. PhD student Zachary Kilhoffer, who spent four years working as a researcher for the Centre for European Policy Studies (CEPS) in Brussels prior to entering the iSchool's doctoral program, authored or co-authored several policy research pieces that informed the creation of the EU Platform Work Directive.

Zak Kilhoffer

Undergraduate Research Symposium features iSchool researchers

Several iSchool undergraduate students will participate in the 17th annual Undergraduate Research Symposium. During the event, visitors will learn about undergraduate research projects through oral and poster presentations, creative performances, and art exhibits. All are welcome to attend the symposium, which will be held on April 25 from 9:00 a.m.-5:00 p.m. in the Illini Rooms and South Lounge of the Illini Union. 

iSchool researchers present at iConference 2024

The following iSchool faculty and students participated in the virtual portion of iConference 2024 from April 15-18. The in-person portion of the conference will be held in Changchun, China, from April 22-26. The theme of this year’s conference is "Wisdom, Well-being, Win-win."