Parulian defends dissertation

Doctoral candidate Nikolaus Parulian successfully defended his dissertation, "A Conceptual Model for Transparent, Reusable, and Collaborative Data Cleaning," on June 29.

His committee included Professor Bertram Ludäscher (chair), Professor J. Stephen Downie, Associate Professor Jana Diesner, and Assistant Professor Nigel Bosch.

Abstract: Data cleaning is an essential component of data preparation in machine learning and other data science workflows. It is a time-consuming and error-prone task that can greatly affect the reliability of subsequent analyses. Tools must capture provenance information to ensure transparent and auditable data-cleaning processes. However, existing provenance models have limitations in tracing and querying changes at different levels of granularity. To address this, we proposed a new conceptual model that captures fine-grained retrospective provenance and extends it with prospective provenance to represent operations or workflows that change the datasets. This hybrid model allows powerful queries and supports advanced use cases like auditing data cleaning workflows. Additionally, we extended the model to present a conceptual model focusing on reusability and collaboration in data cleaning. It addresses scenarios where multiple users contribute to dataset changes and enables tracking of curator actions, identifying dependencies between cleaning operations, and facilitating collaboration. Through an experimental case study, we demonstrated the reusability of data-cleaning workflows, different users' contributions, and collaboration's effectiveness in improving data quality.

Updated on
Backto the news archive

Related News

Han successfully defends dissertation

Doctoral candidate Yingying Han successfully defended her dissertation, "Community Archives as Agency: Documenting Chinese American Experiences in the U.S.,” on May 28.

Yingying Han

Student award recipients announced

The School of Information Sciences recognized student award recipients at the iSchool Convocation on May 18. Awards are based on academic achievements as well as attributes that contribute to professional success. For more information about each award, including past recipients, visit the Student Awards page. Congratulations to this year's honorees!

Award recipients Mahir Thakkar, Delia Kerr-Dennhardt, Katie Skoufes, Audrey Bentch, and Adam Beaty.

iSchool alumni and student named 2025 Movers & Shakers

Two iSchool alumni and an MSLIS student are included in Library Journal's 2025 class of Movers & Shakers, an annual list that recognizes 50 professionals who are moving the library field forward as a profession. Leah Gregory (MSLIS '04) was honored in the Advocates category, Billy Tringali (MSLIS '19) was honored in the Innovators category, and University Library Assistant Professor and Digital Humanities Librarian Mary Ton (current MSLIS student) was honored in the Educators category.

Spectrum Scholar Spotlight: Dalia Ortiz Pon

Twelve iSchool master's students were named 2024–2025 Spectrum Scholars by the American Library Association (ALA) Office for Diversity, Literacy, and Outreach Services. This "Spectrum Scholar Spotlight" series highlights the School's scholars. MSLIS student Dalia Ortiz Pon earned her bachelor's degree in Latina/Latino studies from San Francisco State University. 

Dalia Ortiz Pon