School of Information Sciences

Parulian defends dissertation

Doctoral candidate Nikolaus Parulian successfully defended his dissertation, "A Conceptual Model for Transparent, Reusable, and Collaborative Data Cleaning," on June 29.

His committee included Professor Bertram Ludäscher (chair), Professor J. Stephen Downie, Associate Professor Jana Diesner, and Assistant Professor Nigel Bosch.

Abstract: Data cleaning is an essential component of data preparation in machine learning and other data science workflows. It is a time-consuming and error-prone task that can greatly affect the reliability of subsequent analyses. Tools must capture provenance information to ensure transparent and auditable data-cleaning processes. However, existing provenance models have limitations in tracing and querying changes at different levels of granularity. To address this, we proposed a new conceptual model that captures fine-grained retrospective provenance and extends it with prospective provenance to represent the operations or workflows that change the datasets. This hybrid model allows powerful queries and supports advanced use cases such as auditing data-cleaning workflows. Additionally, we extended the model to focus on reusability and collaboration in data cleaning. The extended model addresses scenarios where multiple users contribute to dataset changes, and it enables tracking curator actions, identifying dependencies between cleaning operations, and facilitating collaboration. Through an experimental case study, we demonstrated the reusability of data-cleaning workflows, the contributions of different users, and the effectiveness of collaboration in improving data quality.



