School of Information Sciences

Parulian defends dissertation

Doctoral candidate Nikolaus Parulian successfully defended his dissertation, "A Conceptual Model for Transparent, Reusable, and Collaborative Data Cleaning," on June 29.

His committee included Professor Bertram Ludäscher (chair), Professor J. Stephen Downie, Associate Professor Jana Diesner, and Assistant Professor Nigel Bosch.

Abstract: Data cleaning is an essential component of data preparation in machine learning and other data science workflows. It is a time-consuming and error-prone task that can greatly affect the reliability of subsequent analyses. Tools must capture provenance information to ensure transparent and auditable data-cleaning processes. However, existing provenance models have limitations in tracing and querying changes at different levels of granularity. To address this, we proposed a new conceptual model that captures fine-grained retrospective provenance and extends it with prospective provenance to represent operations or workflows that change the datasets. This hybrid model allows powerful queries and supports advanced use cases like auditing data-cleaning workflows. Additionally, we extended the model to support reusability and collaboration in data cleaning. The extended model addresses scenarios where multiple users contribute to dataset changes and enables tracking of curator actions, identifying dependencies between cleaning operations, and facilitating collaboration. Through an experimental case study, we demonstrated the reusability of data-cleaning workflows, the contributions of different users, and the effectiveness of collaboration in improving data quality.



