School of Information Sciences

Parulian defends dissertation

Doctoral candidate Nikolaus Parulian successfully defended his dissertation, "A Conceptual Model for Transparent, Reusable, and Collaborative Data Cleaning," on June 29.

His committee included Professor Bertram Ludäscher (chair), Professor J. Stephen Downie, Associate Professor Jana Diesner, and Assistant Professor Nigel Bosch.

Abstract: Data cleaning is an essential component of data preparation in machine learning and other data science workflows. It is a time-consuming and error-prone task that can greatly affect the reliability of subsequent analyses. Tools must capture provenance information to ensure transparent and auditable data-cleaning processes. However, existing provenance models have limitations in tracing and querying changes at different levels of granularity. To address this, we proposed a new conceptual model that captures fine-grained retrospective provenance and extends it with prospective provenance to represent the operations or workflows that change the datasets. This hybrid model allows powerful queries and supports advanced use cases such as auditing data-cleaning workflows. Additionally, we extended the model to support reusability and collaboration in data cleaning. The extended model addresses scenarios where multiple users contribute to dataset changes; it enables tracking of curator actions, identifies dependencies between cleaning operations, and facilitates collaboration. Through an experimental case study, we demonstrated the reusability of data-cleaning workflows, the contributions of different users, and the effectiveness of collaboration in improving data quality.

Related News

BIG: Solving real problems for real organizations

Students in the Business Intelligence Group (BIG)—the experiential learning consultancy program affiliated with Associate Professor Yoo-Seong Song's Applied Business Research courses (IS 494 and IS 514)—spent the spring semester working directly with organizations across industries, including health care, financial services, aviation, gaming, community services, and higher education. 

Cao and Liu receive Best Paper Award for FreeOrbit4D

PhD student Wei Cao and Assistant Professor Yaoyao Liu received a Best Paper Award at the 4th Workshop on Generative Models for Computer Vision, which was held during the 2026 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 

Wang group receives ICWSM Best Dataset Paper Award

A paper from Professor Dong Wang's Social Sensing & Intelligence Lab received the Best Dataset Paper Award at the International AAAI Conference on Web and Social Media (ICWSM) held in May 2026 in Los Angeles, California. According to Wang, the paper was accepted in the first review round, which had an acceptance rate of 4.7 percent (14 of 298 submissions). 

Adler and Wang to present at RESPECT 2026

Associate Professor Rachel Adler and Informatics PhD student Olive Wang will present their work at the Association for Computing Machinery Special Interest Group on Computer Science Education Conference on Research on Equity and Sustained Participation in Engineering, Computing, and Technology (RESPECT), which will be held in Chicago this week.

Bashir group presents work at PEPR 2026

PhD students Ramazan Yener, Eryue Xu, and Mubarak Raji presented their research this week at the 2026 USENIX Conference on Privacy Engineering Practice and Respect (PEPR) in Santa Clara, California. PEPR is focused on designing and building products and systems with privacy and respect for their users and the societies in which they operate. The students received USENIX grants covering their conference registration and providing travel support to attend the conference. 
