School of Information Sciences

Parulian defends dissertation

Doctoral candidate Nikolaus Parulian successfully defended his dissertation, "A Conceptual Model for Transparent, Reusable, and Collaborative Data Cleaning," on June 29.

His committee included Professor Bertram Ludäscher (chair), Professor J. Stephen Downie, Associate Professor Jana Diesner, and Assistant Professor Nigel Bosch.

Abstract: Data cleaning is an essential component of data preparation in machine learning and other data science workflows. It is a time-consuming and error-prone task that can greatly affect the reliability of subsequent analyses. Tools must capture provenance information to ensure transparent and auditable data-cleaning processes. However, existing provenance models have limitations in tracing and querying changes at different levels of granularity. To address this, we proposed a new conceptual model that captures fine-grained retrospective provenance and extends it with prospective provenance to represent operations or workflows that change the datasets. This hybrid model allows powerful queries and supports advanced use cases like auditing data cleaning workflows. Additionally, we extended the model to present a conceptual model focusing on reusability and collaboration in data cleaning. It addresses scenarios where multiple users contribute to dataset changes and enables tracking of curator actions, identifying dependencies between cleaning operations, and facilitating collaboration. Through an experimental case study, we demonstrated the reusability of data-cleaning workflows, different users' contributions, and collaboration's effectiveness in improving data quality.

Updated on
Backto the news archive

Related News

Nguyen receives Critical Language Scholarship

MSLIS student Christine Nguyen has been awarded a U.S. Department of State Critical Language Scholarship (CLS) to study Japanese this summer. She is one of four University of Illinois Urbana-Champaign students who received full scholarships to spend 8-10 weeks abroad and study one of 14 critical languages. The program is part of an initiative to expand the number of Americans studying and mastering critical foreign languages and cultural skills to enable them to contribute to U.S. economic competitiveness and national security.

Christine Thuy Minh Nguyen

iSchool researchers to present at CHI 2026

iSchool faculty and students will present their research at the ACM Conference on Human Factors in Computing Systems (CHI 2026), which will be held from April 13–17 in Barcelona, Spain. The conference, considered the most prestigious in the field of Human-Computer Interaction, attracts researchers and practitioners from around the globe.

Wang and Snap Research partner on "Profile Agent"

Imagine your favorite apps had a "digital twin" of your personality that actually grew up with you. Right now, most AI systems create a static snapshot of your interests. For example, a personal shopper who keeps recommending video games just because you bought one three years ago, even though you've long since moved on to hiking and cooking. To bridge this gap, Professor Dong Wang's team at the University of Illinois Urbana-Champaign is partnering with Snap Research to build a "Profile Agent."

Dong Wang

Liu receives support for AI project through NVIDIA Academic Grant Program

Assistant Professor Yaoyao Liu has been awarded a grant through the NVIDIA Academic Grant Program. NVIDIA, a world leader in accelerated computing and AI, established the program to advance academic research by providing world-class computing access and resources to researchers. Liu has received 32,000 A100 GPU-hours on Brev, an AI and machine learning platform that empowers developers to run, build, train, deploy, and scale AI models with GPU in the cloud. 

Yaoyao Liu

School of Information Sciences

501 E. Daniel St.

MC-493

Champaign, IL

61820-6211

Voice: (217) 333-3280

Email: ischool@illinois.edu

Back to top