School of Information Sciences

Parulian defends dissertation

Doctoral candidate Nikolaus Parulian successfully defended his dissertation, "A Conceptual Model for Transparent, Reusable, and Collaborative Data Cleaning," on June 29.

His committee included Professor Bertram Ludäscher (chair), Professor J. Stephen Downie, Associate Professor Jana Diesner, and Assistant Professor Nigel Bosch.

Abstract: Data cleaning is an essential component of data preparation in machine learning and other data science workflows. It is a time-consuming and error-prone task that can greatly affect the reliability of subsequent analyses. Tools must capture provenance information to ensure transparent and auditable data-cleaning processes. However, existing provenance models have limitations in tracing and querying changes at different levels of granularity. To address this, we proposed a new conceptual model that captures fine-grained retrospective provenance and extends it with prospective provenance to represent operations or workflows that change the datasets. This hybrid model allows powerful queries and supports advanced use cases like auditing data cleaning workflows. Additionally, we extended the model to present a conceptual model focusing on reusability and collaboration in data cleaning. It addresses scenarios where multiple users contribute to dataset changes and enables tracking of curator actions, identifying dependencies between cleaning operations, and facilitating collaboration. Through an experimental case study, we demonstrated the reusability of data-cleaning workflows, different users' contributions, and collaboration's effectiveness in improving data quality.

Updated on
Backto the news archive

Related News

Raji invited to join UN Working Expert Group

PhD student Mubarak Raji has been invited to join the Working Expert Group on AI Governance Interoperability. This group operates under the United Nations Office for Digital and Emerging Technologies' new AI Governance for Humanity Lab. It supports the Secretary-General's High-level Advisory Body on AI by providing evidence-based analysis for the Global Dialogue on AI Governance, which will be held in July 2026 in Geneva, Switzerland.

Mubarak Raji headshot

Paper by He's lab recognized at ICLR 2026 workshop

The iDEA-iSAIL Joint Laboratory at the University of Illinois received an Outstanding Paper Award at the International Conference on Learning Representations (ICLR) 2026 Logical Reasoning of Large Language Models Workshop for their paper, "RAG Over Tables: Hierarchical Memory Index, Multi-State Retrieval, and Benchmarking." Paper authors include lab members Jingrui He, professor and MSIM program director; Sirui Chen, Xinrui He, and Zihao Li, computer science PhD students; Jiaru Zou, computer science MS student; Dongqi Fu, alum; as well as Jiawei Han, professor of computer science, and Yada Zhu, IBM collaborator. Chen gave an oral presentation of the research at the workshop, which was held last month in Rio de Janeiro, Brazil. This award was selected out of 206 accepted papers at the workshop.

Jingrui He

Kemboi receives Young LIS Professional Award

PhD student Gladys Kemboi has been named a recipient of the Standing Conference of Eastern, Central and Southern African Library and Information Associations (SCECSAL) Excellence Awards 2026 in the category of Young LIS Professional. This is an international award recognizing excellence in library and information science in Africa. 

Gladys Kemboi

Downie presents TORCHLITE in Germany

This week, Professor and Executive Associate Dean J. Stephen Downie was a guest speaker at the Herder Institute in Marburg and the University of Göttingen. Downie, who serves as co-director of the HathiTrust Research Center (HTRC), lectured on the HTRC's "Tools for Open Research and Computation with HathiTrust: Leveraging Intelligent Text Extraction" (TORCHLITE) project.

J. Stephen Downie

Internship Spotlight: San Francisco Public Library

PhD student Adebola Obayemi discusses her internship with the San Francisco Public Library, where she worked on Expanding Information Access for Incarcerated People Initiative. She has been invited to present her proposal on digital literacy for incarcerated populations at the Expanding Information Access for Incarcerated People Convening, which will be held in June in Chicago. 

Adebola Obayemi

School of Information Sciences

501 E. Daniel St.

MC-493

Champaign, IL

61820-6211

Voice: (217) 333-3280

Email: ischool@illinois.edu

Back to top