Guan successfully defends dissertation

Yingjun Guan

Doctoral candidate Yingjun Guan successfully defended his dissertation, "Disambiguating Academic Institution Names: A Comprehensive Study of Authority Files, Linguistic Variations, and Computational Evaluation in PubMed Affiliations," on April 28. 

His committee included Associate Professor Vetle Torvik, Professor J. Stephen Downie, Professor Bertram Ludäscher, and Professor Allen Renear.

Abstract: This dissertation investigates the challenges of institutional name disambiguation (IND) in scholarly communication, focusing on the inconsistencies and ambiguities found in academic affiliation metadata. It examines variations in naming conventions, institutional hierarchies, and multilingual expressions that hinder accurate representation across digital library systems and bibliometric platforms. Through a comparative review of 21 authority files—including VIAF, ROR, and Wikidata—a new integrated authority dataset is developed to improve standardization. The study further introduces a manually annotated dataset of PubMed affiliation records to analyze linguistic patterns, synonym usage, and structural inconsistencies in real-world data. It evaluates the coverage and performance of major authority files and computational tools using precision, recall, and other core metrics. The findings include an organized framework that categorizes the different types of linguistic ambiguities in institutional names, a benchmark dataset for future research, and practical insights into combining authority control with computational methods. Together, these efforts support more reliable affiliation parsing and enhance data integrity in bibliometrics, citation indexing, and digital scholarly infrastructures.


Related News

iSchool alumni and student named 2025 Movers & Shakers

Two iSchool alumni and an MSLIS student are included in Library Journal's 2025 class of Movers & Shakers, an annual list that recognizes 50 professionals who are moving the library field forward as a profession. Leah Gregory (MSLIS '04) was honored in the Advocates category, Billy Tringali (MSLIS '19) was honored in the Innovators category, and University Library Assistant Professor and Digital Humanities Librarian Mary Ton (current MSLIS student) was honored in the Educators category.

Spectrum Scholar Spotlight: Dalia Ortiz Pon

Twelve iSchool master's students were named 2024–2025 Spectrum Scholars by the American Library Association (ALA) Office for Diversity, Literacy, and Outreach Services. This "Spectrum Scholar Spotlight" series highlights the School's scholars. MSLIS student Dalia Ortiz Pon earned her bachelor's degree in Latina/Latino studies from San Francisco State University. 


Debnath datafies "The Bulletin"

MSIM student Tan Debnath, whose interests span data mining, statistical modeling, text mining, and digital humanities, joined the Center for Children's Books as a research assistant. He was tasked with building curation processes that would datafy seventy-five years' worth of archival issues of The Bulletin of the Center for Children's Books, one of the nation's leading children's book review journals.


iSchool undergraduates selected as 2025 Community-Academic Scholars

The Interdisciplinary Health Sciences Institute (IHSI) has selected BSIS student Dhanvi Puttur and BSIS+DS student Lara Terpetschnig as 2025 Community-Academic Scholars. Representing nineteen majors and nine minors in eight colleges and schools at the University of Illinois Urbana-Champaign and two additional universities, the eighteen scholars in this cohort encompass diverse fields of study, from community health to graphic design to statistics. 


Scholarship provides validation, motivation for Martinez

BSIS+DS student Fabian Martinez chose his major because he wanted to learn how to help people understand and interpret data and information. While his immediate plans include finding a job in data analytics, business analytics, consulting, or product management, his ultimate goal is "to create meaningful relationships and help make a meaningful impact in the world" in whatever way he can.
