School of Information Sciences

Jinseok Kim defends dissertation

Doctoral candidate Jinseok Kim successfully defended his dissertation, "The impact of author name disambiguation on knowledge discovery from large-scale scholarly data," on April 24. 

His committee included Assistant Professor Jana Diesner (chair), Associate Professor Catherine Blake, Assistant Professor Vetle Torvik, Michelle Shumate (associate professor of communication studies, Northwestern University), and Seok-Hyoung Lee (senior researcher, Korea Institute of Science and Technology Information).

From the abstract: In this study, I demonstrate that the choice of data pre-processing methods for resolving author name ambiguity can adversely affect our understanding of scholarly collaboration patterns and coauthorship network structure extracted from bibliometric data . . . A common challenge has been that author names in bibliometric data are not properly disambiguated: authors may share the same name (i.e., different authors are sometimes misrepresented to be a single author which can lead to a “merging of identities”). In addition, one author may use name variations (i.e., an author may be represented as two or more different authors which can lead to a “splitting of identities”). When faced with these challenges, most scholars have pre-processed bibliometric data using simple heuristics (e.g., if two author names share the same surname and given name initials, they are presumed to refer to the same author identity) and assumed that their findings are robust to errors due to author name ambiguity.

My findings show that initial-based name disambiguation methods can severely distort our understanding of given networks and such distortion gets severe over time. Moreover, this distortion can sometimes lead to false knowledge of network formation and evolution mechanisms such as preferential attachment generating power-law distribution of node degree and to false validation of theories about the choice of collaborators in scientific research, which may result in ill-informed decisions about research policy and resource allocation.

Tags:
Updated on
Backto the news archive

Related News

Kang makes sense of too much information

As an MSIM student at the iSchool, Zhanchen Kang is passionate about helping people make sense of the overwhelming amount of information in their daily lives. Kang earned an undergraduate degree in information systems in China before coming to the University of Illinois to further explore how technology, data, and people intersect. 

Zhanchen Kang

Students from The Stu/dio to present work at MDEV

Students from The Stu/dio, the University of Illinois student-led game production studio, are preparing to take the stage at MDEV 2025, which will be held on November 7-8 in Madison, Wisconsin. One of the Midwest's most popular game industry conferences, MDEV celebrates innovation and collaboration in game development by bringing together game designers, developers, and enthusiasts from across the region for panels, workshops, and networking. 

PhD students receive scholarships from IAPP

Information Sciences PhD students Mubarak Raji, Eryclis Rodrigues Silva, and Eryue Xu, and Informatics PhD student Muhammad Hussain have received A. Serwin Conference Scholarships from the International Association of Privacy Professionals (IAPP). The award, which recognizes outstanding students in the areas of privacy, AI governance, and digital responsibility, consists of $1,000 and complimentary conference registration. The IAPP’s annual conference, Privacy. Security. Risk., will be held October 30-31 in San Diego, California.

Perkins defends dissertation

PhD candidate Jana M. Perkins successfully defended her dissertation, "Scholarship writ large: A data-rich analysis of professionalization in English literary scholarship from 1940 to the present."

Jana Perkins

Yu receives 2025 Google PhD Fellowship

PhD student Yaman Yu has been named a recipient of the 2025 Google PhD Fellowship in Privacy, Safety, and Security. The fellowship program recognizes outstanding graduate students who are conducting exceptional and innovative research in computer science and related fields, with a special focus on candidates who seek to influence the future of technology. Google PhD fellowships include tuition and fees, a stipend, and mentorship from a Google Research Mentor for up to two years. Google.org is providing over $10 million to support 255 PhD students across 35 countries and 12 research domains.

Yaman Yu

School of Information Sciences

501 E. Daniel St.

MC-493

Champaign, IL

61820-6211

Voice: (217) 333-3280

Fax: (217) 244-3302

Email: ischool@illinois.edu

Back to top