School of Information Sciences

Kim awarded Eugene Garfield Doctoral Dissertation Fellowship

Doctoral candidate Jinseok Kim has been awarded a Eugene Garfield Doctoral Dissertation Fellowship by Beta Phi Mu, the International Library and Information Studies Honor Society. Up to six recipients are selected each year for this prestigious award, a national competition among doctoral students who are working on their dissertations. The amount awarded for each fellowship is $3,000.

"The Eugene Garfield Dissertation Fellowship will be a tremendous benefit to my doctoral research. It is a recognition for my work and will provide me valuable resources for gaining new knowledge," said Kim.  

Kim's research focuses on the role of data processing in knowledge discovery from data. His dissertation is titled “The impact of author name disambiguation on knowledge discovery from big scholarly data.”

Abstract: Using large-scale bibliometric data, scholars in diverse fields have gleaned knowledge for use in scholarly evaluation, collaborator recommendation, and network-evolution modeling. A common challenge is that author names in bibliometric data are not properly disambiguated. Different authors may share the same name and be misrepresented as a single author (merging of identities), and one author may use name variations and be represented as two or more different authors (splitting of identities). When faced with these authority-control challenges, a majority of scholars have processed bibliometric data using a simple heuristic: if two author names share the same surname and given-name initials, they are presumed to refer to the same author. Furthermore, those scholars have assumed, without proper justification, that their findings are robust to authority-control errors.
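The initials-based heuristic described in the abstract can be illustrated with a minimal sketch. The function name `initials_key`, the "Surname, Given Names" input format, and the sample records are hypothetical and chosen only for illustration; this is not code from the dissertation.

```python
from collections import defaultdict


def initials_key(full_name: str) -> str:
    """Reduce 'Surname, Given Names' to 'surname, initials' (assumed format)."""
    surname, _, given = full_name.partition(",")
    initials = "".join(part[0] for part in given.split())
    return f"{surname.strip().lower()}, {initials.lower()}"


# Hypothetical records: (true author id, name as printed on the paper)
records = [
    ("A1", "Lee, Sujin"),
    ("A2", "Lee, Sangho"),       # different person, same surname and initial
    ("A3", "Smith, John Paul"),
    ("A3", "Smith, John"),       # same person, name variation
]

# Group true author ids under the key the heuristic would assign
clusters = defaultdict(set)
for author_id, name in records:
    clusters[initials_key(name)].add(author_id)

# Merging: one key that covers more than one true author
merged = {key for key, ids in clusters.items() if len(ids) > 1}

# Splitting: one true author that appears under more than one key
keys_per_author = defaultdict(set)
for author_id, name in records:
    keys_per_author[author_id].add(initials_key(name))
split = {a for a, keys in keys_per_author.items() if len(keys) > 1}
```

In this toy input, the two authors named Lee collapse into one identity (merging), while the single author named Smith is divided into two identities (splitting).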

My dissertation tests this assumption by measuring the impact of author name ambiguity on network properties. I do so under varying conditions, including network size and time window, using four large-scale bibliometric datasets that cover biomedicine, computer science, physics, and one nation’s entire domestic publication output (Korea). To this end, statistical properties of collaboration networks generated from algorithmically disambiguated data (i.e., close to clean data) are compared against those of the same networks compromised by authors misidentified because of name ambiguity. My findings show that data processing can severely distort both our micro-level and macro-level understanding of a given network. This distortion can sometimes lead to false knowledge of network formation and evolution mechanisms, such as preferential attachment generating a power-law distribution of node degree. In addition, my dissertation explores whether compromised author names can be identified by their network-based characteristics, and provides practical guidance for scholars and decision makers.
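The comparison described above can be sketched on a toy scale: the same set of papers is turned into a coauthorship network twice, once with true author identities and once with initials-based keys, and simple network statistics are compared. The helper names, the sample papers, and the choice of node count and maximum degree as the statistics are assumptions made for illustration, not the measures or data used in the dissertation.

```python
from itertools import combinations


def initials_key(name):
    """Reduce 'Surname, Given Names' to 'surname, initials' (assumed format)."""
    surname, _, given = name.partition(",")
    initials = "".join(part[0] for part in given.split())
    return f"{surname.strip().lower()}, {initials.lower()}"


def build_network(papers, label):
    """Return {node: set of coauthor nodes} using `label` to name each author."""
    adjacency = {}
    for authors in papers:
        nodes = {label(author) for author in authors}
        for node in nodes:
            adjacency.setdefault(node, set())
        for u, v in combinations(sorted(nodes), 2):
            adjacency[u].add(v)
            adjacency[v].add(u)
    return adjacency


def summary(adjacency):
    """Node count and maximum degree of the network."""
    degrees = [len(neighbors) for neighbors in adjacency.values()]
    return {"nodes": len(adjacency), "max_degree": max(degrees)}


# Hypothetical papers; each author is (true id, printed name)
papers = [
    [("A1", "Lee, Sujin"), ("A3", "Park, Dana")],
    [("A2", "Lee, Sangho"), ("A5", "Yoon, Hana")],
    [("A1", "Lee, Sujin"), ("A4", "Cho, Eun")],
]

clean = build_network(papers, label=lambda author: author[0])
noisy = build_network(papers, label=lambda author: initials_key(author[1]))

print(summary(clean))
print(summary(noisy))
```

Here the two authors named Lee are merged in the initials-based network, so it has fewer nodes and a higher maximum degree than the network built from true identities. A distortion of this kind, repeated across a large dataset, is the sort of effect on degree distributions that the abstract describes.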

Related News

Spectrum Scholar Spotlight: Nathaniel Allen Pila

Eight iSchool master's students have been named 2025–2026 Spectrum Scholars by the American Library Association. This "Spectrum Scholar Spotlight" series highlights the School's scholars. MSLIS student Nathaniel Allen Pila earned a bachelor's degree in psychology from Mount Holyoke College.

iSchool participation in iConference 2026

The following iSchool faculty and students will participate in iConference 2026, which will be held virtually from March 23–26 and physically from March 29–April 2 in Edinburgh, Scotland. The theme of this year's conference is "Information Literacies, Authenticity and Use: The Move Towards a Digitally Enlightened Society."

Wang receives AccessComputing funding for video game project

Informatics PhD student Olive Wang has been awarded a minigrant by AccessComputing, an organization that supports people with disabilities in computing. The $5,000 grant will support Wang's work on the video game Loadouts, which teaches players why accessibility is important. In the game, players learn why video games are inaccessible for players who are low-vision and how accessibility features such as high contrast, auditory cues, and multimodality can be effective.

Chan’s "Predatory Data" named a 2026 PROSE Award finalist

Professor Anita Say Chan's book Predatory Data: Eugenics in Big Tech and Our Fight for an Independent Future (University of California Press, 2025) has been named a finalist in the Computing and Information Sciences Category of the 2026 PROSE Awards. The annual awards bestowed by the Association of American Publishers recognize the very best in professional and scholarly publishing and celebrate works that have made significant advancements in their respective fields of study.

He inducted into Sigma Xi

Professor Jingrui He has been inducted into Sigma Xi, The Scientific Research Honor Society. Sigma Xi is the international honor society of science and engineering and one of the oldest and largest scientific organizations in the world, boasting a history of service to science and society spanning over 125 years. It has a multidisciplinary membership of scientists, engineers, and scholars, and Sigma Xi chapters can be found in universities and colleges, government laboratories, and commercial research centers.
