Jinseok Kim defends dissertation

Doctoral candidate Jinseok Kim successfully defended his dissertation, "The impact of author name disambiguation on knowledge discovery from large-scale scholarly data," on April 24. 

His committee included Assistant Professor Jana Diesner (chair), Associate Professor Catherine Blake, Assistant Professor Vetle Torvik, Michelle Shumate (associate professor of communication studies, Northwestern University), and Seok-Hyoung Lee (senior researcher, Korea Institute of Science and Technology Information).

From the abstract: In this study, I demonstrate that the choice of data pre-processing methods for resolving author name ambiguity can adversely affect our understanding of scholarly collaboration patterns and coauthorship network structure extracted from bibliometric data . . . A common challenge has been that author names in bibliometric data are not properly disambiguated: authors may share the same name (i.e., different authors are sometimes misrepresented to be a single author which can lead to a “merging of identities”). In addition, one author may use name variations (i.e., an author may be represented as two or more different authors which can lead to a “splitting of identities”). When faced with these challenges, most scholars have pre-processed bibliometric data using simple heuristics (e.g., if two author names share the same surname and given name initials, they are presumed to refer to the same author identity) and assumed that their findings are robust to errors due to author name ambiguity.

My findings show that initial-based name disambiguation methods can severely distort our understanding of given networks and such distortion gets severe over time. Moreover, this distortion can sometimes lead to false knowledge of network formation and evolution mechanisms such as preferential attachment generating power-law distribution of node degree and to false validation of theories about the choice of collaborators in scientific research, which may result in ill-informed decisions about research policy and resource allocation.

Tags:
Updated on
Backto the news archive

Related News

SafeRBot to assist community, police in crime reporting

Across the nation, 911 dispatch centers are facing a worker shortage. Unfortunately, this understaffing, plus the nature of the job itself, leads to dispatchers who are often overworked and stressed. Meanwhile, when community members need to report a crime, their options are to contact 911 for an emergency or, in a non-emergency situation, call a non-emergency number or fill out an online form. A new chatbot, SafeRBot, designed and developed by Associate Professor Yun Huang, Informatics PhD student Yiren Liu, and BSIS student Tony An seeks to improve the reporting process for non-emergency situations for both community members and dispatch centers.

Yun Huang

New digital collection sheds light on queer nightlife in Champaign County

Adam Beaty decided to pursue an MSLIS degree to combine his love of history, the arts, and community-centered spaces. This combination of interests culminated in a 244-item digital collection that showcases digitized materials depicting nearly thirty years of queer nightlife in Champaign County. 

Adam Beaty_headshot

Get to Know Deekshita Karingula, MSIM Student

After graduation, Deekshita Karingula would like to build data pipelines, automate workflows for greater efficiency, and use data to transform healthcare. She views the MSIM program as the "ideal way" to connect her computer science and technical skills with data management skills, helping her reach her goals.

Deekshita Karingula

Wang group to present at BigData 2024

Members of Associate Professor Dong Wang's research group, the Social Sensing and Intelligence Lab, will present their research at the 2024 IEEE International Conference on Big Data (BigData 2024), which will be held from December 15-18 in Washington, D.C. BigData 2024 is the premier venue to present and discuss progress in research, development, standards, and applications of topics in artificial intelligence, machine learning and big data analytics.

Dong Wang