Jinseok Kim defends dissertation

Doctoral candidate Jinseok Kim successfully defended his dissertation, "The impact of author name disambiguation on knowledge discovery from large-scale scholarly data," on April 24. 

His committee included Assistant Professor Jana Diesner (chair), Associate Professor Catherine Blake, Assistant Professor Vetle Torvik, Michelle Shumate (associate professor of communication studies, Northwestern University), and Seok-Hyoung Lee (senior researcher, Korea Institute of Science and Technology Information).

From the abstract: In this study, I demonstrate that the choice of data pre-processing methods for resolving author name ambiguity can adversely affect our understanding of scholarly collaboration patterns and coauthorship network structure extracted from bibliometric data . . . A common challenge has been that author names in bibliometric data are not properly disambiguated: authors may share the same name (i.e., different authors are sometimes misrepresented to be a single author which can lead to a “merging of identities”). In addition, one author may use name variations (i.e., an author may be represented as two or more different authors which can lead to a “splitting of identities”). When faced with these challenges, most scholars have pre-processed bibliometric data using simple heuristics (e.g., if two author names share the same surname and given name initials, they are presumed to refer to the same author identity) and assumed that their findings are robust to errors due to author name ambiguity.

My findings show that initial-based name disambiguation methods can severely distort our understanding of given networks and such distortion gets severe over time. Moreover, this distortion can sometimes lead to false knowledge of network formation and evolution mechanisms such as preferential attachment generating power-law distribution of node degree and to false validation of theories about the choice of collaborators in scientific research, which may result in ill-informed decisions about research policy and resource allocation.

Tags:
Updated on
Backto the news archive

Related News

Senior Spotlight: Colton Keiser

After graduating with his BSIS degree in May, Colton Keiser will head to St. Louis to work as an internal audit and financial advisory consultant with Protiviti. He gained experience in auditing while working as an intern for the Montgomery County Public Defender in his hometown of Hillsboro, Illinois.

Colton Keiser

Winning exhibit features recipes from across the globe

MSLIS students Yung-hui Chou, Alice Tierney-Fife, and Elizabeth Workman are the winners of this year’s Graduate Student Exhibit Contest, sponsored by the University of Illinois Library. Their exhibit, "Culture and Cuisine in Diaspora: A Hidden Library Collection," displays items from seven campus libraries and highlights research and recreational material centered on traditional recipes from across the globe. The exhibit is on display in the library's Marshall Gallery through the end of April and also available online.

Get to know Michael Ferrer, MSIM student

After spending some time in the defense IT industry, Michael Ferrer decided to return to school for his MSIM degree to gain skills in areas such as data visualization and advance his career. Outside of his studies, Ferrer is a competitive ballroom dancer and member of the Illinois Army National Guard.

Michael Ferrer

ConnectED: Tech for All podcast launched by Community Data Clinic

The Community Data Clinic (CDC), a mixed methods data studies and interdisciplinary community research lab led by Associate Professor Anita Say Chan, has released the first episode of its new podcast, ConnectED: Tech for All. Community partners on the podcast include the Housing Authority of Champaign County, Champaign-Urbana Public Health District, Project Success of Vermilion County, and Cunningham Township Supervisor’s Office.

Community Data Clinic podcast logo