School of Information Sciences

Jiang and Mishra to present natural language processing research at COLING16

Doctoral students Ming Jiang and Shubhanshu Mishra will present research papers at the 26th International Conference on Computational Linguistics (COLING), which will be held December 11-16 in Osaka, Japan. The COLING conference, held every two years, is one of the top international conferences in the field of natural language processing and computational linguistics, which covers research topics such as question answering, text summarization, information extraction, discourse structure, and more. 

Jiang will present a paper coauthored with Assistant Professor Jana Diesner titled, "Says Who...? Identification of Expert versus Layman Critics’ Reviews of Documentary Films."

Abstract: We extend classic review mining work by building a binary classifier that predicts whether a review of a documentary film was written by an expert or a layman with 90.70% accuracy (F1 score), and compare the characteristics of the predicted classes. A variety of standard lexical and syntactic features was used for this supervised learning task. Our results suggest that experts write comparatively lengthier and more detailed reviews that feature more complex grammar and a higher diversity in their vocabulary. Layman reviews are more subjective and contextualized in peoples’ everyday lives. Our error analysis shows that laymen are about twice as likely to be mistaken as experts than vice versa. We argue that the type of author might be a useful new feature for improving the accuracy of predicting the rating, helpfulness and authenticity of reviews. Finally, the outcomes of this work might help researchers and practitioners in the field of impact assessment to gain a more fine-grained understanding of the perception of different types of media consumers and reviewers of a topic, genre or information product.

During the COLING16 workshop on noisy user-generated text (WNUT), Mishra will present a paper coauthored with Diesner titled, "Semi-supervised Named Entity Recognition in noisy-text." 

Abstract: Named entity recognition (NER) has played an immense role in improving information retrieval, text mining, and text based network construction. However, the most of the existing NER techniques are based on syntactically correct news corpus data, and hence don’t give good results on noisy data such as tweets because of issues like spelling errors, concept drifts, and few context words. In this paper, we describe our submission to the WNUT 2016 NER shared task, and also present an improvement over it using a semi-supervised approach. Our models are based on linear chain conditional random fields (CRFs), and use BIEOU NER chunking scheme, features based on word clusters and pre-trained distributed word representations; updated gazetteer features; global context predictions; and random feature dropout for up-sampling the training data. These approaches alleviate many issues related to NER on noisy data by allowing the meaning of new or rare tokens to be ingested into the system, while using existing training samples to improve the model. 

Diesner joined the iSchool faculty in 2012 and is a 2016 Dori J. Maynard Senior Fellow. Her research in social computing combines theories and methods from natural language processing, social network analysis, and machine learning. In her lab, she and her students develop and advance computational solutions that help people to measure and understand the interplay of information and socio-technical networks. They also bring these solutions into various application context, e.g. in the domain of impact assessment.

Updated on
Backto the news archive

Related News

Kemboi receives Knowledge Manager of the Year Award

PhD student Gladys Kemboi has been awarded the Knowledge Manager of the Year Award from CILIP, the UK's library and information association. This is an international award that recognizes an individual who has made a significant contribution and excellence in the discipline of knowledge management through their work and professionalism.

Gladys Kemboi

Christine Nguyen Awarded Julia C. Blixrud Scholarship 2026

The Association of Research Libraries (ARL) has awarded Christine Thuy Minh Nguyen the Julia C. Blixrud Scholarship to attend the 2026 ARL President’s Institute. Christine is a master of science in library and information science (LIS) student at the University of Illinois Urbana-Champaign specializing in digital archives and data stewardship. She currently serves as a graduate assistant in the Research Data Service Unit of the University of Illinois Library, where she has developed a strong commitment to inclusive user experience and accessible digital design by leading a project to innovate change in current technical workflows.

Christine Thuy Minh Nguyen

Koval Scholarship validates Mohammed's challenging academic journey

As a middle school student in Accra Newtown, Ghana, Fatihi Mohammed put his education on hold. Through renewed focus and efforts, the student has shown remarkable academic growth and is now working toward his MSLIS degree at the University of Illinois. Mohammed is receiving support for his studies through the Anna Mae Koval Scholarship Fund at the iSchool. 

Fatihi Mohammed

PhD student Meng Li wins iSchool T-shirt design contest

PhD student Meng Li's research focuses on neuro-symbolic AI, with an emphasis on using syntactic analysis and large language models (LLMs) to understand Python notebooks. This cutting-edge research keeps Li "super busy" for much of the term, but in August, she took a brief break from her work and shifted her focus to designing the winning entry for the iSchool T-shirt contest.

While the idea of the design "just popped into my mind," Li has been thinking about the contest for years.

Meng Li wears the T-shirt with her winning design. The shirt is dark blue, with a hand-sketched wave in white, while the figure and surf board are in Illini Orange.

Paper by He's lab honored at ICCV 2025 workshop

Professor Jingrui He's lab received an outstanding paper award at the Multi-Modal Reasoning for Agentic Intelligence Workshop, which was held during the International Conference on Computer Vision (ICCV 2025) last month in Honolulu, Hawaii. 

Jingrui He

School of Information Sciences

501 E. Daniel St.

MC-493

Champaign, IL

61820-6211

Voice: (217) 333-3280

Fax: (217) 244-3302

Email: ischool@illinois.edu

Back to top