New computational tools to protect Homeland Security data

Jingrui He, Professor and MSIM Program Director

Associate Professor Jingrui He is developing computational tools to protect against leaks and/or unauthorized use of sensitive data held and distributed among Department of Homeland Security (DHS) agencies and other parties. Her project, "Privacy-Preserving Analytics for Non-IID Data," has been awarded a three-year, $651,927 grant from the DHS Center for Accelerating Operational Efficiency (CAOE).

Inherent risks arise from the unprecedented speed at which large amounts of data can be transferred to outside organizations, and such transfers have had negative consequences for DHS in the past.

"In 2019, a subcontractor working for CBP (DHS Customs and Border Protection) transferred copies of CBP's biometric data, such as traveler images, to its own company network and compromised approximately 184,000 traveler images from CBP's facial recognition pilot," He said. "This later led to a major privacy incident, as the subcontractor's network was subjected to a malicious cyberattack."

According to He, while the huge amount of collected data contains critical information that informs policy and decision making, the potential risks pertaining to sensitive information raise serious concerns regarding the use of such collected data. "It is of great importance to develop privacy-enhancing technologies to mitigate these risks while making effective use of the collected data," she said.

He's work is challenging because the datasets involved in her research are held by multiple parties and distributed in varying ways. She proposes a two-pronged approach to sharing information while protecting privacy.

One strategy involves generating synthetic data that mimics the actual data, and then sharing the synthetic information. "Our proposed techniques would guarantee that the parties receiving the synthetic data cannot use the synthetic data to recover the original data," she said.

The other method would enable predictive analytics to be performed across multiple parties via federated learning, in which artificial intelligence models are trained without the raw data ever leaving the party that holds it. This approach unlocks information for new artificial intelligence applications while preserving privacy, because individual parties do not have to share their data.

"The agencies holding the actual data will need to use their own data for analysis. But the central server responsible for creating the final predictive model orchestrating the efforts from all agencies will not have access to the actual data. Different agencies do not need to share their own data with each other either," He said.
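The arrangement He describes, where each agency analyzes its own data and a central server only combines the results, can be illustrated with a minimal sketch of federated averaging, a common federated learning scheme. This is not He's method, which the article does not detail. The linear model, the three simulated parties, the learning rate, and all function names below are assumptions made purely for illustration.

```python
import random


def local_update(weights, data, lr=0.05, epochs=5):
    """Run a few gradient-descent steps on one party's private data.

    Only the updated weights and the sample count are returned;
    the raw (x, y) records never leave this function.
    """
    w, b = weights
    n = len(data)
    for _ in range(epochs):
        grad_w = sum((w * x + b - y) * x for x, y in data) / n
        grad_b = sum((w * x + b - y) for x, y in data) / n
        w -= lr * grad_w
        b -= lr * grad_b
    return (w, b), n


def federated_average(updates):
    """Server step: average party weights, weighted by sample count."""
    total = sum(n for _, n in updates)
    w = sum(wt[0] * n for wt, n in updates) / total
    b = sum(wt[1] * n for wt, n in updates) / total
    return (w, b)


def make_party_data(n, x_low, x_high, rng):
    """Hypothetical private dataset following y = 2x + 1 plus noise."""
    xs = [rng.uniform(x_low, x_high) for _ in range(n)]
    return [(x, 2.0 * x + 1.0 + rng.gauss(0, 0.1)) for x in xs]


rng = random.Random(0)

# Three simulated parties whose inputs cover different ranges,
# so their data is not identically distributed (non-IID).
parties = [
    make_party_data(50, 0.0, 1.0, rng),
    make_party_data(80, 1.0, 2.0, rng),
    make_party_data(30, 2.0, 3.0, rng),
]

global_weights = (0.0, 0.0)
for _ in range(200):
    # Each party trains locally, starting from the current global model.
    updates = [local_update(global_weights, data) for data in parties]
    # The server sees only weights and counts, never the records.
    global_weights = federated_average(updates)

print("global model (slope, intercept):", global_weights)
```

In this sketch the server recovers a model close to the underlying relationship even though no party's records are pooled. Note that sharing model weights is not by itself a formal privacy guarantee; practical systems typically layer additional protections on top.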

She envisions that several DHS agencies, including the Transportation Security Administration, the Office of Intelligence and Analysis, and the Federal Emergency Management Agency, will make use of the new tools.

He's general research theme is to design, build, and test a suite of automated and semi-automated methods to explore, understand, characterize, and predict real-world data by means of statistical machine learning. She received her PhD in machine learning from Carnegie Mellon University.

