School of Information Sciences

New computational tools to protect Homeland Security data

Jingrui He, Professor and MSIM Program Director

Associate Professor Jingrui He is developing computational tools to protect against leaks and/or unauthorized use of sensitive data held and distributed among Department of Homeland Security (DHS) agencies and other parties. Her project, "Privacy-Preserving Analytics for Non-IID Data," has been awarded a three-year, $651,927 grant from the DHS Center for Accelerating Operational Efficiency (CAOE).

Inherent risks arise from the unprecedented speed at which large amounts of data can be transferred to outside organizations, and such transfers have had negative consequences for DHS in the past.

"In 2019, a subcontractor working for CBP (DHS Customs and Border Protection) transferred copies of CBP's biometric data, such as traveler images, to its own company network and compromised approximately 184,000 traveler images from CBP's facial recognition pilot," He said. "This later led to a major privacy incident, as the subcontractor's network was subjected to a malicious cyberattack."

According to He, while the huge amount of collected data contains critical information that informs policy and decision making, the potential risks pertaining to sensitive information raise serious concerns regarding the use of such collected data. "It is of great importance to develop privacy-enhancing technologies to mitigate these risks while making effective use of the collected data," she said.

He's work is challenging because the datasets involved in her research are held by multiple parties and distributed in varying ways. She proposes a two-pronged approach to sharing information while providing privacy protection.

One strategy involves generating synthetic data that mimics the actual data, and then sharing the synthetic information. "Our proposed techniques would guarantee that the parties receiving the synthetic data cannot use the synthetic data to recover the original data," she said.

The other method would create predictive analytics that can be performed for multiple parties via federated learning, in which artificial intelligence models are trained jointly while each party's raw data stays where it is held. This approach unlocks information for new artificial intelligence applications while preserving privacy, because individual parties do not have to share their data.

"The agencies holding the actual data will need to use their own data for analysis. But the central server responsible for creating the final predictive model orchestrating the efforts from all agencies will not have access to the actual data. Different agencies do not need to share their own data with each other either," He said.
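The arrangement He describes, in which each agency computes on its own data and a central server only combines the results, can be illustrated with a small sketch. The code below is not the project's method; it assumes plain federated averaging on a toy linear model, and every name in it (`local_update`, `federated_average`, `train`, the three example parties) is hypothetical. The point to notice is that the server function receives only model parameters and sample counts, never the underlying records.

```python
from typing import List, Tuple

Params = Tuple[float, float]          # (slope, intercept) of the model y = w*x + b
Dataset = List[Tuple[float, float]]   # one party's local (x, y) pairs


def local_update(params: Params, data: Dataset,
                 lr: float = 0.05, steps: int = 20) -> Params:
    """Party step: a few gradient-descent steps on the party's own data."""
    w, b = params
    n = len(data)
    for _ in range(steps):
        grad_w = sum(2 * (w * x + b - y) * x for x, y in data) / n
        grad_b = sum(2 * (w * x + b - y) for x, y in data) / n
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b


def federated_average(updates: List[Tuple[Params, int]]) -> Params:
    """Server step: average the parameters, weighted by local sample count.

    The server sees only (parameters, count) pairs, not the raw data.
    """
    total = sum(n for _, n in updates)
    w = sum(p[0] * n for p, n in updates) / total
    b = sum(p[1] * n for p, n in updates) / total
    return w, b


def train(parties: List[Dataset], rounds: int = 200) -> Params:
    """Alternate local training and server-side averaging for several rounds."""
    global_params: Params = (0.0, 0.0)
    for _ in range(rounds):
        updates = [(local_update(global_params, d), len(d)) for d in parties]
        global_params = federated_average(updates)
    return global_params


# Hypothetical toy data: each party sees a different range of x (a simple
# stand-in for non-IID data), all generated from the same rule y = 2x + 1.
party_a = [(x / 10, 2 * (x / 10) + 1) for x in range(0, 10)]
party_b = [(x / 10, 2 * (x / 10) + 1) for x in range(10, 20)]
party_c = [(x / 10, 2 * (x / 10) + 1) for x in range(20, 30)]

w, b = train([party_a, party_b, party_c])
print(f"learned slope={w:.3f}, intercept={b:.3f}")
```

In this toy setting the shared model recovers a slope near 2 and an intercept near 1 even though no party sees the full range of inputs. A real deployment would need additional safeguards, since model updates themselves can leak information; the sketch does not address that.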

She envisions that several DHS agencies, including the Transportation Security Administration, the Office of Intelligence and Analysis, and the Federal Emergency Management Agency, will make use of the new tools.

He's general research theme is to design, build, and test a suite of automated and semi-automated methods to explore, understand, characterize, and predict real-world data by means of statistical machine learning. She received her PhD in machine learning from Carnegie Mellon University.
