School of Information Sciences

An informatics approach helps better identify chemical combinations in consumer products

Catherine Blake, Professor

By using products such as soap, shampoo, body lotion, toothpaste and makeup, the average consumer may be exposed to dozens of chemicals each day. It's not easy, though, to know exactly what is in many consumer products or what potential risks they pose, either individually or in combination.

A doctoral student and a professor in the University of Illinois School of Information Sciences are using an informatics approach to help prioritize chemical combinations for further testing by determining the prevalence of individual ingredients and their most likely combinations in consumer products.

Doctoral student Henry Gabb and professor Catherine Blake published the results of the first phase of their work in Environmental Health Perspectives, a journal of the National Institute of Environmental Health Sciences, part of the National Institutes of Health.

People today are exposed to significantly higher levels of chemicals than in the past. The exposure comes from many sources, including consumer products.

"We are, in effect, test subjects in an uncontrolled biochemistry experiment. This has become an accepted, or perhaps ignored, trade-off of life in modern society," Gabb said.

In order to identify the chemicals present in consumer products, Gabb used a web-scraping program to gather product names, categories and ingredient lists from online retail sites such as Drugstore.com. The database he created includes nearly 39,000 products and more than 32,000 ingredient names.

Once he had information on the ingredients in consumer products, he had to solve the problem of chemical synonymy – the use of different names for the same substance.

"The same chemical can appear on multiple product labels under many different names. Unless you can resolve them to a unique chemical, you don't really know what you're counting," Gabb said. For example, according to the PubChem Compound database from the National Library of Medicine, wintergreen oil is another name for methyl salicylate, a suspected endocrine disruptor.

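The synonymy problem can be illustrated with a small sketch. This is not the researchers' code. It assumes a hand-written synonym table (`SYNONYMS`) and hypothetical helper names (`normalize`, `resolve`, `resolve_ingredients`); a real pipeline would more plausibly draw its synonyms from a curated source such as PubChem. The only mapping taken from the article is wintergreen oil to methyl salicylate; "Water" is included purely as an unrelated label entry.

```python
# Hypothetical synonym table. Only the wintergreen oil -> methyl salicylate
# mapping comes from the article; a real table would be far larger.
SYNONYMS = {
    "wintergreen oil": "methyl salicylate",
    "methyl salicylate": "methyl salicylate",
}


def normalize(name: str) -> str:
    """Lowercase and collapse whitespace so label variants compare equal."""
    return " ".join(name.lower().split())


def resolve(name: str) -> str:
    """Map a label name to a canonical name; unknown names pass through normalized."""
    key = normalize(name)
    return SYNONYMS.get(key, key)


def resolve_ingredients(label: list[str]) -> set[str]:
    """Resolve every name on a label and drop duplicates of the same chemical."""
    return {resolve(name) for name in label}


# Two label names for the same substance collapse to one canonical entry.
label = ["Wintergreen  Oil", "Methyl Salicylate", "Water"]
canonical = resolve_ingredients(label)
print(sorted(canonical))
```

Without the resolution step, this label would appear to contain three distinct ingredients instead of two, which is the counting problem Gabb describes.
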
Gabb and Blake targeted 55 potential endocrine-disrupting and asthma-associated chemicals from a prior study that used gas chromatography-mass spectrometry analysis to measure the levels of these chemicals in consumer products. They found 30 percent of the products in their database contained at least one of the 55 target chemicals, and 13 percent contained more than one.
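
The kind of counting behind these percentages can be sketched as follows. This is an illustration under stated assumptions, not the study's method or data: the product names, the ingredient sets and the placeholder targets `chem_a`, `chem_b` and `chem_c` are all invented, and the helper names (`prevalence`, `pair_counts`) are hypothetical.

```python
from collections import Counter
from itertools import combinations

# Placeholder target chemicals and a toy product database (all invented).
TARGETS = {"chem_a", "chem_b", "chem_c"}
PRODUCTS = {
    "shampoo_1": {"water", "chem_a", "chem_b"},
    "lotion_1": {"water", "chem_a"},
    "soap_1": {"water", "glycerin"},
    "toothpaste_1": {"chem_a", "chem_b", "chem_c"},
}


def prevalence(products: dict[str, set[str]], targets: set[str]) -> tuple[float, float]:
    """Return the share of products with at least one target and with more than one."""
    hits = [len(ingredients & targets) for ingredients in products.values()]
    total = len(products)
    at_least_one = sum(h >= 1 for h in hits) / total
    more_than_one = sum(h > 1 for h in hits) / total
    return at_least_one, more_than_one


def pair_counts(products: dict[str, set[str]], targets: set[str]) -> Counter:
    """Count how many products contain each pair of target chemicals together."""
    counts: Counter = Counter()
    for ingredients in products.values():
        present = sorted(ingredients & targets)  # sorted so each pair has one key
        counts.update(combinations(present, 2))
    return counts


at_least_one, more_than_one = prevalence(PRODUCTS, TARGETS)
pairs = pair_counts(PRODUCTS, TARGETS)
print(at_least_one, more_than_one)
print(pairs.most_common(1))
```

Ranking pairs by how many products contain both is one simple way to surface the "most likely combinations" for further testing; the ingredient names would need to be resolved to canonical chemicals first for the counts to be meaningful.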

The informatics approach allows the researchers to look at many more products and detect many more chemicals than the gas chromatography-mass spectrometry approach, which is limited by the time it takes to prepare samples and run the experiments, among other things. However, the informatics approach is limited to what is actually listed on product labels, which are not always complete. Gas chromatography-mass spectrometry can identify chemicals that are not listed on a product label or even part of the product formulation, such as "chemicals that leach from the product packaging, degradation products or other impurities," Gabb said. The researchers said the two approaches should be considered complementary.

The initial informatics analysis considered chemical combinations within the same product, but combined exposure also occurs when several products are used in a given timeframe.

"This work provides another piece of the environmental-exposure puzzle and, unlike our genetic material, we can easily change our product usage," Blake said. "The combination of genetic susceptibility and individualized cumulative exposure – not just to other chemicals in consumer products, but from other sources such as air quality – empowers people to make informed decisions about changing the factors that directly influence their health outcomes."

Gabb and Blake hope their informatics approach can help prioritize testing based on the likelihood of exposure. They have started the next phase of their research, in which they will expand the analysis from the 55 target chemicals of the first phase to thousands of chemicals.

They'll also study combinations of chemicals from multiple products based on actual consumer usage, rather than looking at products in isolation. They will use a dataset of consumer usage patterns that details which products are used and how often. The data can tell the researchers which chemicals and combinations of chemicals consumers are exposed to in a typical day or week.
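
A rough sketch of how usage data could be joined with ingredient data appears below. The researchers have not described their implementation, so everything here is an assumption: the usage log format, the product names, the placeholder target chemicals and the `daily_exposure` helper are hypothetical.

```python
# Placeholder target chemicals, product ingredients and usage log (all invented).
TARGET_CHEMICALS = {"chem_a", "chem_b", "chem_c"}
PRODUCT_INGREDIENTS = {
    "shampoo_1": {"water", "chem_a"},
    "lotion_1": {"water", "chem_b"},
    "soap_1": {"water", "glycerin"},
}
USAGE_LOG = {
    "monday": ["shampoo_1", "lotion_1"],
    "tuesday": ["soap_1"],
}


def daily_exposure(
    usage: dict[str, list[str]],
    products: dict[str, set[str]],
    targets: set[str],
) -> dict[str, set[str]]:
    """For each day, collect the target chemicals across every product used that day."""
    exposure: dict[str, set[str]] = {}
    for day, used in usage.items():
        found: set[str] = set()
        for name in used:
            # Products missing from the database contribute nothing.
            found |= products.get(name, set()) & targets
        exposure[day] = found
    return exposure


exposure_by_day = daily_exposure(USAGE_LOG, PRODUCT_INGREDIENTS, TARGET_CHEMICALS)
print(exposure_by_day)
```

In this toy data no single product contains both `chem_a` and `chem_b`, yet the Monday routine exposes the consumer to the pair. That is the kind of combination a product-by-product analysis would miss.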

Researchers can further prioritize which chemicals to study by also considering retention, or how the product is used. For example, shampoo and soap are rinsed off the body right away, while lotion is left on. Toothpaste and other products that come in contact with mucous membranes will likely result in more absorption of chemicals than a hair product.

Gabb and Blake's analysis also illustrates the difficulty consumers have in deciding which products to use or avoid. Manufacturers don't have to disclose the ingredients that produce fragrance and flavor in their products if those mixtures are considered proprietary. In such cases, the label would list "fragrance" or "flavor" rather than the specific ingredients. On the other hand, a label might list the chemicals that contribute to a fragrance, but not use the word "fragrance," leading a consumer to believe he or she is buying a fragrance-free product. This, in addition to chemical synonymy, makes a case for amending the Fair Packaging and Labeling Act to standardize ingredient nomenclature, at least for ingredients that are suspected of being harmful, Gabb said.

Gabb emphasized that the study examines the presence of potentially harmful chemicals (as determined by various authoritative sources like the EPA and NIH) in consumer products, but that it makes no value judgments regarding the safety of the chemicals themselves. His immediate goal is simply to help toxicologists better prioritize which chemicals and chemical combinations should be subjected to cumulative risk assessments.


