Novel AI Tool Unveiled for Analyzing Complex Bio Data

Utah State University

LOGAN, UTAH, USA -- As humans, our eyes take in two-dimensional images our brains convert to three-dimensional experiences. This ability enables us to be aware of our position in space, judge distances, possess depth perception, and visually examine and enjoy all manner of objects and happenings.

But trying to envision sub-visible structures and high-dimensional processes our human-engineered scopes can't capture is a challenge for data scientists and visualization experts, who turn to machine learning and AI tools to amplify visual exploration.

"Biological processes are an example of complex, high-dimensional data," says Kevin Moon, director of USU's Data Science and Artificial Intelligence (DSAI) Center and associate professor in the Department of Mathematics and Statistics . "One of the datasets we're using to test our AI tools, for example, is clinical data measured from multiple sclerosis patients. These datasets include hundreds of thousands of data points on disease progression at the cellular level, along with treatments and clinical outcomes."

Moon is corresponding author on the paper, "Gaining Biological Insights through Supervised Data Visualization," posted online June 30 in Nature Computational Science. The paper was published in collaboration with lead author and USU alum Jake Rhodes (PhD'22, Statistics), assistant professor at Brigham Young University, and Moon's USU colleagues Adele Cutler, professor emerita, Department of Mathematics and Statistics and Anhong Zhou, professor, Department of Biological and Chemical Engineering ; along with USU alum Wei Zhang (PhD'21, Biological Engineering), researcher at the University of Utah.

The team, whose research is supported by the National Institutes of Health and the IVADO Visiting Scholar Program , includes additional national and international collaborators. *

"In this paper, we introduce RF-PHATE, which is an acronym for Random Forest-Potential of Heat-diffusion for Affinity-based Trajectory Imbedding," Moon says. "That's a mouthful, but it's a supervised data visualization method that enables us to explore relevant data relationships in multi-dimensional datasets."

To understand this, he says, it helps to explore the capabilities of previously developed unsupervised and supervised data visualization methods.

"Commonly used unsupervised methods, including PHATE, t-SNE and UMAP, and existing supervised methods help us visualize the structure of big datasets," Moon says. "But each has some weaknesses. Some tend to over-emphasize differences between groups of data, and fail to take into account how those groups relate to each other. RF-PHATE does a much better job of preserving the structure of how they relate to each other."

Demonstrating the model's capabilities in the paper, the team documents how RF-PHATE provides evidence of a previously suspected multiple sclerosis sub-type.

"Identifying sub-types is crucial, because MS affects each patient differently, and knowing the specific type guides treatment decisions," Moon says.

Additional datasets used to investigate the RF-PHATE model included COVID-19 patient plasma data and antioxidant-treated lung cancer cell data, but Moon notes the model is not limited to biological data.

"RF-PHATE can be applied to many other disciplines, and can also be used to develop more interpretable AI models, as well as to analyze the models themselves," he says. "This is still a very active area of research for our group."

Moon says his group supports AI for Science – an international movement that encourages the use of artificial intelligence and machine learning to accelerate research, analyze massive datasets and simulate complex systems.

"Through interdisciplinary collaborations, we can develop and use AI tools to analyze scientific data more effectively and foster discovery," he says.

*Collaborating institutions, in addition to Utah State University, on the June 30, 2026 Nature Computational Science paper include Brigham Young University, Université de Montréal, Mila-Québec AI Institute, Centre Hospitalier de l'Université de Montréal, University of California, San Francisco; University of Utah, Charles LeMoyne Hospital, University of Lausanne and McGill University.

/Public Release. This material from the originating organization/author(s) might be of the point-in-time nature, and edited for clarity, style and length. Mirage.News does not take institutional positions or sides, and all views, positions, and conclusions expressed herein are solely those of the author(s).View in full here.