New Model Unveils Tissue Cell Organization

Helmholtz Munich (Helmholtz Zentrum München Deutsches Forschungszentrum für Gesundheit und Umwelt (GmbH))

Missing Context in Single-Cell Data

Single-cell RNA sequencing has transformed biology by showing which genes are active in individual cells. However, this approach requires cells to be removed from their natural environment, erasing information about their position and neighbors. Spatial transcriptomics preserves this context but is technically more limited and harder to scale. Researchers have long lacked a way to study cell identity and tissue organization together.

AI Model Reveals Hidden Tissue Structures

Nicheformer overcomes this barrier by learning from both dissociated and spatial data. It can "transfer" spatial context back onto cells that were previously studied in isolation – essentially reconstructing how they fit into the bigger picture of a tissue. To make this possible, the research team created SpatialCorpus-110M, one of the largest curated resources of single-cell and spatial data to date. In their study published in Nature Methods, the model consistently outperformed existing approaches and showed that spatial patterns leave measurable traces in gene expression, even when cells are dissociated. Beyond performance, the researchers also explored interpretability, revealing that the model identifies biologically meaningful patterns in its internal layers – offering a new window into how AI learns from biology.

"With Nicheformer we can now transfer spatial information onto dissociated single-cell data at scale," says Alejandro Tejada-Lapuerta, PhD student at Helmholtz Munich and TUM and co-first author of the study together with Anna Schaar. "This opens up many possibilities to study tissue organization and cellular neighborhoods without additional experiments."

The study connects to the emerging idea of a "Virtual Cell", a computational representation of how cells behave and interact within their native environments. While this concept is gaining momentum across biology and AI, previous models have largely treated cells as isolated entities, without reasoning their spatial relationships. Nicheformer is the first foundation model to learn directly from spatial organization, offering a way to reconstruct how cells sense and influence their neighbors. Beyond introducing this new capability, the researchers also present an entire suite of spatial benchmarking tasks that challenge future models to capture tissue architecture and collective cellular behavior – an essential step toward biologically realistic AI systems.

Info Box: Single-Cell Analysis vs. Spatial Transcriptomics

Single-cell analysis: Measures the molecular profile (e.g., gene activity) of individual cells, but cells are studied outside their original tissue context.

Spatial transcriptomics: Measures gene activity directly in tissue slices, keeping the spatial arrangement of cells intact.

Nicheformer combines both approaches, projecting the spatial context back onto dissociated single-cell data.

Next Steps

"With Nicheformer we are taking the first steps toward building general-purpose AI models that represent cells in their natural context – the foundation of a Virtual Cell and Tissue model," says Prof. Fabian Theis, Director of the Computational Health Center at Helmholtz Munich and Professor at TUM. "Such models will transform how we study health and disease and could ultimately guide the development of new therapies."

In their next project, the team aims to develop a "tissue foundation model" that also learns the physical relationships between cells. Such a model could help analyze tumor microenvironments and other complex structures in the body with direct relevance for diseases such as cancer, diabetes, and chronic inflammation.

About the Researchers

Alejandro Tejada-Lapuerta is a PhD student at the Institute of Computational Biology at Helmholtz Munich and at the Technical University of Munich (TUM).

Prof. Fabian Theis is Director of the Computational Health Center and the Institute of Computational Biology at Helmholtz Munich, Head of Helmholtz AI, and Professor for Mathematical Modeling of Biological Systems at the Technical University of Munich (TUM).

About Helmholtz Munich

Helmholtz Munich is a leading biomedical research center. Its mission is to develop breakthrough solutions for better health in a rapidly changing world. Interdisciplinary research teams focus on environmentally triggered diseases, especially the therapy and prevention of diabetes, obesity, allergies, and chronic lung diseases. With the power of artificial intelligence and bioengineering, researchers accelerate the translation to patients. Helmholtz Munich has around 2,500 employees and is headquartered in Munich/Neuherberg. It is a member of the Helmholtz Association, with more than 43,000 employees and 18 research centers the largest scientific organization in Germany. More about Helmholtz Munich (Helmholtz Zentrum München Deutsches Forschungszentrum für Gesundheit und Umwelt GmbH): www.helmholtz-munich.de/en      

/Public Release. This material from the originating organization/author(s) might be of the point-in-time nature, and edited for clarity, style and length. Mirage.News does not take institutional positions or sides, and all views, positions, and conclusions expressed herein are solely those of the author(s).View in full here.