Primary sclerosing cholangitis (PSC) is an immune-mediated cholestatic liver disease. Its molecular etiology remains poorly defined, hindering the development of mechanism-based diagnostics and therapies. Therefore, this study aimed to identify key molecular drivers and causal biomarkers of PSC by integrating transcriptomics, machine learning, and genetic causal inference.
Methods
We deployed an integrated computational framework combining transcriptomics, network biology, machine learning, and genetic causal inference. Peripheral blood transcriptomes from PSC patients and controls were analyzed to identify disease-associated modules. Candidate genes were refined via protein-protein interaction networks and a multi-algorithm machine learning screen. Causal inference was performed using two-sample Mendelian randomization, integrating plasma protein quantitative trait loci with PSC genome-wide association study summary statistics.
Results
Transcriptomic analysis revealed a PSC-associated module enriched in ribosome biogenesis and protein homeostasis pathways. A machine learning-optimized nine-gene signature (including PTMA, SUMO1, Shwachman-Bodian-Diamond syndrome (SBDS), RPL7, EIF1AX, ANP32A, PCNA, FAM98A, and MPHOSPH6) achieved high diagnostic accuracy (mean AUC = 0.908) and was consistently downregulated in PSC. This signature was linked to a remodeled immune microenvironment characterized by myeloid skewing and specific transcriptional-immune covariation patterns. Mendelian randomization identified SBDS as a putatively causal protective factor, where genetically instrumented higher plasma SBDS protein levels were robustly associated with a lower PSC risk (IVW OR = 0.525, 95% CI: 0.356–0.773, P = 0.001). Sensitivity analyses supported the validity of the Mendelian randomization assumptions.
Conclusions
Our integrated multi-omics and machine learning framework makes two key, interconnected contributions. First, it establishes a robust nine-gene expression signature with high accuracy for PSC diagnosis and stratification, offering a readily accessible molecular tool. Second, it profoundly advances our mechanistic understanding by establishing disrupted ribosome homeostasis as a causal pathway in PSC, specifically nominating plasma SBDS protein as a key protective factor and a high-priority therapeutic target through genetic causal inference. This work therefore not only provides a powerful biomarker panel for clinical use but also moves beyond genetic associations to reveal a druggable, causal mechanism. The strategy outlined here provides a generalizable blueprint for translating complex disease genetics into causal biomarkers and mechanistic therapeutic hypotheses.
Full text
https://www.xiahepublishing.com/2310-8819/JCTH-2025-00676
The study was recently published in the Journal of Clinical and Translational Hepatology .
The Journal of Clinical and Translational Hepatology (JCTH) is owned by the Second Affiliated Hospital of Chongqing Medical University and published by XIA & HE Publishing Inc. JCTH publishes high quality, peer reviewed studies in the translational and clinical human health sciences of liver diseases. JCTH has established high standards for publication of original research, which are characterized by a study's novelty, quality, and ethical conduct in the scientific process as well as in the communication of the research findings. Each issue includes articles by leading authorities on topics in hepatology that are germane to the most current challenges in the field. Special features include reports on the latest advances in drug development and technology that are relevant to liver diseases. Regular features of JCTH also include editorials, correspondences and invited commentaries on rapidly progressing areas in hepatology. All articles published by JCTH, both solicited and unsolicited, must pass our rigorous peer review process.