Computational frameworks for metabolic modeling from snRNA-seq in GBM (De Temmerman et al., 2026)

2026-03-22

Paper

De Temmerman M, Vandemoortele B, Vermeirssen V. “Comparison of different computational frameworks for metabolic modeling from single-cell transcriptomics data in glioblastoma.” bioRxiv (2026). DOI: https://doi.org/10.64898/2026.02.28.706749

Background

The problem

Metabolic reprogramming is a hallmark of glioblastoma (GBM), but how individual cell populations — malignant states, immune cells, stromal populations — each contribute to the metabolic heterogeneity within a tumor remains poorly defined. Bulk metabolomics cannot resolve these contributions, and single-cell metabolomics is still technically limited by low throughput, limited metabolite coverage, and the inability to amplify metabolites. Computational inference from single-cell transcriptomics offers a powerful alternative, but no systematic comparison of different inference methods applied to the same GBM dataset existed.

Study design

The authors applied three complementary computational methods to snRNA-seq data from 6 GBM patient samples (2 per Wang multiomic subtype: nmf1/PN-like, nmf2/MES-like, nmf3/CL-like), all mapped against the Human1 genome-scale metabolic model (GEM) as a unified annotation reference:

Method	What it captures	Resolution
Metabolic pathway activity scoring	Coordinated up/downregulation of pathway genes	Cell-type level
SCENIC GRN inference	TF-driven regulatory programs controlling metabolic enzymes	Single-cell level
scFEA flux prediction	Predicted reaction fluxes from a reduced GEM	Single-cell level

Core finding

Across all three methods, tumor-associated macrophages (TAMs) emerge as the metabolically dominant cell population in the GBM TME. TAMs in MES-like tumors show coordinated lipid metabolism driven by five recurrent transcription factors, consistent nucleotide biosynthesis flux, and glutamate-to-glutamine conversion that potentially supports malignant cells.

Methodology details

1. Metabolic pathway activity scoring

Pathway-level analysis adapted from Locasale (Xiao et al., 2019). Metabolic pathway gene sets are derived from Human1 GEM. Expression of enzyme-coding genes is aggregated per cell type, with an inverse-frequency weighting scheme so multi-pathway genes don’t dominate. Statistical significance is assessed by 5000-fold permutation testing against random gene sets of matched size.

Key results:

Protein degradation, drug metabolism, protein assembly, and even-chain fatty acid desaturation are consistently top-ranking pathways across patients
TAMs and monocytes drive protein degradation/drug metabolism; oligodendrocytes drive fatty acid desaturation
MES-like samples show TAM-enriched fatty acid activation and metabolism
Most malignant cell states show limited pathway activity, except MES-like cells in protein modification and ROS detoxification

2. SCENIC-based metabolic GRN inference

SCENIC (Aibar et al., 2017) reconstructs regulons (TFs + predicted target genes) and quantifies their activity per cell via AUCell. The authors ran SCENIC 10$\times$ per sample (GRNBoost2 + cisTarget + AUCell), keeping consensus edges present in $\ge 80\%$ of runs. Regulon target genes were then overlapped with Human1 metabolic pathway gene sets to build tripartite networks (TF $\rightarrow$ target gene $\rightarrow$ pathway). A connection specificity index (CSI) adapted from Bass et al. was used to identify regulon modules with similar metabolic pathway regulation profiles.

Key results:

Most regulons (54.3%) are sample-specific, reinforcing the need for per-patient analysis
A conserved core of 30 regulons spans both immune and malignant compartments across nearly all samples
Five TAM-specific TFs (CEBPD, FOSL2, HIF1A, IRF5, PRDM1) converge on a shared lipid-regulatory program in MES-like samples — regulating sphingolipid, glycerophospholipid, and arachidonic acid metabolism
E2F family members drive nucleotide biosynthesis in malignant cells

3. scFEA single-cell flux prediction

scFEA (Alghamdi et al., 2021) uses a graph neural network to predict cell-wise metabolic flux distributions across 168 reaction modules derived from a reduced GEM. The tool integrates single-cell expression with stoichiometric constraints from flux balance analysis. TAM-specific modules were identified when Cohen’s $d$ > 0.8 and $>50\%$ of TAMs showed positive flux.

Key results:

TAMs form distinct clusters in flux-based UMAP space across all samples
Purine and pyrimidine synthesis modules show the strongest and most consistent TAM-specific flux (AICAR $\rightarrow$ IMP; UMP $\rightarrow$ $\beta$-alanine; dTMP $\rightarrow$ succinyl-CoA)
Glutamate $\rightarrow$ glutamine conversion is a recurrent TAM-specific flux across all samples (the “metabolic tug-of-war” with tumor cells)
Fatty acid $\rightarrow$ acetyl-CoA flux is TAM-specific only in nmf3 (CL-like) samples — absent in nmf2 MES-like samples despite strong transcriptional signals there, revealing method-specific sensitivity differences

Cross-method comparison

The three methods span a hierarchy of increasing granularity:

Pathway activity (which pathways are upregulated?) → GRN inference (which TFs drive those pathways?) → Flux prediction (which reactions carry flux?)

Where all three converge — especially on TAM metabolic prominence — the findings are likely robust. Where they diverge (e.g., fatty acid metabolism in MES-like TAMs visible transcriptionally but not in flux), it reflects methodological differences in how each approach captures metabolic state from transcript data.

Pseudo data: input → method → output

Method 1: Metabolic pathway activity scoring

Input:

Cell	Gene A (glycolysis)	Gene B (glycolysis)	Gene C (lipid met.)	Gene D (lipid met.)	Cell type
Cell 1	2.1	3.4	0.5	0.2	TAM
Cell 2	1.8	2.9	0.3	0.1	TAM
Cell 3	0.4	0.6	4.1	3.2	Oligo
Cell 4	0.3	0.5	3.8	2.9	Oligo
Cell 5	1.0	1.2	0.8	0.7	MES-like

+ Human1 GEM pathway-gene mapping (e.g., Glycolysis → {Gene_A, Gene_B}; Lipid metabolism → {Gene_C, Gene_D})

Process: For each cell type, calculate weighted mean expression of pathway genes (weight = 1/number_of_pathways_per_gene). Compare against 5000 permuted random gene sets of same size to get empirical $p$-value.

Output:

Cell type	Pathway	Activity Score	$p$-value	Rank
TAM	Glycolysis	0.82	0.001	1
TAM	Lipid metabolism	0.15	0.340	5
Oligo	Lipid metabolism	0.91	<0.001	1
Oligo	Glycolysis	0.11	0.450	6
MES-like	Glycolysis	0.35	0.080	3

Advantages:

Simple, interpretable, and broadly applicable
Directly maps to biological pathway knowledge
Works well for identifying “which cell types are metabolically dominant in which pathways”
Computationally lightweight

Limitations

Some caveats to note for this first method:

Cell-type-level resolution only (no single-cell granularity)
Ignores network structure and reaction connectivity
Cannot identify regulatory mechanisms behind pathway changes

Method 2: SCENIC GRN inference + metabolic overlap

Input:

Cell	TF SPI1	TF HIF1A	TF E2F2	Enzyme FASN	Enzyme HK2	Enzyme IMPDH1	Cell type
Cell 1	4.2	3.1	0.5	3.8	2.9	0.4	TAM
Cell 2	3.9	2.8	0.6	3.5	2.7	0.3	TAM
Cell 3	0.3	0.4	3.8	0.2	1.1	3.6	NPC-like
Cell 4	0.2	1.9	2.7	0.9	1.8	2.4	MES-like

+ TF binding motif databases (cisTarget) + Human1 pathway-gene mapping

Process:

GRNBoost2 infers co-expression modules (TF → candidate target genes)
cisTarget prunes to targets with matching TF binding motifs
AUCell scores regulon activity per cell
Overlap regulon target genes with Human1 metabolic pathway gene sets → tripartite network (TF $\rightarrow$ target gene $\rightarrow$ pathway)
Compute CSI between regulon pairs to find modules with similar metabolic regulation profiles

Output:

Regulon activity scores (per cell):

Cell	SPI1 regulon	HIF1A regulon	E2F2 regulon
Cell 1 (TAM)	0.89	0.72	0.08
Cell 2 (TAM)	0.85	0.68	0.10
Cell 3 (NPC)	0.05	0.12	0.81
Cell 4 (MES)	0.04	0.45	0.62

Regulon-pathway overlap matrix:

Regulon (TF)	Lipid metabolism	Purine synthesis	Glycolysis
SPI1	12 genes	3 genes	5 genes
HIF1A	8 genes	2 genes	9 genes
E2F2	1 gene	11 genes	2 genes

Advantages:

Reveals the regulatory logic behind metabolic states (which TFs drive which pathways)
Single-cell resolution for regulon activity
Can identify conserved vs. patient-specific regulatory programs
CSI analysis groups TFs with similar metabolic-regulatory strategies

Limitations

Some caveats to note for the GRN approach:

Regulon composition is sample-specific (hard to directly compare regulon content across patients)
Only examines transcriptional regulation — misses post-transcriptional, allosteric, and metabolite-level control
Does not account for pathway connectivity or stoichiometric constraints

Method 3: scFEA flux prediction

Input:

Cell	Enzyme HK2	Enzyme PFK	Enzyme GLS	Enzyme IMPDH1	…
Cell 1 (TAM)	2.9	2.5	3.4	0.4	…
Cell 2 (TAM)	2.7	2.3	3.1	0.3	…
Cell 3 (Oligo)	1.1	0.8	0.5	0.2	…
Cell 4 (MES)	1.8	1.5	1.2	2.1	…

+ scFEA metabolic module definitions (168 modules) + stoichiometric constraint matrix (cmMat_c70_m168.csv)

Process: A graph neural network maps gene expression → predicted flux for each of the 168 reaction modules per cell, subject to metabolite balance constraints (flux balance analysis). TAM-specificity is assessed via Cohen’s $d$ (TAM vs. non-TAM) and proportion of TAMs with positive flux.

Output:

Cell	Glycolysis $\rightarrow$ Pyruvate	Glutamate $\rightarrow$ Glutamine	AICAR $\rightarrow$ IMP	FA $\rightarrow$ Acetyl-CoA
Cell 1 (TAM)	0.42	0.85	0.71	0.15
Cell 2 (TAM)	0.38	0.79	0.68	0.12
Cell 3 (Oligo)	0.22	0.10	0.05	0.08
Cell 4 (MES)	0.31	0.15	0.45	0.03

Module	Cohen’s $d$ (TAM)	TAM positive (%)	TAM specific?
Glutamate→Glutamine	1.42	88%	✓
AICAR→IMP	1.15	76%	✓
FA→Acetyl-CoA	0.35	32%	✗

Advantages:

Incorporates stoichiometric/network constraints — not just expression levels
Sub-pathway, reaction-level granularity (168 modules)
Single-cell flux profiles enable metabolic UMAP clustering
Captures metabolic dependencies invisible to transcript-level analysis alone

Limitations

Some caveats to note for flux prediction:

Depends on a reduced/linearized GEM — cannot capture all metabolic reactions (e.g., fatty acid oxidation in MES-TAMs missed)
Computational cost is higher than pathway scoring
Module definitions may not align with standard pathway annotations (requires additional mapping)
Flux predictions are inferred, not measured — validation against metabolomics is essential

Extending to spatial context

Why spatial matters for metabolic inference

All three methods in this study operate on dissociated single-cell/nucleus data and therefore lose the spatial organization of the tumor. In GBM, spatial context is critical: metabolic states are shaped by gradients of oxygen, nutrients, and signaling molecules that radiate from blood vessels, necrotic cores, and the infiltrative margin. A TAM adjacent to the necrotic core likely has a very different metabolic program from a TAM at the leading edge, even if both are transcriptionally classified as “TAM” in a dissociated dataset. Without spatial information, these metabolic subtypes are averaged together or, at best, separated only when they differ enough in gene expression to form distinct clusters.

Spatial transcriptomics technologies — Visium, MERFISH, Slide-seq, and 10X Xenium — now provide the means to anchor transcriptional profiles to tissue coordinates. This opens the possibility of spatially resolved metabolic inference: applying the same three computational frameworks (pathway activity, GRN, flux prediction) while preserving the spatial relationships between cells and their microenvironment. The Human1 GEM annotation framework used in this study would transfer directly to spatial data, providing a unified metabolic reference regardless of the measurement platform.

Practical approaches

Spatial pathway activity scoring is the most immediately accessible extension. For spot-based platforms like Visium (each spot covers ~5–10 cells), one can compute weighted pathway scores per spot using the same Human1 gene sets and then overlay these onto the tissue image. This would reveal, for example, whether lipid metabolism is concentrated around vascular niches or whether glycolysis peaks in hypoxic regions surrounding necrosis. For single-molecule platforms (MERFISH, Xenium) that achieve true single-cell resolution, pathway scoring could directly identify spatial clusters of metabolically similar cells — but the limited gene panels (hundreds to a few thousand genes) would require careful selection of metabolic gene targets during panel design.

Spatial GRN inference is more challenging but tractable. Tools like SpatialDM and spatially informed versions of SCENIC (e.g., based on SCENIC+) can incorporate spatial proximity to identify spatially coherent regulon modules. One could ask: do the five TAM-specific TFs (CEBPD, FOSL2, HIF1A, IRF5, PRDM1) that converge on lipid metabolism co-activate in spatially localized TAM niches? If so, this would transform the finding from “TAMs regulate lipid metabolism” to “TAMs in region X regulate lipid metabolism in concert with neighboring MES-like cells.” Ligand–receptor analysis tools (CellChat, COMMOT for spatial data) could then connect the metabolic GRN findings to intercellular communication axes, testing whether the glutamate-glutamine “metabolic tug-of-war” detected by scFEA actually occurs between spatially adjacent TAMs and tumor cells.

Spatial flux modeling represents the frontier. The core challenge is that scFEA and related tools assume cell autonomy — each cell’s flux distribution is predicted independently. In reality, metabolite exchange between neighboring cells is central to tumor metabolism: TAMs export glutamine that tumor cells import; tumor cells secrete lactate that rewires macrophage polarization. A spatially informed flux model would need to impose inter-cellular metabolite balance constraints — effectively treating the tissue as a multi-compartment system where metabolites flow between adjacent spots or cells. This is conceptually related to community-level FBA methods (e.g., SteadyCom, OptCom), which model metabolic interactions between microbial species, but would need substantial adaptation for mammalian tissue geometry. Early steps could include computing pairwise metabolite exchange scores between spatially adjacent cell types and testing whether scFEA-predicted export fluxes in one cell type correlate with import fluxes in neighbors.

What spatial resolution would add to this study’s conclusions

The most impactful spatial extension of this study’s findings would be to test whether the TAM metabolic states identified computationally correspond to spatially organized niches. The MES-like subtype’s lipid-regulatory TAM program might localize to myelin-rich border zones where macrophages phagocytose debris, while nucleotide biosynthesis-active TAMs might cluster near proliferative tumor zones. If these spatial predictions hold, the three-method framework used here — activity → regulation → flux — could be applied as a spatial pipeline, moving from “what metabolic programs exist” to “where do they operate and with whom do they interact.” This would substantially strengthen the translational relevance of these computational approaches and directly inform spatially targeted therapeutic strategies.