What question did this study set out to answer?

This research aims to develop a robust method for constructing gene co-expression networks (GCNs) from public RNA-seq data, overcoming limitations of traditional methods.

May 3, 2026

Constructing gene co-functional and co-regulatory networks from public transcriptomes using condition-specific ensemble co-expression.

Key Points

This research aims to develop a robust method for constructing gene co-expression networks (GCNs) from public RNA-seq data, overcoming limitations of traditional methods.
TEA-GCN method utilizes unsupervised transcriptomic dataset partitioning for GCN construction.
Benchmarking performed on over 450,000 RNA-seq samples across 12 species to validate performance.
Natural language processing is employed to identify biologically-relevant dataset partitions.
TEA-GCN demonstrates improved prediction of gene functions compared to existing methods (p<0.001).
High co-expression partitions reveal condition-specific gene relationships, enhancing explainability in GCNs.
TEA-GCNs show greater conservation across species, suitable for multi-species comparative studies.

Abstract

Gene co-expression networks (GCNs) can reveal useful gene co-functional and co-regulatory relationships. However, current GCN construction methodologies are sensitive to batch effects and sample composition, limiting their performance in generating GCNs from public RNA-seq samples abundant for many species. Here, we report the development of TEA-GCN (two-tier ensemble aggregation-GCN; https://github.com/pengkenlim/TEA-GCN), a GCN construction method that leverages unsupervised transcriptomic dataset partitioning and multi-metric co-expression scoring to derive ensemble gene co-expression. Benchmarking over 450,000 public RNA-seq samples across 12 species, TEA-GCN outperforms the state-of-the-art in predicting gene functions and inferring gene regulatory networks. Through the use of natural language processing, we also show that the biologically-relevant dataset partitions with high co-expression can identify tissue-/condition-specific co-expression in TEA-GCN, providing high level of explainability. Furthermore, we show that TEA-GCNs exhibit enhanced conservation across species, making them suitable for multi-species comparative studies.

Bookmark

Constructing gene co-functional and co-regulatory networks from public transcriptomes using condition-specific ensemble co-expression.

Key Points

Abstract

Cite This Study