Identifying targets of multiple co-regulating transcription factors from expression time-series by Bayesian model comparison

Michalis K Titsias, Antti Honkela, Neil D Lawrence, Magnus Rattray

Research output: Contribution to journalArticleScientificpeer-review

Abstract

Background
Complete transcriptional regulatory network inference is a huge challenge because of the complexity of the network and sparsity of available data. One approach to make it more manageable is to focus on the inference of context-specific networks involving a few interacting transcription factors (TFs) and all of their target genes.

Results
We present a computational framework for Bayesian statistical inference of target genes of multiple interacting TFs from high-throughput gene expression time-series data. We use ordinary differential equation models that describe transcription of target genes taking into account combinatorial regulation. The method consists of a training and a prediction phase. During the training phase we infer the unobserved TF protein concentrations on a subnetwork of approximately known regulatory structure. During the prediction phase we apply Bayesian model selection on a genome-wide scale and score all alternative regulatory structures for each target gene. We use our methodology to identify targets of five TFs regulating Drosophila melanogaster mesoderm development. We find that confident predicted links between TFs and targets are significantly enriched for supporting ChIP-chip binding events and annotated TF-gene interations. Our method statistically significantly outperforms existing alternatives.

Conclusions
Our results show that it is possible to infer regulatory links between multiple interacting TFs and their target genes even from a single relatively short time series and in presence of unmodelled confounders and unreliable prior knowledge on training network connectivity. Introducing data from several different experimental perturbations significantly increases the accuracy.
Original languageEnglish
Article number53
JournalBMC Systems Biology
Volume6
Number of pages21
ISSN1752-0509
DOIs
Publication statusPublished - 30 May 2012
MoE publication typeA1 Journal article-refereed

Fields of Science

  • 113 Computer and information sciences

Cite this

@article{2b0eb8ec90d34f6cbc1700976b50190b,
title = "Identifying targets of multiple co-regulating transcription factors from expression time-series by Bayesian model comparison",
abstract = "BackgroundComplete transcriptional regulatory network inference is a huge challenge because of the complexity of the network and sparsity of available data. One approach to make it more manageable is to focus on the inference of context-specific networks involving a few interacting transcription factors (TFs) and all of their target genes.ResultsWe present a computational framework for Bayesian statistical inference of target genes of multiple interacting TFs from high-throughput gene expression time-series data. We use ordinary differential equation models that describe transcription of target genes taking into account combinatorial regulation. The method consists of a training and a prediction phase. During the training phase we infer the unobserved TF protein concentrations on a subnetwork of approximately known regulatory structure. During the prediction phase we apply Bayesian model selection on a genome-wide scale and score all alternative regulatory structures for each target gene. We use our methodology to identify targets of five TFs regulating Drosophila melanogaster mesoderm development. We find that confident predicted links between TFs and targets are significantly enriched for supporting ChIP-chip binding events and annotated TF-gene interations. Our method statistically significantly outperforms existing alternatives.ConclusionsOur results show that it is possible to infer regulatory links between multiple interacting TFs and their target genes even from a single relatively short time series and in presence of unmodelled confounders and unreliable prior knowledge on training network connectivity. Introducing data from several different experimental perturbations significantly increases the accuracy.",
keywords = "113 Computer and information sciences",
author = "Titsias, {Michalis K} and Antti Honkela and Lawrence, {Neil D} and Magnus Rattray",
year = "2012",
month = "5",
day = "30",
doi = "10.1186/1752-0509-6-53",
language = "English",
volume = "6",
journal = "BMC Systems Biology",
issn = "1752-0509",
publisher = "BMC",

}

Identifying targets of multiple co-regulating transcription factors from expression time-series by Bayesian model comparison. / Titsias, Michalis K; Honkela, Antti; Lawrence, Neil D; Rattray, Magnus.

In: BMC Systems Biology, Vol. 6, 53, 30.05.2012.

Research output: Contribution to journalArticleScientificpeer-review

TY - JOUR

T1 - Identifying targets of multiple co-regulating transcription factors from expression time-series by Bayesian model comparison

AU - Titsias, Michalis K

AU - Honkela, Antti

AU - Lawrence, Neil D

AU - Rattray, Magnus

PY - 2012/5/30

Y1 - 2012/5/30

N2 - BackgroundComplete transcriptional regulatory network inference is a huge challenge because of the complexity of the network and sparsity of available data. One approach to make it more manageable is to focus on the inference of context-specific networks involving a few interacting transcription factors (TFs) and all of their target genes.ResultsWe present a computational framework for Bayesian statistical inference of target genes of multiple interacting TFs from high-throughput gene expression time-series data. We use ordinary differential equation models that describe transcription of target genes taking into account combinatorial regulation. The method consists of a training and a prediction phase. During the training phase we infer the unobserved TF protein concentrations on a subnetwork of approximately known regulatory structure. During the prediction phase we apply Bayesian model selection on a genome-wide scale and score all alternative regulatory structures for each target gene. We use our methodology to identify targets of five TFs regulating Drosophila melanogaster mesoderm development. We find that confident predicted links between TFs and targets are significantly enriched for supporting ChIP-chip binding events and annotated TF-gene interations. Our method statistically significantly outperforms existing alternatives.ConclusionsOur results show that it is possible to infer regulatory links between multiple interacting TFs and their target genes even from a single relatively short time series and in presence of unmodelled confounders and unreliable prior knowledge on training network connectivity. Introducing data from several different experimental perturbations significantly increases the accuracy.

AB - BackgroundComplete transcriptional regulatory network inference is a huge challenge because of the complexity of the network and sparsity of available data. One approach to make it more manageable is to focus on the inference of context-specific networks involving a few interacting transcription factors (TFs) and all of their target genes.ResultsWe present a computational framework for Bayesian statistical inference of target genes of multiple interacting TFs from high-throughput gene expression time-series data. We use ordinary differential equation models that describe transcription of target genes taking into account combinatorial regulation. The method consists of a training and a prediction phase. During the training phase we infer the unobserved TF protein concentrations on a subnetwork of approximately known regulatory structure. During the prediction phase we apply Bayesian model selection on a genome-wide scale and score all alternative regulatory structures for each target gene. We use our methodology to identify targets of five TFs regulating Drosophila melanogaster mesoderm development. We find that confident predicted links between TFs and targets are significantly enriched for supporting ChIP-chip binding events and annotated TF-gene interations. Our method statistically significantly outperforms existing alternatives.ConclusionsOur results show that it is possible to infer regulatory links between multiple interacting TFs and their target genes even from a single relatively short time series and in presence of unmodelled confounders and unreliable prior knowledge on training network connectivity. Introducing data from several different experimental perturbations significantly increases the accuracy.

KW - 113 Computer and information sciences

U2 - 10.1186/1752-0509-6-53

DO - 10.1186/1752-0509-6-53

M3 - Article

VL - 6

JO - BMC Systems Biology

JF - BMC Systems Biology

SN - 1752-0509

M1 - 53

ER -