skip to main content
10.1145/1854776.1854842acmconferencesArticle/Chapter ViewAbstractPublication PagesbcbConference Proceedingsconference-collections
research-article

Effects of cDNA microarray time-series data size on gene regulatory network inference accuracy

Published: 02 August 2010 Publication History

Abstract

A number of models and algorithms have been proposed in the past for gene regulatory network (GRN) inference; however, none of them address the effects of the size of the time-series microarray expression data in terms of number of time-points. In this paper, we study this problem by analyzing the behavior of two algorithms based on information theory models. These algorithms were implemented on different sizes of data generated by synthetic network generation tools. Experiments show that the performances of these algorithms reach a saturation point after a specific data size, thus giving the biologist an idea about what size of data will give the best inference accuracy. Also, the fact that the accuracy saturates after a specific number of time points (the saturation point being different for different algorithms) suggests that generating time-series data for a lot of time-points will not necessary improve the inference accuracy beyond a certain point. To understand this saturation, we found out that the information theoretic quantity, mutual information, tends to zero as the number of time points increase although the entropy in the network rises to unity. This illustrates the fact that mutual information (MI) might not be the best metric to use for GRN inference algorithms. To modify the MI metric we introduce a new method of computing time lags between any pair of genes and present the time lagged mutual information (TLMI) metric for reverse engineering of GRNs.

References

[1]
Hansen M H, Yu B. 2001. Model Selection and the Principle of Minimum Description Length. Journal of the American Statistical Association. Vol.96, No.454, 746--774.
[2]
Hecker M, Lambeck S, Toepfer S, Eugene van Someren, Reinhard Guthke. 2009. Gene regulatory network inference: Data integration in dynamic models-A review. Bio Systems. Vol.96, No.1, 86--103.
[3]
Kanehisa et al. 2008. KEGG for linking genomes to life and the environment. Nucleic Acids Res. 36, D480--D484.
[4]
Liang, S. 1998. Reveal, A general reverse engineering algorithm for inference of genetic network architectures. Pacific Symposium on Biocomputing. Vol.3, 18--29.
[5]
Marbach, D., Schaffter, T., Mattiussi, C. and Floreano, D. 2009. Generating Realistic In Silico Gene Networks for Performance Assessment of Reverse Engineering Methods. Journal of Computational Biology, Vol.16, No.2, 229--239.
[6]
Rissanen J. 2006. An introduction to the MDL principle. Helsinki Institute for Information Technology, Tampere and Helsinki Universities of Technology, Finland, and University of London, England.
[7]
Spellman et al. 1998. Comprehensive Identification of Cell Cycle-regulated Genes of the Yeast Saccharomyces cerevisiae by Microarray Hybridization. Molecular Biology of the Cell, Vol.9, 3273--3297.
[8]
Chaitankar V et.al. 2009. Gene Regulatory Network Inference Using Predictive Minimum Description Length Principle and Conditional Mutual Information Proceedings of International Joint Conference on Bioinformatics, Systems Biology and Intelligent Computing, 487--490.
[9]
Wentao Zhao, Serpedin E, Dougherty E R. 2006. Inferring gene regulatory networks from time series data using the minimum description length principle. Bioinformatics, Vol.22 No.17, 2219--2135.
[10]
Zou M, Conzen SD. 2005. A New Dynamic Bayesian Network (DBN) Approach For Identifying Gene Regulatory Networks From Time Course Microarray Data. Bioinformatics, Vol.21, No.1, 71--79.

Cited By

View all
  • (2013)Computational RegulomicsOMICS10.1201/b14289-11(225-244)Online publication date: 23-Apr-2013
  • (2012)Genome scale inference of transcriptional regulatory networks using mutual information on complex interactionsProceedings of the ACM Conference on Bioinformatics, Computational Biology and Biomedicine10.1145/2382936.2383051(643-648)Online publication date: 7-Oct-2012
  • (2012)sCoInProceedings of the 2012 IEEE 12th International Conference on Bioinformatics & Bioengineering (BIBE)10.1109/BIBE.2012.6399735(572-577)Online publication date: 11-Nov-2012

Index Terms

  1. Effects of cDNA microarray time-series data size on gene regulatory network inference accuracy

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image ACM Conferences
        BCB '10: Proceedings of the First ACM International Conference on Bioinformatics and Computational Biology
        August 2010
        705 pages
        ISBN:9781450304382
        DOI:10.1145/1854776
        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Sponsors

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 02 August 2010

        Permissions

        Request permissions for this article.

        Check for updates

        Author Tags

        1. gene regulatory networks
        2. information theory

        Qualifiers

        • Research-article

        Conference

        BCB'10
        Sponsor:

        Acceptance Rates

        Overall Acceptance Rate 254 of 885 submissions, 29%

        Upcoming Conference

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)0
        • Downloads (Last 6 weeks)0
        Reflects downloads up to 14 Sep 2024

        Other Metrics

        Citations

        Cited By

        View all
        • (2013)Computational RegulomicsOMICS10.1201/b14289-11(225-244)Online publication date: 23-Apr-2013
        • (2012)Genome scale inference of transcriptional regulatory networks using mutual information on complex interactionsProceedings of the ACM Conference on Bioinformatics, Computational Biology and Biomedicine10.1145/2382936.2383051(643-648)Online publication date: 7-Oct-2012
        • (2012)sCoInProceedings of the 2012 IEEE 12th International Conference on Bioinformatics & Bioengineering (BIBE)10.1109/BIBE.2012.6399735(572-577)Online publication date: 11-Nov-2012

        View Options

        Get Access

        Login options

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Media

        Figures

        Other

        Tables

        Share

        Share

        Share this Publication link

        Share on social media