Abstract
Systems biology and bioinformatics are now major fields for productive research. DNA microarrays and other array technologies and genome sequencing have advanced to the point that it is now possible to monitor gene expression on a genomic scale. Gene expression analysis is discussed and some important clustering techniques are considered. The patterns identified in the data suggest similarities in the gene behavior, which provides useful information for the gene functionalities. We discuss measures for investigating the homogeneity of gene expression data in order to optimize the clustering process. We contribute to the knowledge of functional roles and regulation of E. coli genes by proposing a classification of these genes based on consistently correlated genes in expression data and similarities of gene expression patterns. A new visualization tool for targeted projection pursuit and dimensionality reduction of gene expression data is demonstrated.
Similar content being viewed by others
References
J. L. DeRisi, V. R. Iyer, and P. O. Brown, Science 278, 680 (1997).
M. B. Eisen, P. T. Spellman, P. O. Brown, and D. Botstein, Proc. Natl. Acad. Sci. USA 95, 14863 (1998).
J. Khan et al., Nature Medicine 7, 673 (2001).
T. R. Golub et al., Science 286, 531 (1999).
P. Baldi and G. W. Hatfield, DNA Microarrays and Gene Expression (CUP, Cambridge, 2003).
D. Stekel, Microarray Bioinformatics (CUP, Cambridge, 2003).
M. Dunham, Data Mining. Introductory and Advanced Topics (Prentice Hall, New Jersey, 2003).
D. Jiang, C. Tang, and A. Zhang, IEEE Trans. Knowl. Data Eng. 16, 1370 (2004).
A. P. Dempster, N. M. Laird, and D. B. Rubin, J. R. Stat. Soc., Ser. B 39, 1 (1977).
L. Kaufman and P. J. Rousseeuw, Finding Groups in Data (Wiley, New York, 1990).
F. De Smet et al., Bioinformatics 18, 735 (2002).
C. Fraley and A. E. Raftery, Comput. J. 41, 578 (1998).
A. F. Famili, G. Liu, and Z. Liu, Bioinformatics 20, 1535 (2004).
S. Datta and S. Datta, Bioinformatics 19, 459 (2003).
N. Bolshakova, F. Azuaje, and P. Cunningham, Bioinformatics 21, 2546 (2005).
F. Marincs, I. W. Manfield, J. A, Stead, et al., Biochem. J. 396, 227 (2006).
M. Angelova, Bulg. J. Phys. 33(s1), 876 (2006).
R. M. Ewing and J. M. Cherry, Bioinformatics 17, 658 (2001).
E.-K. Lee, D. Cook, S. Klinke, and T. Lumley, J. Comput. Graph. Stat. 14, 831 (2005).
J. Misra et al., Genome Res. 12, 1112 (2002).
J. H. Friedman and J. W. Tukey, IEEE Trans. Comput. C-23, 881 (1974).
D. Asimov, SIAM J. Sci. Stat. Comput. 6, 128 (1985).
J. Faith, R. Mintram, and M. Angelova, Bioinformatics 22, 2667 (2006).
J. Faith and M. Brockway, J. Integrat. Biol. (in press).
Author information
Authors and Affiliations
Corresponding author
Additional information
The text was submitted by the authors in English.
Rights and permissions
About this article
Cite this article
Angelova, M., Myers, C. & Faith, J. Classification of genes based on gene expression analysis. Phys. Atom. Nuclei 71, 780–787 (2008). https://doi.org/10.1134/S1063778808050025
Received:
Published:
Issue Date:
DOI: https://doi.org/10.1134/S1063778808050025