[OPTICAL REVIEW Vol. 14, No. 2 (2007) 97-104]
© 2007 The Optical Society of Japan
Noise Analysis of Duplicated Data on Microarrays Using Mixture Distribution Modeling
Masaru TAKEYA, Takehiro MATSUDA1, Masao IWAMOTO2, Norimichi TSUMURA1, Toshiya NAKAGUCHI1, and Yoichi MIYAKE1
Division of Genome and Biodiversity Research, National Institute of Agrobiological Sciences, 2-1-2 Kannondai, Tsukuba, Ibaraki 305-8602, Japan
1Graduate School of Science and Technology, Chiba University, 1-33 Yayoi-cho, Inage-ku, Chiba 263-8522, Japan
2Division of Plant Sciences, National Institute of Agrobiological Sciences, 2-1-2 Kannondai, Tsukuba, Ibaraki 305-8602, Japan
(Received September 5, 2005; Accepted December 6, 2006)
We propose a technique for estimating gene expression values for duplicated data on cDNA microarrays. In the scatter plots, the distribution is constructed from a mixture of normal two-dimensional distributions, which represent fluctuations in gene expression values due to noise. An expectation-maximization (EM) algorithm is used for estimating the modeling parameters. The probability that duplicated data is shifted by noise is calculated using Bayesian estimation. Six data sets of rice cDNA microarray assays were used to test the proposed technique. Genes in the data sets were subjected to clustering based on probability of true value. Clustering successfully identified candidate genes regulated by circadian rhythms in rice.
Key words: cDNA microarray, mixture distribution model, duplicated data, circadian rhythms