October 1, 1999

Clustering Gene Expression Patterns

Key Points

Key points are not available for this paper at this time.

Abstract

Recent advances in biotechnology allow researchers to measure expression levels for thousands of genes simultaneously, across different conditions and over time. Analysis of data produced by such experiments offers potential insight into gene function and regulatory mechanisms. A key step in the analysis of gene expression data is the detection of groups of genes that manifest similar expression patterns. The corresponding algorithmic problem is to cluster multicondition gene expression patterns. In this paper we describe a novel clustering algorithm that was developed for analysis of gene expression data. We define an appropriate stochastic error model on the input, and prove that under the conditions of the model, the algorithm recovers the cluster structure with high probability. The running time of the algorithm on an n-gene dataset is On2[log(n)c]. We also present a practical heuristic based on the same algorithmic ideas. The heuristic was implemented and its performance is demonstrated on simulated data and on real gene expression data, with very promising results.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Amir Ben‐Dor

Translational Genomics Research Institute

Ron Shamir

Tel Aviv University

Zohar Yakhini

Reichman University

Journals

Journal of Computational Biology

Actions

Institutions

University of Washington

Tel Aviv University

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Clustering Gene Expression Patterns

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study