Key points are not available for this paper at this time.
Flow cytometry (FC) is a powerful technology for rapid multivariate analysis and functional discrimination of cells. Current FC platforms generate large, high-dimensional datasets which pose a significant challenge for traditional manual bivariate analysis. Automated multivariate clustering, though highly desirable, is also stymied by the critical requirement of identifying rare populations that form rather small clusters, in addition to the computational challenges posed by the large size and dimensionality of the datasets. In this paper, we address these twin challenges by developing a two-stage scalable multivariate parametric clustering algorithm. In the first stage, we model the data as a mixture of Gaussians and use an iterative weighted sampling technique to estimate the mixture components successively in order of decreasing size. In the second stage, we apply a graph-based hierarchical merging technique to combine Gaussian components with significant overlaps into the final number of desired clusters. The resulting algorithm offers a reduction in complexity over conventional mixture modeling while simultaneously allowing for better detection of small populations. We demonstrate the effectiveness of our method both on simulated data and actual flow cytometry datasets.
Building similarity graph...
Analyzing shared references across papers
Loading...
Iftekhar Naim
Suprakash Datta
Gaurav Sharma
University of Rochester
York University
Building similarity graph...
Analyzing shared references across papers
Loading...
Naim et al. (Fri,) studied this question.
www.synapsesocial.com/papers/6a0da8839a2918c675a4f34b — DOI: https://doi.org/10.1109/icassp.2010.5495653