What question did this study set out to answer?

The study aims to evaluate the effectiveness of statistical filtering methods in sorting phytoplankton images based on library size and shape.

May 15, 2026Open Access

Statistical filtering to aid in the sorting of phytoplankton: The effects of image library size and phytoplankton shape

Key Points

The study aims to evaluate the effectiveness of statistical filtering methods in sorting phytoplankton images based on library size and shape.
Evaluated two filtering approaches: intrinsic (5-15 images) and compiled (30-80 images) for seven algal shapes.
Used FlowCam imaging flow cytometer for image processing and filtering analysis.
Compared recall and precision across different image library sizes and types of phytoplankton.
Recall was highest for larger libraries (>86% intrinsic, >94% compiled) but precision was lower (3-85% intrinsic, <1-8% compiled).
Precision peaked in small image libraries, particularly with the intrinsic method (>75% for most taxa).
Filtering performance was superior for solitary-celled taxa compared to small-celled colonial species.

Abstract

Abstract The demand for efficient image sorting methods has increased due to technological advancements that enable more intensive phytoplankton monitoring. Both statistical and machine learning algorithms can misidentify algal taxa in taxonomically diverse samples, in which phytoplankton morphology and image traits can vary. We evaluated the statistical filtering performance of the image processing software of an imaging flow cytometer (FlowCam) for two approaches to image library development; these were applied independently to seven commonly occurring algal shapes in mixed natural samples. The “intrinsic method” used a small selection of images (5–15 images of a target taxon) from the same sample being filtered (i.e., intrinsic), whereas the “compiled method” used a larger selection of images (30–80 images of a target taxon) compiled from multiple samples. Filter performance varied with the type of image library, image library size, and target taxon. The largest image libraries offered the highest recall (> 86% for intrinsic, > 94% for compiled) but lower precision (3–85% for intrinsic, 75% for most taxa) than the compiled method (< 20% for most taxa). Statistical filtering performance was higher for larger, solitary‐celled taxa with relatively uniform features (e.g., Gyrosigma ) compared to small‐celled colonial species with more complex or variable shapes (e.g., mucilaginous colonial cyanobacteria, and Scenedesmus ). Iteratively using the intrinsic statistical filtering method with manual correction between each iteration can be used to augment manual sample classification and reduce processing time.

Read Full Paperexternally

Bookmark

View Full Paper

Cite This Study

Farrow et al. (Wed,) studied this question.

synapsesocial.com/papers/6a06b8dfe7dec685947ab5fc https://doi.org/https://doi.org/10.1002/lom3.70062

Bookmark

View Full Paper