Abstract Automatic recognition of insect sound could help us understand changing biodiversity trends around the world—but insect sounds are challenging to recognize even for deep learning, due to the broad frequency ranges and limited amount of training data. We present a new dataset comprised of 26298 audio files (226.6 hours), from 459 species of Orthoptera (310 species) and Cicadidae (149 species). InsectSet459 is the first large-scale dataset of insect sound that is easily applicable for developing novel deep-learning methods. Its recordings were made with a variety of audio recorders using varying sample rates to capture the extremely broad range of frequencies that insects produce. We benchmark performance with two state-of-the-art deep learning classifiers, demonstrating good performance but also significant room for improvement in acoustic insect classification. This dataset can serve as a realistic test case for implementing insect monitoring workflows, and as a challenging basis for the development of audio representation methods that can handle highly variable frequencies and/or sample rates.
Building similarity graph...
Analyzing shared references across papers
Loading...
Marius Faiß
University of Konstanz
Burooj Ghani
Naturalis Biodiversity Center
Dan Stowell
Naturalis Biodiversity Center
Scientific Data
Tilburg University
University of Konstanz
Naturalis Biodiversity Center
Building similarity graph...
Analyzing shared references across papers
Loading...
Faiß et al. (Fri,) studied this question.
synapsesocial.com/papers/69c8c3bdde0f0f753b39eb04 — DOI: https://doi.org/10.1038/s41597-026-07123-4