Environmental sound classification with convolutional neural networks


Abstract

This paper evaluates the potential of convolutional neural networks in classifying short audio clips of environmental sounds. A deep model consisting of 2 convolutional layers with max-pooling and 2 fully connected layers is trained on a low-level representation of audio data (segmented spectrograms) with deltas. The accuracy of the network is evaluated on 3 public datasets of environmental and urban recordings. The model outperforms baseline implementations relying on mel-frequency cepstral coefficients and achieves results comparable to other state-of-the-art approaches.
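To make the input representation concrete, the following is a minimal sketch of how "segmented spectrograms with deltas" could be prepared before being fed to a network. It is not the author's implementation. The function names, the segment length, the hop size, and the simple central-difference delta are all illustrative assumptions, since the abstract does not specify any of them.

```python
from typing import List

# A spectrogram is modeled as a list of frames (time), each a list of band energies.
Spectrogram = List[List[float]]


def segment(spec: Spectrogram, length: int, hop: int) -> List[Spectrogram]:
    """Split a spectrogram into fixed-length segments (hypothetical helper).

    `length` and `hop` are given in frames; hop < length yields overlapping segments.
    Trailing frames that do not fill a whole segment are dropped.
    """
    if length <= 0 or hop <= 0:
        raise ValueError("length and hop must be positive")
    return [spec[start:start + length]
            for start in range(0, len(spec) - length + 1, hop)]


def deltas(seg: Spectrogram) -> Spectrogram:
    """Approximate temporal derivatives with a central difference.

    Edge frames are repeated at the boundaries. This is an assumed, simplified
    delta formula, not necessarily the one used in the paper.
    """
    n = len(seg)
    out = []
    for t in range(n):
        prev_frame = seg[max(t - 1, 0)]
        next_frame = seg[min(t + 1, n - 1)]
        out.append([(b - a) / 2.0 for a, b in zip(prev_frame, next_frame)])
    return out


def two_channel_input(seg: Spectrogram) -> List[Spectrogram]:
    """Stack a segment and its deltas as two input channels."""
    return [seg, deltas(seg)]


if __name__ == "__main__":
    # Toy spectrogram: 6 frames, 2 frequency bands.
    toy = [[float(t), float(2 * t)] for t in range(6)]
    segments = segment(toy, length=4, hop=2)
    channels = two_channel_input(segments[0])
    print(len(segments), len(channels), len(channels[0]), len(channels[0][0]))
```

Each segment then carries two channels (the spectrogram values and their deltas), which is the kind of image-like input a 2-layer convolutional front end with max-pooling could consume.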


Cite This Study

Karol J. Piczak studied this question.

synapsesocial.com/papers/69dc19f92e12f42e21955189 https://doi.org/10.1109/mlsp.2015.7324337
