March 1, 2016

Sound source localization based on deep neural networks with directional activate function exploiting phase information

Abstract

This paper describes sound source localization (SSL) based on deep neural networks (DNNs) using discriminative training. A naive DNN for SSL can be configured as follows: the input is the frequency-domain feature used in other SSL methods, and the network is fully connected and uses real numbers. Training such a network fails because its structure loses two important properties: the orthogonality of sub-bands, and the intensity and time information stored in complex numbers. We solved these two problems by 1) integrating directional information at each sub-band hierarchically, and 2) designing a directional activator that can treat the complex numbers at each sub-band. Our experiments indicated that our method outperformed the naive DNN-based SSL by 20 points in terms of block-level accuracy.
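The two properties named in the abstract can be illustrated with a small hand-written sketch. This is not the authors' network or their directional activator: it is a fixed, untrained stand-in that assumes a far-field source, a linear microphone array, and a speed of sound of 343 m/s. The function names (`steering_vector`, `directional_score`, `localize`) are hypothetical. It scores each sub-band separately from the complex observation, so inter-microphone phase is preserved, and only then combines the per-band scores over candidate directions.

```python
import cmath
import math

SPEED_OF_SOUND = 343.0  # m/s, assumed for this illustration


def steering_vector(mic_x, freq_hz, angle_rad):
    """Far-field phase pattern of a linear array for one sub-band and one direction.

    mic_x holds microphone positions in metres along the array axis (assumed geometry).
    """
    return [
        cmath.exp(-2j * math.pi * freq_hz * x * math.cos(angle_rad) / SPEED_OF_SOUND)
        for x in mic_x
    ]


def directional_score(obs, steer):
    """Normalized |steer^H obs| in [0, 1].

    The complex inner product keeps the relative phase between microphones;
    taking the magnitude discards only the global phase of the sub-band.
    """
    inner = sum(s.conjugate() * o for s, o in zip(steer, obs))
    norm = math.sqrt(sum(abs(o) ** 2 for o in obs)) * math.sqrt(
        sum(abs(s) ** 2 for s in steer)
    )
    return abs(inner) / (norm + 1e-12)


def localize(frames, mic_x, freqs_hz, angles_rad):
    """Score every sub-band on its own, then sum the scores per direction.

    frames[k] is the complex observation vector (one value per microphone)
    for sub-band freqs_hz[k]. Returns the index of the best-scoring angle.
    """
    totals = [0.0] * len(angles_rad)
    for obs, freq in zip(frames, freqs_hz):
        for i, angle in enumerate(angles_rad):
            totals[i] += directional_score(obs, steering_vector(mic_x, freq, angle))
    return max(range(len(totals)), key=totals.__getitem__)
```

In this sketch, sub-bands never mix before a per-band directional score exists, which mirrors the hierarchical integration the abstract describes. A fully connected real-valued layer would instead combine real and imaginary parts from all sub-bands at once.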

Cite This Study

Takeda et al. (2016) studied this question.

synapsesocial.com/papers/6a0903d8da638ec17f73af26 https://doi.org/10.1109/icassp.2016.7471706
