Jump to content

Sound source localization via distance metric learning with regularization

Fast facts

  • Internal authorship

  • Further publishers

    Mingmin Liu, Zhihua Lu, Xiaodong Wang, João Paulo J. da Cost

  • Publishment

    • 2024
  • Journal

    Signal Processing

  • Organizational unit

  • Subjects

    • Communication and information technology
  • Research fields

    • Institute for Communication Technology (IKT)
  • Publication format

    Journal article (Article)

Quote

M. Liu, Z. Lu, X. Wang, J. P. J. da Cost, and T. Fei, “Sound source localization via distance metric learning with regularization,” Signal Processing, p. 109721, 2024 [Online]. Available: https://www.sciencedirect.com/science/article/pii/S0165168424003414

Content

Sound source localization (SSL) or simply direction of arrival (DOA) classification is an important ingredient in acoustic applications. Traditional model-based algorithms are susceptible to the effects of noise and reverberation, while data-driven deep learning algorithms maintain strong performance across a variety of acoustic circumstances, but typically require a large amount of labeled data. Nevertheless, the existing datasets for SSL are not sufficiently big and diverse to achieve the full potential of deep learning algorithms. Then, it is an imperative work to develop a non-data-hungry algorithm of SSL using small or medium data volume. To this end, we propose a regularized distance metric learning algorithm, that is, by means of the kernel method, we design a nonlinear feature transformation from two aspects: feature points and feature distributions. It transforms the data into a new feature space that brings features of the same class as close as possible and removes features of different classes as far away as possible, which can significantly improve the output of a DOA classifier that follows. Experimental results show that the proposed algorithm outperforms deep learning algorithms in diverse acoustic conditions.

Notes and references

This site uses cookies to ensure the functionality of the website and to collect statistical data. You can object to the statistical collection via the data protection settings (opt-out).

Settings(Opens in a new tab)