4.5 Review

A survey of sound source localization with deep learning methods

相关参考文献

注意:仅列出部分参考文献,下载原文获取全部文献信息。
Article Acoustics

Learning and controlling the source-filter representation of speech with a variational autoencoder

Samir Sadok et al.

Summary: Understanding and controlling latent representations in deep generative models is a challenging yet important problem. In this work, the source-filter model of speech production naturally arises as orthogonal subspaces of the VAE latent space. A method is proposed to identify and control the source-filter speech factors within the latent subspaces, as well as a robust f(0) estimation method.

SPEECH COMMUNICATION (2023)

Article Computer Science, Artificial Intelligence

Deep learning based multi-source localization with source splitting and its effectiveness in multi-talker speech recognition

Aswin Shanmugam Subramanian et al.

Summary: This paper proposes a novel supervised learning method using deep neural networks for multi-source localization in multi-talker conversation analysis. The method utilizes a source splitting mechanism to estimate the direction of arrival (DOA) of all speakers simultaneously from the audio mixture. The proposed method outperforms existing deep learning methods by performing utterance level prediction and incorporating temporal selection and averaging inside the network. Experimental results demonstrate the effectiveness of a variant of earth mover distance (EMD) in classifying DOA at a high resolution. Moreover, the estimated DOAs are used as additional input features in a speech recognition baseline, significantly improving the recognition performance.

COMPUTER SPEECH AND LANGUAGE (2022)

Article Computer Science, Artificial Intelligence

A review of speaker diarization: Recent advances with deep learning

Tae Jin Park et al.

Summary: Speaker diarization is a task to label audio or video recordings with speaker identity classes, and with the advancement of deep learning technology, rapid progress has been made in this field, showing the importance of complementary relationship between speaker diarization and speech recognition.

COMPUTER SPEECH AND LANGUAGE (2022)

Article Computer Science, Artificial Intelligence

A Survey on Multi-Task Learning

Yu Zhang et al.

Summary: This paper provides a survey of Multi-Task Learning (MTL) from the perspective of algorithmic modeling, applications, and theoretical analyses. It discusses different MTL algorithms and their characteristics, as well as the combination of MTL with other learning paradigms. The paper also reviews MTL models for large-scale tasks or high-dimensional data, as well as dimensionality reduction and feature hashing. Real-world applications of MTL are examined, and theoretical analyses and future directions are discussed.

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING (2022)

Proceedings Paper Computer Science, Artificial Intelligence

TrackFormer: Multi-Object Tracking with Transformers

Tim Meinhardt et al.

Summary: This study proposes an end-to-end trainable multi-object tracking approach called TrackFormer, based on an encoder-decoder Transformer architecture. TrackFormer achieves outstanding performance in track initialization, identity, and spatio-temporal trajectory reasoning, and introduces the attention mechanism. Through self- and encoder-decoder attention on global frame-level features, additional graph optimization or modeling of motion and/or appearance is omitted.

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) (2022)

Article Computer Science, Information Systems

gpuRIR: A python library for room impulse response simulation with GPU acceleration

David Diaz-Guerra et al.

Summary: This paper introduces a new implementation that uses GPUs to parallelize the computation of acoustic simulations of ISMs, significantly improving computational speed. A Python library is provided for easy use, which is about 100 times faster than other CPU libraries.

MULTIMEDIA TOOLS AND APPLICATIONS (2021)

Article Engineering, Electrical & Electronic

A Comprehensive Survey on Transfer Learning

Fuzhen Zhuang et al.

Summary: Transfer learning aims to improve the performance of target learners by transferring knowledge from related source domains, reducing the reliance on target-domain data. This survey aims to systematize and summarize existing research studies in order to help readers understand the current status and ideas in the area of transfer learning.

PROCEEDINGS OF THE IEEE (2021)

Article Acoustics

Spatial reconstruction of sound fields using local and data-driven functions

Manuel Hahmann et al.

Summary: Sound field analysis methods utilize local representations to characterize and reconstruct complex sound fields, reducing model discrepancies and using data-driven approaches for suitable models. The use of dictionary learning and principal component analysis demonstrate the potential for modeling diverse sound fields based on their local and statistical properties.

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA (2021)

Article Acoustics

Sound source localization based on multi-task learning and image translation network

Yifan Wu et al.

Summary: Supervised learning-based sound source localization methods have been successful in achieving high accuracy, while a new method called MTIT using Multi-Task learning and Image Translation network for SSL is introduced in this paper. By extracting spatial features and utilizing multi-task learning, MTIT outperforms baseline methods in dynamic environments and shows good generalization performance.

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA (2021)

Article Engineering, Mechanical

Deep learning-based method for multiple sound source localization with high resolution and accuracy

Soo Young Lee et al.

Summary: A deep learning approach is proposed for multiple sound source localization with high resolution and accuracy, regardless of the sound sources' positions on the grid points. By introducing a target function to obtain spatial source distribution maps, the proposed model can accurately predict the positions and strengths of multiple sound sources, outperforming model-based methods. The model is evaluated on a dataset with monopole sources on a square plane with a spiral array of microphones, demonstrating precise localization results irrespective of frequency and the number of sound sources.

MECHANICAL SYSTEMS AND SIGNAL PROCESSING (2021)

Article Acoustics

Deep learning assisted sound source localization using two orthogonal first-order differential microphone arraysa)

Nian Liu et al.

Summary: This paper presents a deep learning assisted sound localization method using a small-sized microphone array to achieve higher spatial resolution, and proposes an improved feature extraction scheme to enhance robustness. Simulation and real-world experimental results show the proposed approach outperforms state-of-the-art counterparts in noisy and reverberant environments.

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA (2021)

Article Engineering, Electrical & Electronic

Acoustic source localization with deep generalized cross correlations

Juan Manuel Vera-Diaz et al.

Summary: This study introduces a Convolutional Deep Neural Network that transforms Generalized Cross Correlation (GCC) between two signals into a Gaussian-shaped signal, named Deep Generalized Cross Correlation (DeepGCC). By combining DeepGCC estimations, a 3D acoustic map is created and further refined using a sparse generative model without the need for retraining the network.

SIGNAL PROCESSING (2021)

Review Computer Science, Artificial Intelligence

Multiple object tracking: A literature review

Wenhan Luo et al.

Summary: This review comprehensively examines the problem of Multiple Object Tracking (MOT) and proposes interesting directions for future research. By analyzing existing methods and experimental results, some fundamental agreements in the field have been verified.

ARTIFICIAL INTELLIGENCE (2021)

Article Engineering, Mechanical

Acoustic source imaging using densely connected convolutional networks

Pengwei Xu et al.

Summary: This study developed several Deep Neural Network (DNN) models specifically designed for acoustic imaging tasks, which were shown to outperform traditional acoustic imaging methods in terms of source localization and strength estimation. These DNN models represent a promising proof-of-concept for the use of DNN models in the field of acoustic imaging.

MECHANICAL SYSTEMS AND SIGNAL PROCESSING (2021)

Article Acoustics

Dynamically localizing multiple speakers based on the time-frequency domain

Hodaya Hammer et al.

Summary: This study introduces a deep neural network-based online multi-speaker localization algorithm that can accurately localize and track multiple speakers simultaneously, performing exceptionally well in both static and dynamic scenarios.

EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING (2021)

Article Acoustics

A neural network based microphone array approach to grid-less noise source localization

Paolo Castellini et al.

Summary: This paper proposes the use of artificial neural networks for localizing and quantifying multiple sound sources in a grid-less way, aiming to improve spatial resolution and computational efficiency.

APPLIED ACOUSTICS (2021)

Article Acoustics

BeamLearning: An end-to-end deep learning approach for the angular localization of sound sources using raw multichannel acoustic pressure dataa)

Hadrien Pujol et al.

Summary: This paper introduces a multiresolution deep learning approach called BeamLearning for sound source localization, which aims to capture relevant information from unprocessed acoustic signals and outperforms traditional methods in terms of accuracy and efficiency in noisy environments.

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA (2021)

Article Acoustics

Learning Deep Direct-Path Relative Transfer Function for Binaural Sound Source Localization

Bing Yang et al.

Summary: This paper proposes using deep neural networks to learn DP-RTF for robust binaural sound source localization. By utilizing a monaural speech enhancement network to improve DP-RTF estimation, and training a single DP-RTF learning network to generalize across different binaural arrays, the proposed method shows effectiveness in noisy and reverberant environments.

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2021)

Article Computer Science, Information Systems

Semi-Supervised Source Localization in Reverberant Environments With Deep Generative Modeling

Michael J. Bianco et al.

Summary: In reverberant environments, localization remains challenging. Supervised learning approaches have shown promise, but lack labels in such scenarios. A semi-supervised learning method, VAE-SSL, is proposed to address this issue by using deep generative modeling. This approach outperforms conventional methods and fully supervised CNNs in label-limited scenarios.

IEEE ACCESS (2021)

Article Acoustics

Exploiting Temporal Context in CNN Based Multisource DOA Estimation

Alexander Bohlender et al.

Summary: Supervised learning methods are effective for DOA estimation, and in this study, a CNN approach with LSTM extension showed superior performance. By adjusting the training data generation framework to incorporate temporal context, a gradual evolution of source activity was achieved. Experimental results demonstrated the effectiveness of using LSTM extension for speaker localization tasks.

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2021)

Article Acoustics

On the representation of wavefronts localized in space-time and wavenumber-frequency domains

Elias Zea et al.

Summary: This study reports evidence of a representation system for transient waves with band limited spectra, known as localized waves in the space-time and wavenumber-frequency domains. Theoretical analysis shows that the pressure spectrum of transient monopoles is distributed over hyperbolic regions of propagating and evanescent waves. Experimental analysis using dictionary learning on reverberant sound fields measured in three rooms reveals components related by analytical transformations in the spectra, suggesting partitioning characterized by hyperbolic dispersion curves and multiple directions and times of arrival.

JASA EXPRESS LETTERS (2021)

Article Acoustics

Neural Network Adaptation and Data Augmentation for Multi-Speaker Direction-of-Arrival Estimation

Weipeng He et al.

Summary: This paper proposes a novel approach for multi-speaker direction-of-arrival estimation using data augmentation and weakly-supervised domain adaptation. By generating source domain data with simulation and collecting real data annotated with weak labels, the proposed method achieves similar performance as fully-labeled real data. The approach suggests an effective development procedure for DOA estimation models applied to new types of microphone arrays with minimal data collection efforts.

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2021)

Article Acoustics

On Improved Training of CNN for Acoustic Source Localisation

Elizabeth Vargas et al.

Summary: This study found that training CNNs with speech or music signals improves the accuracy of DoA estimation compared to training with random signals, across various audio classes. Additionally, the improvement is observed in different acoustic conditions and is significant when the training and test environments are similar and reverberant.

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2021)

Article Acoustics

Overview and Evaluation of Sound Event Localization and Detection in DCASE 2019

Archontis Politis et al.

Summary: This paper provides an overview of the research area of sound event localization and detection, focusing on the first international evaluation organized as part of the DCASE 2019 Challenge. It discusses the evaluation and ranking of systems, as well as the characteristics of the best-performing systems.

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2021)

Article Acoustics

Robust Sound Source Tracking Using SRP-PHAT and 3D Convolutional Neural Networks

David Diaz-Guerra et al.

Summary: The article introduces a new single sound source DOA estimation and tracking system based on the SRP-PHAT algorithm and a three-dimensional Convolutional Neural Network, which accurately tracks sound sources even in highly reverberant scenarios. The system's causal architecture and new training procedure demonstrate its feasibility for real-time applications and robustness in various acoustic conditions. By using acoustical simulations and actual recordings, the system's good performance is proven even with low-resolution inputs.

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2021)

Article Computer Science, Artificial Intelligence

Deep learning in video multi-object tracking: A survey

Gioele Ciaparrone et al.

NEUROCOMPUTING (2020)

Article Computer Science, Artificial Intelligence

Squeeze-and-Excitation Networks

Jie Hu et al.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2020)

Article Acoustics

Three-dimensional source localization using sparse Bayesian learning on a spherical microphone array

Guoli Ping et al.

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA (2020)

Article Engineering, Electrical & Electronic

Data-Driven Multi-Microphone Speaker Localization on Manifolds

Bracha Laufer-Goldshtein et al.

FOUNDATIONS AND TRENDS IN SIGNAL PROCESSING (2020)

Article Acoustics

Robust Source Counting and DOA Estimation Using Spatial Pseudo-Spectrum and Convolutional Neural Network

Thi Ngoc Tho Nguyen et al.

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2020)

Article Acoustics

A Deep Learning Framework for Robust DOA Estimation Using Spherical Harmonic Decomposition

Vishnuvardhan Varanasi et al.

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2020)

Article Acoustics

Multi-Source DOA Estimation Through Pattern Recognition of the Modal Coherence of a Reverberant Soundfield

Abdullah Fahim et al.

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2020)

Article Engineering, Electrical & Electronic

DeepMUSIC: Multiple Signal Classification via Deep Learning

Ahmet M. Elbir

IEEE SENSORS LETTERS (2020)

Article Acoustics

The LOCATA Challenge: Acoustic Source Localization and Tracking

Christine Evers et al.

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2020)

Article Acoustics

Source Localization Using Distributed Microphones in Reverberant Environments Based on Deep Learning and Ray Space Transform

Luca Comanducci et al.

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2020)

Article Computer Science, Artificial Intelligence

Detection of activity and position of speakers by using deep neural networks and acoustic data augmentation

Paolo Vecchiotti et al.

EXPERT SYSTEMS WITH APPLICATIONS (2019)

Editorial Material Engineering, Electrical & Electronic

Introduction to the Issue on Acoustic Source Localization and Tracking in Dynamic Real-Life Scenes

S. Gannot et al.

IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING (2019)

Article Engineering, Electrical & Electronic

Multi-Speaker DOA Estimation Using Deep Convolutional Networks Trained With Noise Signals

Soumitro Chakrabarty et al.

IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING (2019)

Article Engineering, Electrical & Electronic

Sound Event Localization and Detection of Overlapping Sources Using Convolutional Recurrent Neural Networks

Sharath Adavanne et al.

IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING (2019)

Article Engineering, Electrical & Electronic

CRNN-Based Multiple DoA Estimation Using Acoustic Intensity Features for Ambisonics Recordings

Laureline Perotin et al.

IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING (2019)

Article Engineering, Electrical & Electronic

Deep Learning for Audio Signal Processing

Hendrik Purwins et al.

IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING (2019)

Article Acoustics

Experimental characterization of the sound field in a reverberation room

Melanie Nolan et al.

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA (2019)

Review Engineering, Mechanical

Acoustic beamforming for noise source localization - Reviews, methodology and applications

Paolo Chiariotti et al.

MECHANICAL SYSTEMS AND SIGNAL PROCESSING (2019)

Article Engineering, Electrical & Electronic

Building and Evaluation of a Real Room Impulse Response Dataset

Igor Szoke et al.

IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING (2019)

Article Acoustics

A deep learning method for grid-free localization and quantification of sound sources

Adam Kujawski et al.

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA (2019)

Article Acoustics

Sound Localization Based on Phase Difference Enhancement Using Deep Neural Networks

Junhyeong Pak et al.

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2019)

Article Acoustics

Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation

Yi Luo et al.

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2019)

Article Acoustics

Machine learning in acoustics: Theory and applications

Michael J. Bianco et al.

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA (2019)

Article Acoustics

Model-based Bayesian direction of arrival analysis for sound sources using a spherical microphone array

Christopher R. Landschoot et al.

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA (2019)

Article Acoustics

Robust Speaker Localization Guided by Deep Learning-Based Time-Frequency Masking

Zhong-Qiu Wang et al.

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2019)

Article Computer Science, Information Systems

Real-Time Convolutional Neural Network-Based Speech Source Localization on Smartphone

Abdullah Kucuk et al.

IEEE ACCESS (2019)

Article Computer Science, Information Systems

Multitask Learning of Time-Frequency CNN for Sound Source Localization

Cheng Pang et al.

IEEE ACCESS (2019)

Article Engineering, Electrical & Electronic

Voice Activity Detection Using an Adaptive Context Attention Model

Juntae Kim et al.

IEEE SIGNAL PROCESSING LETTERS (2018)

Article Acoustics

Sound source localization and speech enhancement with sparse Bayesian learning beamforming

Angeliki Xenaki et al.

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA (2018)

Article Acoustics

Introduction to compressive sensing in acoustics

Peter Gerstoft et al.

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA (2018)

Article Engineering, Electrical & Electronic

Acoustic source localization in strong reverberant environment by parametric Bayesian dictionary learning

Lu Wang et al.

SIGNAL PROCESSING (2018)

Article Acoustics

Supervised Speech Separation Based on Deep Learning: An Overview

DeLiang Wang et al.

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2018)

Article Computer Science, Information Systems

A Convolutional Neural Network Smartphone App for Real-Time Voice Activity Detection

Abhishek Sehgal et al.

IEEE ACCESS (2018)

Article Engineering, Electrical & Electronic

Direction-of-Arrival Estimation Based on Deep Neural Networks With Robustness to Array Imperfections

Zhang-Meng Liu et al.

IEEE TRANSACTIONS ON ANTENNAS AND PROPAGATION (2018)

Article Acoustics

Sparse Bayesian learning for beamforming using sparse linear arrays

Santosh Nannuru et al.

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA (2018)

Article Computer Science, Artificial Intelligence

Exploiting CNNs for Improving Acoustic Source Localization in Noisy and Reverberant Conditions

Daniele Salvati et al.

IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE (2018)

Article Engineering, Electrical & Electronic

Deep Convolutional Neural Networks and Data Augmentation for Environmental Sound Classification

Justin Salamon et al.

IEEE SIGNAL PROCESSING LETTERS (2017)

Article Acoustics

Spatial analysis and auralization of room acoustics using a tetrahedral microphone

Sebastia V. Amengual Gari et al.

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA (2017)

Review Computer Science, Information Systems

A Survey of Sound Source Localization Methods in Wireless Acoustic Sensor Networks

Maximo Cobos et al.

WIRELESS COMMUNICATIONS & MOBILE COMPUTING (2017)

Article Acoustics

Multichannel Signal Processing With Deep Neural Networks for Automatic Speech Recognition

Tara N. Sainath et al.

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2017)

Article Acoustics

A Consolidated Perspective on Multimicrophone Speech Enhancement and Source Separation

Sharon Gannot et al.

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2017)

Article Computer Science, Hardware & Architecture

ImageNet Classification with Deep Convolutional Neural Networks

Alex Krizhevsky et al.

COMMUNICATIONS OF THE ACM (2017)

Article Robotics

Sound Source Localization Using Deep Learning Models

Nelson Yalta et al.

JOURNAL OF ROBOTICS AND MECHATRONICS (2017)

Article Engineering, Electrical & Electronic

DNN-Based Feature Enhancement Using DOA-Constrained ICA for Robust Speech Recognition

Ho-Yong Lee et al.

IEEE SIGNAL PROCESSING LETTERS (2016)

Article Engineering, Electrical & Electronic

Multisnapshot Sparse Bayesian Learning for DOA

Peter Gerstoft et al.

IEEE SIGNAL PROCESSING LETTERS (2016)

Article Engineering, Electrical & Electronic

The Ray Space Transform: A New Framework for Wave Field Processing

Lucio Bianchi et al.

IEEE TRANSACTIONS ON SIGNAL PROCESSING (2016)

Article Engineering, Electrical & Electronic

Enhancing Sparsity and Resolution via Reweighted Atomic Norm Minimization

Zai Yang et al.

IEEE TRANSACTIONS ON SIGNAL PROCESSING (2016)

Article Acoustics

Multichannel Audio Source Separation With Deep Neural Networks

Aditya Arie Nugraha et al.

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2016)

Article Acoustics

Estimation of the Direct-Path Relative Transfer Function for Supervised Sound-Source Localization

Xiaofei Li et al.

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2016)

Article Computer Science, Artificial Intelligence

A survey on sound source localization in robotics: From binaural to array processing methods

S. Argentieri et al.

COMPUTER SPEECH AND LANGUAGE (2015)

Article Acoustics

Grid-free compressive beamforming

Angeliki Xenaki et al.

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA (2015)

Review Multidisciplinary Sciences

Deep learning

Yann LeCun et al.

NATURE (2015)

Article Acoustics

Tree-Based Recursive Expectation-Maximization Algorithm for Localization of Acoustic Sources

Yuval Dorfan et al.

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2015)

Article Acoustics

Co-Localization of Audio Sources in Images Using Binaural Features and Locally-Linear Regression

Antoine Deleforge et al.

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2015)

Article Acoustics

A Bayesian direction-of-arrival model for an undetermined number of sources using a two-microphone array

Jose Escolano et al.

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA (2014)

Article Acoustics

Compressive beamforming

Angeliki Xenaki et al.

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA (2014)

Article Engineering, Electrical & Electronic

Off-grid DOA estimation using array covariance matrix and block-sparse Bayesian learning

Yi Zhang et al.

SIGNAL PROCESSING (2014)

Article Engineering, Electrical & Electronic

Single-snapshot DOA estimation by using Compressed Sensing

Stefano Fortunati et al.

EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING (2014)

Article Acoustics

Speaker Tracking Using Recursive EM Algorithms

Ofer Schwartz et al.

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2014)

Article Mathematics, Applied

The cosparse analysis model and algorithms

S. Nam et al.

APPLIED AND COMPUTATIONAL HARMONIC ANALYSIS (2013)

Review Computer Science, Artificial Intelligence

Representation Learning: A Review and New Perspectives

Yoshua Bengio et al.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2013)

Article Computer Science, Information Systems

An Approach for Sound Source Localization by Complex-Valued Neural Network

Hirofumi Tsuzuki et al.

IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS (2013)

Review Acoustics

Speaker Diarization: A Review of Recent Research

Xavier Anguera Miro et al.

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2012)

Article Acoustics

Binaural Localization of Multiple Sources in Reverberant and Noisy Environments

John Woodruff et al.

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2012)

Article Engineering, Electrical & Electronic

An Efficient Maximum Likelihood Method for Direction-of-Arrival Estimation via Sparse Bayesian Learning

Zhang-Meng Liu et al.

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS (2012)

Article Acoustics

Rigid sphere room impulse response simulation: Algorithm and applications

D. P. Jarrett et al.

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA (2012)

Article Engineering, Electrical & Electronic

Multi-source TDOA estimation in reverberant audio using angular spectra and clustering

Charles Blandin et al.

SIGNAL PROCESSING (2012)

Article Acoustics

A Probabilistic Model for Robust Localization Based on a Binaural Auditory Front-End

Tobias May et al.

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2011)

Article Engineering, Electrical & Electronic

Sparse Sensing With Co-Pprime Samplers and Arrays

Palghat P. Vaidyanathan et al.

IEEE TRANSACTIONS ON SIGNAL PROCESSING (2011)

Article Acoustics

Under-Determined Reverberant Audio Source Separation Using a Full-Rank Spatial Covariance Model

Ngoc Q. K. Duong et al.

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2010)

Article Acoustics

Model-Based Expectation-Maximization Source Separation and Localization

Michael I. Mandel et al.

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2010)

Article Acoustics

Diffuse Reverberation Model for Efficient Image-Source Simulation of Room Impulse Responses

Eric A. Lehmann et al.

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2010)

Article Engineering, Electrical & Electronic

A Robust Method to Count and Locate Audio Sources in a Multichannel Underdetermined Mixture

Simon Arberet et al.

IEEE TRANSACTIONS ON SIGNAL PROCESSING (2010)

Article Engineering, Electrical & Electronic

Nested Arrays: A Novel Approach to Array Processing With Enhanced Degrees of Freedom

Piya Pal et al.

IEEE TRANSACTIONS ON SIGNAL PROCESSING (2010)

Article Computer Science, Interdisciplinary Applications

WOZ acoustic data collection for interactive TV

Alessio Brutti et al.

LANGUAGE RESOURCES AND EVALUATION (2010)

Article Acoustics

Binaural tracking of multiple moving sources

Nicoleta Roman et al.

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2008)

Article Acoustics

An overview of automatic speaker diarization systems

Sue E. Tranter et al.

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2006)

Article Mathematics, Applied

Stable signal recovery from incomplete and inaccurate measurements

Emmanuel J. Candes et al.

COMMUNICATIONS ON PURE AND APPLIED MATHEMATICS (2006)

Article Engineering, Electrical & Electronic

Time difference of arrival estimation of speech source in a noisy and reverberant environment

TG Dvorkind et al.

SIGNAL PROCESSING (2005)

Article Acoustics

Relative transfer function identification using speech seals

I Cohen

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING (2004)

Article Engineering, Electrical & Electronic

A neural network-based smart antenna for multiple source tracking

AH El Zooghby et al.

IEEE TRANSACTIONS ON ANTENNAS AND PROPAGATION (2000)