NEWS

The Centre for Acoustic Signal Processing Research (CASPR) is a newly founded centre at the Section for Signal and Information Processing, Department of Electronic Systems, Aalborg University, Denmark.

[November, 2018] Papers of CASPR members:

  • Mean Square Performance Evaluation in Frequency Domain for an Improved Adaptive Feedback Cancellation in Hearing Aids. A. Kar, A. Anand, J. Østergaard, S.H. Jensen, and M.N.S. Swamy. Accepted for publication in Signal Processing, Elsevier Journal, 2019.
  • Information Loss in the Human Auditory System. M. Z. Jahromi, A. Zahedi, J. Jensen, and J. Østergaard. Accepted for publication in IEEE Trans. Audio, Speech, Language Process., 2018.

[October, 2018] Papers of CASPR members:

  • Public Perception of Android Robots: Indications from an Analysis of YouTube Comments. E. Vlachos and Z.-H. Tan, the 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2018), Madrid, Spain, 1-5 October 2018.

[September, 2018] Prof. Wai-Yip Geoffrey Chan from Queen’s University, Kingston, Canada, is visiting CASPR for four months, from September to December 2018. Prof. Chan will be working on new ideas in deep learning for speech intelligibility enhancement.

Wai-Yip Geoffrey Chan received the B.Eng. and M.Eng. degrees from Carleton University, Ottawa, and the Ph.D. degree from the University of California at Santa Barbara, CA, USA, all in electrical engineering. He has held positions with the Communications Research Centre, Bell Northern Research (Nortel), McGill University, and the Illinois Institute of Technology. He is currently a Professor with the Department of Electrical and Computer Engineering, Queen’s University, Canada. His research interests are in communications, speech, and multimedia signal processing and coding. He was an Associate Editor of the IEEE/ACM Transactions on Audio, Speech, and Language Processing. He is an Associate Editor of the EURASIP Journal on Audio, Speech, and Music Processing. He has helped organize several IEEE-sponsored conferences on communications, speech coding, and image processing. He received the CAREER Award from the U.S. National Science Foundation.

[August, 2018] Papers of CASPR members:

  • Multi-Task Adversarial Network Bottleneck Features for Noise-Robust Speaker Verification. H. Yu, T. Hu, Z. Ma, Z.-H. Tan and J. Guo, IEEE International Conference on Network Infrastructure and Digital Content (IC-NIDC 2018), Guiyang, China, August 22 – 24, 2018.
  • The Sound or Silence: investigating the influence of robot noise on proxemics. G. Trovato, R. Paredes, J. Balvin, F. Cuellar, N.B. Thomsen, S. Bech, and Z.-H. Tan, the 27th IEEE International Conference on Robot and Human Interactive Communication, RO-MAN 2018, Nanjing and Tai’an, China, 27-31 August 2018.

[July, 2018] The third Newsletter from CASPR has been released:
CASPR_Newsletter_July2018.pdf

[July, 2018] Papers of CASPR members:

  • Zero-Delay Rate Distortion via Filtering for Vector-Valued Gaussian Sources. P. A. Stavrou, J. Østergaard, and C. Charalambous. IEEE Journal of Selected Topics in Signal Processing, July, 2018.

[June, 2018] CASPR has a new fully funded PhD stipend available in the area of Signal Quality Estimation for Speech Enhancement using Miniature EEG Devices.

The main objective of this PhD project is to estimate the perceived speech or sound quality from EEG signals recorded by in-ear and around-the-ear EEG devices. Such compact EEG devices may be integrated into various hearing assistive devices (HADs), for example to help guide the signal processing in the HADs.

In this PhD project, a signal processing and information theoretic approach will be pursued, which involves the use of recent results on information losses in the human auditory system, fundamental information flows in the EEG signals, and variants of transfer entropy.

The stipend is open for appointment from 1 September 2018, or as soon as possible thereafter.

To apply for the position please use the following link: http://www.vacancies.aau.dk/show-vacancy/?vacancy=982588

[June, 2018] Papers of CASPR members:

  • Refinement and Validation of the Binaural Short Time Objective Intelligibility Measure for Spatially Diverse Conditions. A.H. Andersen, J.M. de Haan, Z.-H. Tan and J. Jensen, accepted by Speech Communication.
  • Non-Intrusive Speech Intelligibility Prediction using Convolutional Neural Networks. A.H. Andersen, J.M. de Haan, Z.-H. Tan and J. Jensen, accepted by IEEE/ACM Transactions on Audio, Speech and Language Processing.
  • Effectiveness of Single-Channel BLSTM Enhancement for Language Identification. P.S. Frederiksen, J. Villalba, S. Watanabe, Z.-H. Tan and N. Dehak, accepted by Interspeech 2018, Hyderabad, India, September 2-6, 2018.

[May, 2018] Papers of CASPR members:

  • A Spatial Self-Similarity Based Feature Learning Method for Face Recognition under Varying Poses. X. Duan and Z.-H. Tan, accepted by Pattern Recognition Letters, 2018.

[March, 2018] Papers of CASPR members:

  • Bias-compensated Informed Sound Source Localization Using Relative Transfer Functions. M. Farmani, M. S. Pedersen, Z.-H. Tan, and J. Jensen, accepted by IEEE/ACM Transactions on Audio, Speech and Language Processing, 2018.

[February, 2018] Papers of CASPR members:

  • Using Closed-set Speaker Identification Score Confidence to Enhance Audio-based Collaborative Filtering for Multiple Users. S.E. Shepstone, Z.-H. Tan and M.S. Kristoffersen, accepted by IEEE Transactions on Consumer Electronics, 2018.
  • Evaluation and Comparison of Late Reverberation Power Spectral Density Estimators. S. Braun, A. Kuklasinski, O. Schwartz, O. Thiergart, E.A.P. Habets, S. Gannot, S. Doclo, and J. Jensen, accepted by IEEE/ACM Transactions on Audio, Speech and Language Processing, 2018.

[January, 2018] Papers of CASPR members:

  • Monaural Speech Enhancement Using Deep Neural Networks by Maximizing a Short-Time Objective Intelligibility Measure. M. Kolbæk, Z.-H. Tan and J. Jensen, The 43rd IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2018), 15-20 April 2018, Calgary, Alberta, Canada.
  • Fixed-Rate Zero-Delay Source Coding for Stationary Vector-Valued Gauss-Markov Sources. P. A. Stavrou and J. Østergaard, IEEE Data Compression Conference (DCC), March 2018.
  • A Perceptually Motivated LP Residual Estimator in Noisy and Reverberant Environments. R. Peng, Z.-H. Tan, X. Li, and C. Zheng, accepted by Speech Communication, 2017.

[December, 2017] The second Newsletter from CASPR has been released: CASPR_Newsletter_December17.pdf

[November, 2017] Papers of CASPR members:

  • An Upper Bound to Zero-Delay Rate Distortion via Kalman Filtering for Vector Gaussian Sources. P. A. Stavrou, J. Østergaard, C. Charalambous, and M. Derpich. Proceedings of the IEEE Information Theory Workshop, Kaohsiung, Taiwan, 2017.
  • Spoofing Detection in Automatic Speaker Verification Systems Using DNN Classifiers and Dynamic Acoustic Features. H. Yu, Z.-H. Tan, Z. Ma, R. Martin, and J. Guo, accepted by IEEE Transactions on Neural Networks and Learning Systems, 2017.
  • Time-Contrastive Learning Based DNN Bottleneck Features for Text-Dependent Speaker Verification. A.K. Sarkar and Z.-H. Tan, NIPS 2017 Time Series Workshop, Long Beach, CA, USA, Dec. 8, 2017.

[September, 2017] Papers of CASPR members:

  • Robust Voice Liveness Detection and Speaker Verification Using Throat Microphones. M. Sahidullah, D.A.L. Thomsen, R.G. Hautamaki, T. Kinnunen, Z.-H. Tan, R. Parts, M. Pitkanen, accepted by IEEE/ACM Transactions on Audio, Speech and Language Processing, 2017.
  • iSocioBot – A Multimodal Interactive Social Robot. Z.-H. Tan, N.B. Thomsen, X. Duan, E. Vlachos, S.E. Shepstone, M.H. Rasmussen and J.L. Højvang, accepted by International Journal of Social Robotics, 2017.
  • Weighted Score Based Fast Converging CO-training with Application to Audio-Visual Person Identification. X. Duan, N.B. Thomsen, Z.-H. Tan, B. Lindberg and S.H. Jensen,  The 29th IEEE International Conference on Tools with Artificial Intelligence (ICTAI2017), Boston, Massachusetts, USA, Nov. 6-8, 2017.
  • An Upper Bound to Zero-Delay Rate Distortion via Kalman Filtering for Vector Gaussian Sources. P. A. Stavrou, J. Østergaard, C. Charalambous, and M. Derpich. Proceedings of the IEEE Information Theory Workshop, Kaohsiung, Taiwan, 2017.

[August, 2017] Papers of CASPR members:

  • Incorporating Pass-Phrase Dependent Background Models for Text-Dependent Speaker Verification. A. Sarkar and Z.-H. Tan, accepted by Computer Speech & Language, 2017.
  • Latent Dirichlet Mixture Model. J.-T. Chien, C.-H. Lee and Z.-H. Tan, accepted by Neurocomputing, 2017.
  • Visual Detection of Events of Interest from Urban Activity. S. Astaras, A. Pnevmatikakis and Z.-H. Tan, accepted by Wireless Personal Communications, 2017.
  • Joint Separation and Denoising of Noisy Multi-Talker Speech Using Recurrent Neural Networks and Permutation Invariant Training. M. Kolbæk, D. Yu, Z.-H. Tan and J. Jensen, accepted by the IEEE 27th International Workshop on Machine Learning for Signal Processing (MLSP), Tokyo, Japan, 25-28 September 2017.

[July, 2017] We are happy to announce that registration for the Winter School on Signal Processing for Hearing Assistive Devices is now open. The Winter School takes place at Aalborg University on 6-10 November 2017.

Registration fee:
PhD students: 1500,- DKK
Industry: 8000,- DKK

To register for the Winter School please use the following link: https://phd.moodle.aau.dk/course/view.php?id=792

Important: The link to payment can be found under the section Course Fee in the above link.

A description of the course can be found on the CASPR website: http://caspr.es.aau.dk/education/courses

[July, 2017] Papers of CASPR members:

  • Morten Kolbæk, Dong Yu, Zheng-Hua Tan and Jesper Jensen, “Multi-talker Speech Separation with Utterance-level Permutation Invariant Training of Deep Recurrent Neural Networks”, accepted by IEEE Transactions on Audio, Speech and Language Processing, 2017.

[9th June, 2017]
The first Newsletter from CASPR has been released: CASPR_Newsletter_June17

[June, 2017] Papers of CASPR members to appear at Interspeech 2017:

  • Humans do not maximize the probability of correct decision when recognizing DANTALE words in noise. Z. Jahromi, J. Østergaard, and J. Jensen, Proc. Interspeech 2017, Stockholm, Sweden, 2017, to appear.
  • On the use of Band Importance Weighting in the Short-Time Objective Intelligibility Measure. A.H. Andersen, J.M. de Haan, Z.-H. Tan and J. Jensen, Proc. Interspeech 2017, Stockholm, Sweden, 2017, to appear.
  • Adversarial Network Bottleneck Features for Noise Robust Speaker Verification. H. Yu, Z.-H. Tan, Z. Ma and J. Guo, Proc. Interspeech 2017, Stockholm, Sweden, 2017, to appear.
  • Conditional Generative Adversarial Networks for Speech Enhancement and Noise-Robust Speaker Verification. D. Michelsanti and Z.-H. Tan, Proc. Interspeech 2017, Stockholm, Sweden, 2017, to appear.
  • Improving Speaker Verification Performance in Presence of Spoofing Attacks Using Out-of-Domain Spoofed Data. A. Sarkar, Md Sahidullah, Z.-H. Tan and T. Kinnunen, Proc. Interspeech 2017, Stockholm, Sweden, 2017, to appear.

[May, 2017] J. Jensen of CASPR has received a research grant from the Danish Council for Independent Research for the project entitled “Intelligibility-Aware Hearing Assistive Devices”, which falls within CASPR's research area. The project involves a 3-year PhD track and a 3-year postdoc track.

Abstract: Hearing assistive devices, such as headsets for speech communication in noisy environments, hearing aid systems, and cochlear implants, aim at improving the speech intelligibility (SI) for the user. To do so, the hearing assistive devices process the acoustic signals before they are presented to the ears of the user. The research project explores deep-learning based methods for predicting the SI experienced by the user in a given acoustic situation (the PhD track) and enhancing the SI by processing the microphone signals before they are presented to the ears of the user (the postdoc track). The project will take place at the Section for Signal and Information Processing (SIP), Department of Electronic Systems, Aalborg University.

For more information on the open PhD and postdoc positions, please consult www.vacancies.aau.dk or contact Professor Jesper Jensen (email: jje@es.aau.dk).

[7th February, 2017] On February 2nd, Professor Patrick Naylor from Imperial College London visited our group to discuss future research collaboration and to give a presentation entitled “Measurement and Exploitation of Reverberation in Speech signals”.

[March, 2017] Papers of CASPR members to appear at IEEE Data Compression Conference 2017:

  • An Asymmetric Difference Multiple Description Gaussian Noise Channel. J. Østergaard, Y. Kochman, and R. Zamir, IEEE Data Compression Conference, April, 2017.

[20th January, 2017] In connection with the official opening of our new research centre within the area of acoustic signal processing, the Centre for Acoustic Signal Processing Research (CASPR), you are invited to an afternoon event with technical presentations, demos, and lab tours at the Section for Signal and Information Processing (SIP), Department of Electronic Systems, Aalborg University.

Date: March 2, 2017.
Place: Aalborg Universitet, Fredrik Bajers Vej 7, Room A4-108.
Free registration. Use the link to sign up: Sign Up
Technical program can be downloaded here: invitation

[4th January, 2017] A paper co-authored by J. Jensen of CASPR received an IEEE Signal Processing Society Best Paper Award.
An Algorithm for Intelligibility Prediction of Time–Frequency Weighted Noisy Speech, C. H. Taal, R. C. Hendriks, R. Heusdens and J. Jensen, Transactions on Audio, Speech, and Language Processing, Volume 19, No. 7, September 2011.
For this award, papers in a 5-year window spanning 2011 to 2016 are considered. The award honors the author(s) of a paper of exceptional merit dealing with a subject related to the Society’s technical scope, and appearing in one of the Society’s solely owned transactions or the Journal of Selected Topics in Signal Processing, irrespective of the author’s age.

[15th December, 2016] We will be organizing a Winter School on Signal Processing for Hearing Assistive Devices at Aalborg University, November 6 – 10, 2017. For details see: http://caspr.es.aau.dk/education/

[December, 2016] Papers of CASPR members to appear at ICASSP 2017:

  • Permutation Invariant Training of Deep Models for Speaker-Independent Multi-Talker Speech Separation. D. Yu, M. Kolbæk, Z.-H. Tan, J. Jensen, Proc. International Conf. Audio, Speech, Signal Proc. (ICASSP), 2017.
  • A Non-Intrusive Short-Time Objective Intelligibility Measure. A. H. Andersen, J. M. de Haan, Z.-H. Tan, and J. Jensen, Proc. International Conf. Audio, Speech, Signal Proc. (ICASSP), 2017.
  • RedDots Replayed: A New Replay Spoofing Attack Corpus for Text-dependent Speaker Verification Research. T. Kinnunen, M. Sahidullah, M. Falcone, L. Costantini, R. Hautamaki, D. Thomsen, A. Sarkar, Z.-H. Tan, H. Delgado, M. Todisco, N. Evans, V. Hautamaki, and K.A. Lee, Proc. International Conf. Audio, Speech, Signal Proc. (ICASSP), 2017.