Prof. Anja Leue from the psychology department of our university organized a lecture series on the topic "language and society". The DSS group also participated in that event, and we presented some of our results on speech in disturbed environments. The lecture took place on Thursday, 5th of May, in one of the lecture rooms in the Audimax building. After the talk, a nice, interesting, and (for our field) long discussion took place. Here are the contents of the DSS talk:

Nowadays, technical systems allow for voice communication even in very disturbed environments. Examples are communication masks for firefighters, swim goggles for underwater speech communication, or speech communication within cars. In the latter example, the speech of the dialog partners is impaired by several factors. Depending on the driving speed, a moderate or even high level of background noise is superimposed on the speech signals generated by the passengers or by loudspeakers that emit the signals from communication partners connected via mobile phones. Due to the seat adjustment (position and orientation), the front passengers do not speak in the direction of the rear passengers, and face-to-face communication among the passengers is not as easy as in a “normal” conversation.

If so-called ICC systems (ICC stands for in-car communication) are used, the passengers are recorded using microphones. After appropriate signal processing (mainly noise, echo, and feedback reduction), the enhanced signals of the talkers are played back via loudspeakers close to the ears of the listening passengers. At first glance, such systems face the same problems as hands-free or speech dialog systems. However, because they have to operate in a closed electro-acoustic loop, special problems arise, e.g. the local signals are correlated with the loudspeaker signals, which causes problems when performing system identification with adaptive filters. Furthermore, the enhancement usually leads to a better signal-to-noise ratio at the ears of the listeners. Yet the more the signal-to-noise ratio is improved for the listening passengers, the more the speaking passengers become aware of, or even disturbed by, their own voices due to echo perception.
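To make "system identification with adaptive filters" a bit more concrete, here is a minimal sketch of the normalized least-mean-squares (NLMS) algorithm, a common choice for such tasks. It is not taken from the talk: the function names, the three-tap path `true_path`, and the step size are purely illustrative assumptions. The sketch covers only the idealized case of a white excitation signal and no local speech. In a real ICC system, the local signals are correlated with the loudspeaker signals, which is exactly the situation in which such an estimate can be expected to become unreliable.

```python
import random


def nlms_identify(x, d, num_taps, mu=0.5, eps=1e-8):
    """Estimate an FIR impulse response from excitation x and observation d."""
    w = [0.0] * num_taps      # current filter estimate
    buf = [0.0] * num_taps    # most recent excitation samples, newest first
    errors = []
    for x_n, d_n in zip(x, d):
        buf = [x_n] + buf[:-1]
        y_n = sum(w_i * b_i for w_i, b_i in zip(w, buf))   # filter output
        e_n = d_n - y_n                                     # error signal
        norm = sum(b_i * b_i for b_i in buf) + eps          # excitation energy
        w = [w_i + mu * e_n * b_i / norm for w_i, b_i in zip(w, buf)]
        errors.append(e_n)
    return w, errors


def fir(h, x):
    """Filter x with the FIR impulse response h."""
    return [
        sum(h[k] * x[n - k] for k in range(len(h)) if n - k >= 0)
        for n in range(len(x))
    ]


# Hypothetical loudspeaker-to-microphone path (assumption for illustration).
true_path = [0.6, -0.3, 0.1]

random.seed(0)
x = [random.gauss(0.0, 1.0) for _ in range(2000)]  # white excitation
d = fir(true_path, x)                              # noise-free observation

w_hat, errors = nlms_identify(x, d, num_taps=len(true_path))
```

Under these idealized conditions, `w_hat` converges to `true_path`. Adding a local signal to `d` that is correlated with `x` would be one way to explore the closed-loop problem described above.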

In this talk, I will try to mention most of the challenges that one faces when building enhancement systems for speech in disturbed environments. The solution to these challenges is usually a “cocktail” of individual processing units, where the ingredients are low-delay filterbanks, adaptive structures for system identification, spectral suppression rules, decorrelation schemes, and adaptive mixing approaches. In most cases, a compromise between the needs of the talking and the listening passengers has to be found, which makes this application a very interesting challenge.
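As an illustration of one of these ingredients, the following sketch shows a simple Wiener-type spectral suppression rule with a spectral floor. It is only one common example of such a rule and not necessarily the one used in the systems discussed in the talk. The function name, the parameters `overestimation` and `floor`, and the example power spectral densities are assumptions made for illustration.

```python
def wiener_gains(input_psd, noise_psd, overestimation=1.0, floor=0.1):
    """Compute one attenuation factor per frequency bin.

    input_psd: estimated power of the noisy input per bin
    noise_psd: estimated noise power per bin
    floor:     lower limit of the gain (maximum attenuation)
    """
    gains = []
    for s_yy, s_nn in zip(input_psd, noise_psd):
        # Wiener-type rule: attenuate bins in which noise dominates.
        gain = 1.0 - overestimation * s_nn / max(s_yy, 1e-12)
        # The floor limits the attenuation so that residual noise stays natural.
        gains.append(max(gain, floor))
    return gains


# Hypothetical power estimates for three frequency bins.
input_psd = [10.0, 2.0, 1.0]
noise_psd = [1.0, 1.0, 1.0]

gains = wiener_gains(input_psd, noise_psd)
```

Bins with a high signal-to-noise ratio pass almost unchanged, while bins that contain mostly noise are attenuated down to the floor. In a complete system, these gains would be applied to the subband signals of a low-delay filterbank.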

If one combines pure ICC systems with other speech and audio systems in a car, such as hands-free, anti-noise, or music playback systems, the complexity of the resulting system increases. However, the system components mentioned before can be combined such that they overcome some of the problems, which is again an interesting challenge.

Let me finally mention that, even after decades of great and continuous improvement in speech and audio signal processing, the communication of people in highly disturbed environments can still be improved. Thus, speech signal enhancement remains “a rocky road”, to put it in the words of one of the early German speech processing researchers.
