The SPID Corpus

The "SPID" (SPontaneous In-car Dialogues) corpus allows investigating - for the first time ever - the communication between speakers inside a driving car and the Lombard effect that emerges at different driving speeds. Currently, speech recordings are made in order to analyze the effects of an in-car-communication (ICC) system on speech production at different driving noise levels. ICC systems are meant to improve the communication of passengers inside a car and thus help increase driving safety.

At the heart of the SPID corpus is an acoustic ambiance simulation. That is, the speakers sit inside a stationary car and hear realistic driving noises of exactly this car. The acoustic simulation is further complemented by a visual (screen-based) projection of real driving situations that match with the driving noises. On this basis, the Lombard effect can be investigated under highly sophisticated and at the same time highly controlled laboratory conditions; and in addition, the noise can be entirely removed again from the signals after the recordings by means of adaptive cancellation and suppression approaches. Thereby, Lombardaffected speech features like intonation, stress, and formants can be analyzed in full detail, without the corresponding measurements being distorted by background noise.

The recordings themselves were based on the Map-Task paradigm, which elicits spontaneous speech with a number of selected target words included (names of streets, places, persons etc.). One speaker sat in the front passenger seat and the other one behind him/her on the backseat. The two speakers had their own microphone. Recordings were made in driving simulations at 50 km/h (city) and 130 km/h (highway), as well as in a silent reference condition.

 

Key Features

The corpus consists of approximately 25 hours of spontaneous dialogues whose individual durations vary depending on how long it took the dialogue partners to solve their Map Task. The speaker sample included 8 male and 8 female native speakers of Standard German. They lived for a long time in Northern Germany and were between 22 and 31 years old (average age: 26.5 years) at the time of the recording. Pairs of speakers were always of the same gender.

 

State of the Corpus

You can find the current state of the corpus here.

 

Download

The corpus can be used for non-commercial research purposes. Details can be found here.

 

Examples

 
Without an ICC system at 130 km/h.   With an ICC system at 130 km/h.

 

Creators of the Data Base

The data base was created as a joint work between Kiel University (CAU) and the University of Southern Denmark (SDU, Mads Clausen Institute). Involved researchers are:

  • Tina John (CAU)
  • Rabea Landgraf (CAU)
  • Christian Lüke (CAU)
  • Oliver Niebuhr (SDU)
  • Gerhard Schmidt (CAU)
  • Anne Theiss (CAU)

 

Corresponding Publications

R. Landgraf, T. John, G. Schmidt, M. Gimm, O. Niebuhr: Man höre und staune – Perzeptive Evidenz für die Funktionalität eines ICC-Systems anhand von Dialogdaten aus einer multimodalen Fahrsimulation, DAGA 2018

R. Landgraf, G. Schmidt, J. Köhler-Kaeß, O. Niebuhr, and T. John: More Noise, Less Talk - The Impact of Driving Noise and ICC Systems on Acoustic-prosodic Parameters in Dialogue, Proc. DAGA, Kiel, Germany, open access, 2017

R. Landgraf, J. Köhler-Kaeß, C. Lüke, O. Niebuhr, G. Schmidt: Can You Hear Me Now? Reducing the Lombard Effect in a Driving Car Using an In-Car Communication System, Proc. Speech Prosody, pp. 479 - 483, 2016

A. Warhadpande, C. Lüke, A. Theiß, G. Schmidt: Improvement by Adding Video Feature in an Acoustic Ambiance Simulation for Automobiles, 7th Biennial Workshop on DSP for In-Vehicle Systems and Safety 2015, Berkeley, CA, USA

O. Niebuhr, B. Peters, R. Landgraf, G. Schmidt: The Kiel Corpora of "Speech & Emotion" - A Summary, Proc. DAGA 2015, March 16-19, 2015, Nürnberg, Germany

R. Landgraf, O. Niebuhr, G. Schmidt, T. John, C. Lüke, A. Theiß: Von der Straße ins Labor: Die Modifikation der Sprachproduktion bei lauten Fahrgeräuschen, DAGA 2015, Nürnberg, Germany

C. Lüke, A. Theiß, G. Schmidt, O. Niebuhr, T. John: Creation of a Lombard Speech Database using an Acoustic Ambiance Simulation with Loudspeakers, 6th Biennial DSP Workshop for In-Vehicle Systems 2013, Seoul, Korea

Website News

18.07.2018: New section About our Parkinson voice training game added.

07.07.2018: New lecture Fundamentals of Acoustics by Jan Abshagen added.

03.03.2018: Team wall added.

28.02.2018: News wall added.

20.01.2017: Talk from Dr. Sander-Thömmes added.

Recent Publications

T. O. Wisch, T. Kaak, A. Namenas, G. Schmidt: Spracherkennung in stark gestörten Unterwasserumgebungen, Proc. DAGA, Germany, 2018

S. Graf, T. Herbig, M. Buck, G. Schmidt: Low-Complexity Pitch Estimation Based on Phase Differences Between Low-Resolution Spectra, Proc. Interspeech, pp. 2316 -2320, 2017

Contact

Prof. Dr.-Ing. Gerhard Schmidt

E-Mail: gus@tf.uni-kiel.de

Christian-Albrechts-Universität zu Kiel
Faculty of Engineering
Institute for Electrical Engineering and Information Engineering
Digital Signal Processing and System Theory

Kaiserstr. 2
24143 Kiel, Germany

Recent News

New Lecture - Fundamentals of Acoustics

Starting this winter term the DSS group offers a new lecture entiteled "Fundamentals of Acoustcs". The lecture is given by Dr. Jan Abshagen (see picture). It will be a 3+1 lecture which takes place once a week. The lecture will cover the following topics:

  • fundamentals of vibrations,
  • theory of sound fields,
  • sound and systems,
  • transducers,
  • sound-structure interaction,
  • ship acoustics,
  • ...

Read more ...
Cookies make it easier for us to provide you with our services. With the usage of our services you permit us to use cookies.
Ok