Time Line of the Lecture "Pattern Recognition"

  • Basics of speaker verification und speaker identification
    • Preprocessing and segmentation
    • Codebook-based schemes
    • Schemes based on Gaussian mixture models
    • Model adaption
    • Discriminative approaches
  • Fundamentals of speech recognition
    • Speech recognition and understanding
    • Applications and system variants
    • Evaluation
  • Statistical speech recognition
    • Maximum a-posteriori (MAP) rule
    • Model simplification
    • Modeling
  • Conclusion and outlook

Slides of the lecture

  • Motivation
  • Fundamentals
    • The „hidden“ part of the model
    • The inner family of random processes
  • Fundamental problems of Hidden Markov Models
    • Efficient calculation of sequence probabilities
    • Efficient calculation of the most probable sequence

Slides of the lecture

  • Motivation
  • Fundamentals
    • Gaussian mixture models in practice
    • Generation of Gaussian mixture models
  • Applications in speech and audio processing
    • Bandwidth extension
    • Signal separation
    • Speaker recognition

Slides of the lecture

  • Motivation
  • System concept
  • Extension of the excitation signal
    • Spectral shifting and modulation
    • Non-linear characteristics
  • Extension of the spectral envelope
    • Approaches using neural networks
    • Codebook-based approaches
    • Linear mapping
  • Examples

Slides of the lecture

  • Motivation
  • Application examples
  • Cost function for the training of a codebook
  • LBG- and k-means algorithm
    • Basic schemes
    • Extensions
  • Combination with additional mapping schemes

Slides of the lecture

  • Introduction
  • Features for speech and speaker recognition
    • Fundamental frequency
    • Spectral envelope
  • Representation of the spectral envelope
    • Predictor coefficients
    • Cepstral coefficients
    • Mel-filtered cepstral coefficients

Slides of the lecture

  • Introduction
  • Characteristic of multi-microphone systems
  • Delay-and-sum structures
  • Filter-and-sum structures
  • Interference compensation
  • Audio examples and results
  • Outlook on postfilter structures

Slides of the lecture

  • Generation and properties of speech signals
  • Wiener filter
  • Frequency-domain solution
  • Extensions of the gain rule
  • Extensions of the entire framework
  • Empirical mode decomposition

Slides of the lecture

Website News

30.11.2018: New student project on driver distraction added.

01.10.2018: Dissertation of Philipp Bulling added.

14.08.2018: New section about our SONAR "sisters" added.

18.07.2018: New section about our Parkinson voice training game added.

07.07.2018: New lecture Fundamentals of Acoustics by Jan Abshagen added.

Recent Publications

J. Reermann, E. Elzenheimer and G. Schmidt: Real-time Biomagnetic Signal Processing for Uncooled Magnetometers in Cardiology, IEEE Sensors Journal, January, 2019 (early access, doi:  10.1109/JSEN.2019.2893236)

Contact

Prof. Dr.-Ing. Gerhard Schmidt

E-Mail: gus@tf.uni-kiel.de

Christian-Albrechts-Universität zu Kiel
Faculty of Engineering
Institute for Electrical Engineering and Information Engineering
Digital Signal Processing and System Theory

Kaiserstr. 2
24143 Kiel, Germany

Recent News

Saturday Morning Physics 2018

The DSS team was invited to participate in the last of the "Saturday Morning Physics" (SMP) events in 2018. On December 8th, a Saturday of course, Thorben Kaak, Gerhard Schmidt, and Owe Wisch gave a talk on underwater signal processing. Pupils from all around Schleswig-Holstein were quite interested, especially in the basics of SONAR systems.