Time Line of the Lecture "Pattern Recognition"

  • Fundamentals
    • Speech recognition and understanding
    • Applications and system variants
    • Evaluation
  • Statistical speech recognition
    • Maximum a-posteriori (MAP) rule
    • Model simplification
    • Modeling
  • Conclusion and outlook

Slides of the lecture

  • Motivation
  • Fundamentals
    • The „hidden“ part of the model
    • The inner family of random processes
  • Fundamental problems of Hidden Markov Models
    • Efficient calculation of sequence probabilities
    • Efficient calculation of the most probable sequence

Slides of the lecture

  • Motivation
  • Basics of speaker verification und speaker identification
    • Preprocessing and segmentation
    • Codebook-based schemes
    • Schemes based on Gaussian mixture models
  • Model adaption
  • Discriminative approaches

Slides of the lecture

  • Motivation
  • Fundamentals
    • Gaussian mixture models in practice
    • Generation of Gaussian mixture models
  • Applications in speech and audio processing
    • Bandwidth extension
    • Signal separation
    • Speaker recognition

Slides of the lecture

  • Motivation
  • System concept
  • Extension of the excitation signal
    • Spectral shifting and modulation
    • Non-linear characteristics
  • Extension of the spectral envelope
    • Approaches using neural networks
    • Codebook-based approaches
    • Linear mapping
  • Examples

Slides of the lecture

  • Motivation
  • Application examples
  • Cost function for the training of a codebook
  • LBG- and k-means algorithm
    • Basic schemes
    • Extensions
  • Combination with additional mapping schemes

Slides of the lecture

  • Introduction
  • Features for speech and speaker recognition
    • Fundamental frequency
    • Spectral envelope
  • Representation of the spectral envelope
    • Predictor coefficients
    • Cepstral coefficients
    • Mel-filtered cepstral coefficients

Slides of the lecture

  • Introduction
  • Characteristic of multi-microphone systems
  • Delay-and-sum structures
  • Filter-and-sum structures
  • Interference compensation
  • Audio examples and results
  • Outlook on postfilter structures

Slides of the lecture

  • Generation and properties of speech signals
  • Wiener filter
  • Frequency-domain solution
  • Extensions of the gain rule
  • Extensions of the entire framework
  • Empirical mode decomposition

Slides of the lecture

Website News

03.03.2018: Team wall added.

28.02.2018: News wall added.

20.01.2017: Talk from Dr. Sander-Thömmes added.

12.01.2018: New RED section on Trend Removal added.

29.12.2017: Section Years in Review added.

Recent Publications

T. O. Wisch, T. Kaak, A. Namenas, G. Schmidt: Spracherkennung in stark gestörten Unterwasserumgebungen, Proc. DAGA, Germany, 2018

S. Graf, T. Herbig, M. Buck, G. Schmidt: Low-Complexity Pitch Estimation Based on Phase Differences Between Low-Resolution Spectra, Proc. Interspeech, pp. 2316 -2320, 2017


Prof. Dr.-Ing. Gerhard Schmidt

E-Mail: gus@tf.uni-kiel.de

Christian-Albrechts-Universität zu Kiel
Faculty of Engineering
Institute for Electrical Engineering and Information Engineering
Digital Signal Processing and System Theory

Kaiserstr. 2
24143 Kiel, Germany

Recent News

DSS Participation in the Lecture Series "Language and Society"

Prof. Anja Leue from the psychology department of our university organized a lecture series on the topic "language and society". Also the DSS group participated in that event and we presented some of our results on speech in disturbed environments. The lecture took place on Thursday, 5th of May, in one of the lecture rooms in the Audimax building. After the talk, a nice, interesting (and for ...

Read more ...