Time Line of the Lecture "Pattern Recognition"

  • Fundamentals
    • Speech recognition and understanding
    • Applications and system variants
    • Evaluation
  • Statistical speech recognition
    • Maximum a-posteriori (MAP) rule
    • Model simplification
    • Modeling
  • Conclusion and outlook

Slides of the lecture

  • Motivation
  • Fundamentals
    • The „hidden“ part of the model
    • The inner family of random processes
  • Fundamental problems of Hidden Markov Models
    • Efficient calculation of sequence probabilities
    • Efficient calculation of the most probable sequence

Slides of the lecture

  • Motivation
  • Basics of speaker verification und speaker identification
    • Preprocessing and segmentation
    • Codebook-based schemes
    • Schemes based on Gaussian mixture models
  • Model adaption
  • Discriminative approaches

Slides of the lecture

  • Motivation
  • Fundamentals
    • Gaussian mixture models in practice
    • Generation of Gaussian mixture models
  • Applications in speech and audio processing
    • Bandwidth extension
    • Signal separation
    • Speaker recognition

Slides of the lecture

  • Motivation
  • System concept
  • Extension of the excitation signal
    • Spectral shifting and modulation
    • Non-linear characteristics
  • Extension of the spectral envelope
    • Approaches using neural networks
    • Codebook-based approaches
    • Linear mapping
  • Examples

Slides of the lecture

  • Motivation
  • Application examples
  • Cost function for the training of a codebook
  • LBG- and k-means algorithm
    • Basic schemes
    • Extensions
  • Combination with additional mapping schemes

Slides of the lecture

  • Introduction
  • Features for speech and speaker recognition
    • Fundamental frequency
    • Spectral envelope
  • Representation of the spectral envelope
    • Predictor coefficients
    • Cepstral coefficients
    • Mel-filtered cepstral coefficients

Slides of the lecture

  • Introduction
  • Characteristic of multi-microphone systems
  • Delay-and-sum structures
  • Filter-and-sum structures
  • Interference compensation
  • Audio examples and results
  • Outlook on postfilter structures

Slides of the lecture

  • Generation and properties of speech signals
  • Wiener filter
  • Frequency-domain solution
  • Extensions of the gain rule
  • Extensions of the entire framework
  • Empirical mode decomposition

Slides of the lecture

Website News

18.07.2018: New section About our Parkinson voice training game added.

07.07.2018: New lecture Fundamentals of Acoustics by Jan Abshagen added.

03.03.2018: Team wall added.

28.02.2018: News wall added.

20.01.2017: Talk from Dr. Sander-Thömmes added.

Recent Publications

T. O. Wisch, T. Kaak, A. Namenas, G. Schmidt: Spracherkennung in stark gestörten Unterwasserumgebungen, Proc. DAGA, Germany, 2018

S. Graf, T. Herbig, M. Buck, G. Schmidt: Low-Complexity Pitch Estimation Based on Phase Differences Between Low-Resolution Spectra, Proc. Interspeech, pp. 2316 -2320, 2017

Contact

Prof. Dr.-Ing. Gerhard Schmidt

E-Mail: gus@tf.uni-kiel.de

Christian-Albrechts-Universität zu Kiel
Faculty of Engineering
Institute for Electrical Engineering and Information Engineering
Digital Signal Processing and System Theory

Kaiserstr. 2
24143 Kiel, Germany

Recent News

New Lecture - Fundamentals of Acoustics

Starting this winter term the DSS group offers a new lecture entiteled "Fundamentals of Acoustcs". The lecture is given by Dr. Jan Abshagen (see picture). It will be a 3+1 lecture which takes place once a week. The lecture will cover the following topics:

  • fundamentals of vibrations,
  • theory of sound fields,
  • sound and systems,
  • transducers,
  • sound-structure interaction,
  • ship acoustics,
  • ...

Read more ...
Cookies make it easier for us to provide you with our services. With the usage of our services you permit us to use cookies.
Ok