Time Line of the Lecture "Pattern Recognition"

  • Fundamentals
    • Speech recognition and understanding
    • Applications and system variants
    • Evaluation
  • Statistical speech recognition
    • Maximum a-posteriori (MAP) rule
    • Model simplification
    • Modeling
  • Conclusion and outlook

Slides of the lecture

  • Motivation
  • Fundamentals
    • The „hidden“ part of the model
    • The inner family of random processes
  • Fundamental problems of Hidden Markov Models
    • Efficient calculation of sequence probabilities
    • Efficient calculation of the most probable sequence

Slides of the lecture

  • Motivation
  • Basics of speaker verification und speaker identification
    • Preprocessing and segmentation
    • Codebook-based schemes
    • Schemes based on Gaussian mixture models
  • Model adaption
  • Discriminative approaches

Slides of the lecture

  • Motivation
  • Fundamentals
    • Gaussian mixture models in practice
    • Generation of Gaussian mixture models
  • Applications in speech and audio processing
    • Bandwidth extension
    • Signal separation
    • Speaker recognition

Slides of the lecture

  • Motivation
  • System concept
  • Extension of the excitation signal
    • Spectral shifting and modulation
    • Non-linear characteristics
  • Extension of the spectral envelope
    • Approaches using neural networks
    • Codebook-based approaches
    • Linear mapping
  • Examples

Slides of the lecture

  • Motivation
  • Application examples
  • Cost function for the training of a codebook
  • LBG- and k-means algorithm
    • Basic schemes
    • Extensions
  • Combination with additional mapping schemes

Slides of the lecture

  • Introduction
  • Features for speech and speaker recognition
    • Fundamental frequency
    • Spectral envelope
  • Representation of the spectral envelope
    • Predictor coefficients
    • Cepstral coefficients
    • Mel-filtered cepstral coefficients

Slides of the lecture

  • Introduction
  • Characteristic of multi-microphone systems
  • Delay-and-sum structures
  • Filter-and-sum structures
  • Interference compensation
  • Audio examples and results
  • Outlook on postfilter structures

Slides of the lecture

  • Generation and properties of speech signals
  • Wiener filter
  • Frequency-domain solution
  • Extensions of the gain rule
  • Extensions of the entire framework
  • Empirical mode decomposition

Slides of the lecture

Website News

14.08.2018: New section about our SONAR "sisters" added.

18.07.2018: New section about our Parkinson voice training game added.

07.07.2018: New lecture Fundamentals of Acoustics by Jan Abshagen added.

03.03.2018: Team wall added.

28.02.2018: News wall added.

Recent Publications

T. O. Wisch, T. Kaak, A. Namenas, G. Schmidt: Spracherkennung in stark gestörten Unterwasserumgebungen, Proc. DAGA, Germany, 2018

J. Sautter, F. Faubel, M. Buck, G. Schmidt: Evaluation of Different Excitation Generation Algorithms for Artificial Bandwidth Extension, Conference on Electronic Speech Signal Processing, 2018, Ulm, Germany (online access)

Contact

Prof. Dr.-Ing. Gerhard Schmidt

E-Mail: gus@tf.uni-kiel.de

Christian-Albrechts-Universität zu Kiel
Faculty of Engineering
Institute for Electrical Engineering and Information Engineering
Digital Signal Processing and System Theory

Kaiserstr. 2
24143 Kiel, Germany

Recent News

Philipp is now "Doc Bulling"

On Monday, 10th of September 2018, Philipp Bulling sucessfully defended his dissertation on speech enhancement for in-car communication systems. His work focused mainly on the control of adaptive feedback cancellation filters (details can be found here). After an excellent talk Philipp gave very good answers to the questions of the committee and at around 11:30 h everythings war over and we ...


Read more ...