Lecture "Pattern Recognition"

Basic Information

Lecturers:   Gerhard Schmidt (lecture) and Tobias Hübschen (exercise)
Room:   F-SR-II
Language:   English
Target group:   Students in electrical engineering and computer engineering
Prerequisites:   Basics in system theory
Contents:  

In this lecture the basics of speech, audio, and music signal processing are treated. Often schemes that are based on statistical optimization are utilized for these applications. The involved cost function are matched to the human audio perception.

Topic overview:

  • Preprocessing to reduce signal distortions
    • Noise reduction
    • Beamforming
  • Speech and speaker recognition
    • Fundamentals of speech generation
    • Feature extraction
    • Gaussian mixture models (GMMs)
    • Hidden Markov models (HMMs)
    • Recognition of speech and speakers
  • Enhancement of signal playback
    • Extending the bandwidth of speech signals
    • Equalization of loudspeakers
    • Upmix of stereo signals for playback with more than two loudspeakers

 

News

It is now possible to reserve your examination time slot via our booking system. Please also remember to sign up for a talk as this is a prerequisite for your admission to the oral exam.

 

Lecture Slides

The slides of the lecture can be found here.

 

Matlab Demos

  Matlab demo (GUI based) for adaptive noise suppression
  Matlab demo (GUI based) for linear prediction

 

Exercises

Please note that the questionnaires will be uploaded every week before the excercises, if you download them earlier, you won't get the most recent version.

de en    
  Questionnaire for the lecture "Noise Suppression"
  Questionnaire for the lecture "Beamforming"
  Questionnaire for the lecture "Feature extraction"
  Questionnaire for the lecture "Codebook training"
  Questionnaire for the lecture "Bandwidth extension"
  Questionnaire for the lecture "Gaussian Mixture Models"
  Questionnaire for the lecture "Speaker recognition"
  Questionnaire for the lecture "Hidden Markov Models"
  Questionnaire for the lecture "Speech recognition"

 

Talks

At the end of the semester, each student will give a talk about a certain topic. The aim is both to give you the chance to work on a pattern recognition-related topic that interests you, and to improve your presentational skills. The talk is also a prerequisite for your admission to the exam. The talks should take ten minutes, plus 2.5 minutes of discussion and 2.5 minutes of feedback. Please write an email to This email address is being protected from spambots. You need JavaScript enabled to view it. to reserve your topic.

Below you can find the schedule of the talks.

Date   Room   Time   Topic   Presenter(s)
19.01.2018   F-SR-II   xx:xx h   Beamforming using Artificial Neural Networks   Nico Simoski
19.01.2018   F-SR-II   xx:xx h   Genetic Algorithms   Bastian Kaulen
19.01.2018   F-SR-II   xx:xx h   Adaptive Filters   Tim Benedikt Kupke
                 
02.02.2018   F-SR-II   xx:xx h   Speaker Recognition using Neural Networks   Patricia Piepjohn

 

Exams

Below is the list of students with their exam dates. If you do not have a date for the exam yet please use the oral exam booking system on this website. You can find the booking system here.

Date   Time   Students (matriculation numbers)   Assessor
19.02.2018   08:00 h   1008366, 1018104   Tobias Hübschen
19.02.2018   11:00 h   1122961   Tobias Hübschen
19.02.2018   15:00 h   1113778, 1021302   Tobias Hübschen

 

Website News

03.12.2017: Added pictures from our Sylt meeting.

01.10.2017: Started with a Tips and Tricks section for KiRAT.

01.10.2017: Talks from Jonas Sauter (Nuance) and Vasudev Kandade Rajan (Harman/Samsung) added.

13.08.2017: New Gas e.V. sections (e.g. pictures or prices) added.

Recent Publications

J. Reermann, P. Durdaut, S. Salzer, T. Demming, A. Piorra, E. Quandt, N. Frey, M. Höft, and G. Schmidt: Evaluation of Magnetoelectric Sensor Systems for Cardiological Applications, Measurement (Elsevier), ISSN 0263-2241, https://doi.org/­10.1016/­j.measurement.2017.09.047, 2017

S. Graf, T. Herbig, M. Buck, G. Schmidt: Low-Complexity Pitch Estimation Based on Phase Differences Between Low-Resolution Spectra, Proc. Interspeech, pp. 2316 -2320, 2017

Contact

Prof. Dr.-Ing. Gerhard Schmidt

E-Mail: gus@tf.uni-kiel.de

Christian-Albrechts-Universität zu Kiel
Faculty of Engineering
Institute for Electrical Engineering and Information Engineering
Digital Signal Processing and System Theory

Kaiserstr. 2
24143 Kiel, Germany

Recent News

Jugend Forscht

On November 24th, one of our DSS team members, Owe Wisch, took part in the "Jugend forscht Perspektivforum" at the CAU. Thirty young students from the "Jugend forscht" project came to Kiel and participated in three different workshops focusing on career paths in maritime climate protection. Owe Wisch from our chair lead one of the workshops and presented his research topics, beamforming ...


Read more ...