Lecture "Pattern Recognition"

Basic Information

Lecturers:   Gerhard Schmidt (lecture) and Tobias Hübschen (exercise)
Room:   F-SR-II
Language:   English
Target group:   Students in electrical engineering and computer engineering
Prerequisites:   Basics in system theory
Contents:  

In this lecture the basics of speech, audio, and music signal processing are treated. Often schemes that are based on statistical optimization are utilized for these applications. The involved cost function are matched to the human audio perception.

Topic overview:

  • Preprocessing to reduce signal distortions
    • Noise reduction
    • Beamforming
  • Speech and speaker recognition
    • Fundamentals of speech generation
    • Feature extraction
    • Gaussian mixture models (GMMs)
    • Hidden Markov models (HMMs)
    • Recognition of speech and speakers
  • Enhancement of signal playback
    • Extending the bandwidth of speech signals
    • Equalization of loudspeakers
    • Upmix of stereo signals for playback with more than two loudspeakers

 

News

No news yet!

 

Lecture Slides

The slides of the lecture can be found here.

 

Matlab Demos

  Matlab demo (GUI based) for adaptive noise suppression
  Matlab demo (GUI based) for linear prediction

 

Exercises

Please note that the questionnaires will be uploaded every week before the excercises, if you download them earlier, you won't get the most recent version.

de en    
  Questionnaire for the lecture "Noise Suppression"
  Questionnaire for the lecture "Beamforming"
  Questionnaire for the lecture "Feature extraction"
  Questionnaire for the lecture "Codebook training"
  Questionnaire for the lecture "Bandwidth extension"
  Questionnaire for the lecture "Gaussian Mixture Models"
  Questionnaire for the lecture "Speaker recognition"
  Questionnaire for the lecture "Hidden Markov Models"
  Questionnaire for the lecture "Speech recognition"

 

Talks

At the end of the semester, each student will give a talk about a certain topic. The aim is both to give you the chance to work on a pattern recognition-related topic that interests you, and to improve your presentational skills. The talk is also a prerequisite for your admission to the exam. The talks should take ten minutes, plus 2.5 minutes of discussion and 2.5 minutes of feedback.

Below you can find the schedule of the talks.

Date   Room   Time   Topic   Presenter(s)
xx.xx.2018   F-SR-II   xx:xx h        

 

Exams

Below is the list of students with their exam dates. If you do not have a date for the exam yet please use the oral exam booking system on this website.

Date   Time   Students (matriculation numbers)   Assessor
xx.xx.2018   xx:xx h       Tobias Hübschen

 

Website News

01.10.2017: Started with a Tips and Tricks section for KiRAT.

01.10.2017: Talks from Jonas Sauter (Nuance) and Vasudev Kandade Rajan (Harman/Samsung) added.

13.08.2017: New Gas e.V. sections (e.g. pictures or prices) added.

05.08.2017: The first "slide carousel" added.

Recent Publications

J. Reermann, P. Durdaut, S. Salzer, T. Demming,A. Piorra, E. Quandt, N. Frey, M. Höft, and G. Schmidt: Evaluation of Magnetoelectric Sensor Systems for Cardiological Applications, Measurement (Elsevier), ISSN 0263-2241, https://doi.org/10.1016/j.measurement.2017.09.047, 2017

S. Graf, T. Herbig, M. Buck, G. Schmidt: Low-Complexity Pitch Estimation Based on Phase Differences Between Low-Resolution Spectra, Proc. Interspeech, pp. 2316 -2320, 2017

Contact

Prof. Dr.-Ing. Gerhard Schmidt

E-Mail: gus@tf.uni-kiel.de

Christian-Albrechts-Universität zu Kiel
Faculty of Engineering
Institute for Electrical Engineering and Information Engineering
Digital Signal Processing and System Theory

Kaiserstr. 2
24143 Kiel, Germany

Recent News

Jens Reermann Defended his Dissertation with Distinction

On Friday, 21st of June, Jens Reermann defended his research on signals processing for magnetoelectric sensor systems very successfully. After 90 minutes of talk and question time he finished his PhD with distinction. Congratulations, Jens, from the entire DSS team.

Jens worked for about three and a half years - as part of the collaborative research center (SFB) 1261 - on all kinds of signal ...


Read more ...