Course Name: Perceptual Audio and Speech Processing (IT806)


M.Tech (IT)


Elective Courses (Ele)

Credits (L-T-P): (3-0-0) 3


Fundamentals of Audio and Speech Processing; Speech and Audio Analysis: Transforms (STFT, DCT, Wavelets and Gammatone Filter Banks); Audio and Speech Compression Standards: MPEG, AC-3, E-AC-3 and AAC; Human Auditory Perception; Perceptual Audio Quality Metrics; Perceptual Audio Coding and Processing of Digital Speech; Speech and Audio Storage, Retrieval and Communication; Applications and Research Trends.


Jacob Benesty, M. Mohan Sondhi and Yiteng Huang, “Handbook of Speech Processing”, Springer-Verlag, 2008.
Andreas Spanias, Ted Painter and Venkatraman Atti, “Audio Signal Processing and Coding”, Wiley-Interscience, 2007.
Soren Bech and Nick Zacharov, “Perceptual Audio Evaluation - Theory, Method and Application”, Wiley, 2006.
Hugo Fastl and Eberhard Zwicker, “Psychoacoustics: Facts and Models”, Springer, 3rd edition, 2006.
Marina Bosi and Richard E. Goldberg, “Introduction to Digital Audio Coding and Standards”, Springer, 2002.
Ben Gold and Nelson Morgan, “Speech and Audio Signal Processing: Processing and Perception of Speech and Music”, Wiley, 1999.


