Course Name: Perceptual Audio and Speech Processing (IT807)


M.Tech (IT)


Elective Courses (Ele)

Credits (L-T-P): (3-0-0) 3


Fundamentals of Audio and Speech Processing; Speech and Audio Analysis: Transforms – STFT, DCT, Wavelets and Gammatone Filter Banks; Audio and Speech Compression Standards: MPEG, AC-3, E-AC-3 and AAC; Human Auditory Perception; Perceptual Audio Quality Metrics; Perceptual Audio Coding and Processing of Digital Speech; Speech and Audio Storage, Retrieval and Communication; Applications and Research Trends.


Jacob Benesty, M. Mohan Sondhi and Yiteng Huang, “Handbook of Speech Processing”, Springer-Verlag, 2008.
Andreas Spanias, Ted Painter and Venkatraman Atti, “Audio Signal Processing and Coding”, Wiley-Interscience, 2007.
Soren Bech and Nick Zacharov, “Perceptual Audio Evaluation - Theory, Method and Application”, Wiley, 2006.
Hugo Fastl and Eberhard Zwicker, “Psychoacoustics: Facts and Models”, Springer, 3rd edition, 2006.
Marina Bosi and Richard E. Goldberg, “Introduction to Digital Audio Coding and Standards”, Springer, 2002.
Ben Gold and Nelson Morgan, “Speech and Audio Signal Processing: Processing and Perception of Speech and Music”, Wiley, 1999.


Information Technology

