IT806
Course Name:
Perceptual Audio and Speech Processing (IT806)
Programme:
M.Tech (IT)
Category:
Elective Courses (Ele)
Credits (L-T-P):
(3-0-0) 3
Content:
Fundamentals of Audio and Speech Processing; Speech and Audio Analysis: Transforms – STFT, DCT, Wavelets and Gamma tone Filter banks; Audio and Speech Compression
Standards: MPEG, AC-3, EAC-3 and AAC; Human Auditory Perception; Perceptual Audio Quality Metrics, Perceptual Audio Coding and Processing of Digital Speech; Speech and
Audio Storage, Retrieval and Communication; Applications and Research Trends.
References:
Jacob Benesty, M. Mohan Sondhi and Yiteng Huang, Handbook of Speech Processing, Springer-Verlag, 2008.
Andreas Spanias, Ted Painter and Venkatraman Atti, “AudioSignal Processing and Coding”, Wiley-Interscience, 2007.
Soren Bech and Nick Zacharov, “Perceptual Audio Evaluation - Theory, Method and Application”, Wiley, 2006.
Hugo Fastl and Eberhard Zwicker, “Psychoacoustics: Facts and Models”, Springer, 3rd edition, 2006.
Marina Bosi and Richard E. Goldberg, “Introduction to Digital Audio Coding Standards”, Springer, 2002.
Ben G. and Nelson M., “Speech and Audio Signal Processing: Processing and Perception of Speech and Music”, Wiley, 1999.
Department:
Information Technology