Speech Processing for B.Tech 7th sem is covered here. This gives the details about credits, number of hours and other details along with reference books for the course.
The detailed syllabus for Speech Processing B.Tech (R13) seventhsem is as follows
OBJECTIVES:
- To introduce speech production and related parameters of speech.
- To show the computation and use of techniques such as short time Fourier transform, linear predictive coefficients and other coefficients in the analysis of speech.
- To understand different speech modeling procedures such as Markov and their implementation issues.
UNIT I : BASIC CONCEPTS [10 hours]
Speech Fundamentals: Articulatory Phonetics – Production and Classification of Speech Sounds; Acoustic Phonetics – Acoustics of speech production; Review of Digital Signal Processing concepts; Short-Time Fourier Transform, Filter-Bank and LPC Methods.
UNIT II: SPEECH ANALYSIS [10 hours]
Features, Feature Extraction and Pattern Comparison Techniques: Speech distortion measures– mathematical and perceptual – Log–Spectral Distance, Cepstral Distances, Weighted Cepstral Distances and Filtering, Likelihood Distortions, Spectral Distortion using a Warped Frequency Scale, LPC, PLP and MFCC Coefficients, Time Alignment and Normalization – Dynamic Time Warping, Multiple Time – Alignment Paths.
UNIT III : SPEECH MODELING [8 hours]
Hidden Markov Models: Markov Processes, HMMs – Evaluation, Optimal State Sequence – Viterbi Search, Baum-Welch Parameter Re-estimation, Implementation issues.
TOTAL: 45 PERIODS
OUTCOMES: Upon completion of the course, students will be able to:
- Model speech production system and describe the fundamentals of speech.
- Extract and compare different speech parameters.
- Choose an appropriate statistical speech model for a given application.
- Design a speech recognition system.
- Use different speech synthesis techniques.
TEXTBOOKS:
- Lawrence Rabiner and Biing-Hwang Juang, “Fundamentals of Speech Recognition”, Pearson Education, 2003.
- Daniel Jurafsky and James H Martin, “Speech and Language Processing – An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition”, Pearson Education, 2002.
- Frederick Jelinek, “Statistical Methods of Speech Recognition”, MIT Press, 1997.
REFERENCES:
- Steven W. Smith, “The Scientist and Engineer‟s Guide to Digital Signal Processing”, California Technical Publishing, 1997.
- Thomas F Quatieri, “Discrete-Time Speech Signal Processing – Principles and Practice”, Pearson Education, 2004.
- Claudio Becchetti and Lucio Prina Ricotti, “Speech Recognition”, John Wiley and Sons, 1999.
- Ben Gold and Nelson Morgan, “Speech and Audio Signal Processing, Processing and Perception of Speech and Music”, Wiley- India Edition, 2006.
For all other B.Tech ECE 7th sem syllabus go to Anna University B.Tech ELECTRONICS AND COMMUNICATION ENGINEERING (ECE) 7th Sem Course Structure for (R13) Batch.All details and yearly new syllabus will be updated here time to time. Subscribe, like us on facebook and follow us on google plus for all updates.
Do share with friends and in case of questions please feel free drop a comment.