L  W  D  Date  Lecture and Assignment 
Part I: Acoustics/VT modeling/Signal processing (20 Lectures)  
1  35  M  8/27  Introductions. Lecture: Overview: Speech production, Acoustics, Psychophysics, neural and information processing. Entropy: Its meaning, definition and the intuition behind it; Read: Flanagan, Chapt. 1: (pdf);Entire Book: pdf (2014) 
2  W  8/29  Lecture: Mechanisms of Speech production: Sounds of speech: Vowels and consonants; Read: Chapter 2 HW1: Basic Acoustics (due 1 week) (HW01.pdf)  
3  F  8/31  Lecture: Pressure, volume velocity, impedance. Intensity, speech power; sound level; dB, dBSPL Read: Chapter 3 (pdf)  
  36  M  9/3  Labor Day Holiday  No class 
4  W  9/5  Lecture: 1D wave equation; ABCDTransmission (Chain) matrix; Acoustic Transmission lines; Matrix formulations; Vowels, Formants; Helmholtz resonator pdf) Read: Flanagan, Ch. 2.2, 3.13.2;HW1 due HW2: TL and reflectance (due 1 week) (pdf, txline.m)  
5  2  F  9/7  Lecture: Solution of 1D transmission line equation: Lip radiation impedance, The Helmholtz resonator Read: Lecture on Horns (pdf) 
6  37  M  9/10  Lecture:d'Alembert solutions for 1D and 3D transmission lines; Sound propagation in tubes in speech production; Introduction to reflectance; Read: Wave model of the Cat eardrum, Parent and Allen 2007 pdf), 2010 Human ear canal pdf, ; Review: Octave/Matlab filter design techniques: bilinear Z, FIR, IIR 
7  W  9/12  Lecture: Transmission Lines with complex loads and the Propagated Reflectance; Text: Flanagan, Ch. VI (Sect. 6.262, pp 272276); Readings:Conversion tables for 2ports pdf) Bilbao PhD Thesis p. 115 pdf) 25% HW2 due; HW3: TLs with complex terminations due in two weeks) Simulation of the middle ear:'' pdf  
8  F  9/14  Lecture:Impedance and reflectance at a tube junction (Karal correction pdf), and halfsphere (the mouth) Text: Ch. 3, Sec. 3.3, pages 136152  
9  38  M  9/17  Lecture: 2port networks; Read: Bilabo Thesis (page 14) pdf);Text Chap. 3, sect. 3.8.1 pages 3134 radiation impedance load; Review: Rosowski, Carney and Peak (1988) on the cat middle ear pdf; Background material: Guinan and Peak djvu; pdf missing; Lynch et al. 1982 pdf 
10  W  9/19  Lecture: cont. 2port and the 3port nasal tract; Network Postulates pdf; Conversion tables for 2portsOptional Read:Thevenin (pdf),Norton (pdf) Read:Peterson and Barney (1952) djvu andPeterson (1952) djvu  
11  F  9/21  Lecture: Horn Radiation Impedance; Karal Correction; Bernoulli's equation Text: Flanagan pp 4153; Read: vanDen Berg (1957) djvu  
12  39  M  9/24  Lecture: 'Signal processing:' Fourier Series, Fourier Transform, Laplace Transform, ZT, DTFT, DFT, FFT 
13  W  9/26  Lecture: Cepstral analysis; HW3 Due txline.m, gamma.m,HW4: VocalTract Simulation: Due in two weeks (pdf, Octave/Matlab m files)  
14  F  9/28  Lecture: History of acoustics: , pdf BC: Pythagoras; Aristotle; 17C: Mersenne, Marin; Galilei, Galileo; Hooke, Robert; Boyle, Robert; Newton, Sir Issac; 18C: Bernoulli, Daniel; Euler; Lagrange; d'Alembert; 19C: Gauss; Laplace; Fourier; Helmholtz; Heaviside; Strutt, William; Rayleigh, Lord;20C: pdf, George; Hilbert, David; , Emmy; Fletcher (1929), Harvey; Nyquist pdf, Harry; Bode, Henrik; Dudley, Homer; Shannon, Claude; Flanagan, James;  
15  40  M  10/1  Lecture: Review HW1, HW2, HW3; The glottal oscillator & Bernoulli's (, , ) Read: Flanagan Sec. 3.74 pages 6972 
16  W  10/3  Lecture: Linear prediction of speech; Read: Flanagan Sec. 8.112, pp 372376; Sec. 8.13, pp 390395; Atal and Hanauer (1971) pdf missing, djvu  
17  F  10/5  Lecture: 'STFT window methods; STFT for speech processing with analysis/synthesis; Inverse STFT; pdf, pdf,Reading spectrograms: classroom exercises Read: Flanagan Chap. 8, pp 361363; Begin writing your Final exam: Part I;  
  41  M  10/8  No office hours; Optional class: Discuss of Exam; No class due to Exam I: 79:30 PM; 3013 ECEB NO CRIB SHEETs 
Part II: Psychoacoustics (10 Lectures)  
18  W  10/10  Lecture: Room acoustics; point source; 1, 2 and 6 wall Image method; Wall reflection coef. for finite impedance walls: (pdf); Di and Gilbert (1993) (pdf)  
19  F  10/12  Lecture: Music and audio compression coding (MPEG) Read: $\mu$Law speech coding pdf; Music coding methods pdf  
20  42  M  10/15  Special Guest Lecture on Digital Speech Coding/compression, Prof. Mark HasagawaJohnson; \\ 
21  W  10/17  Lecture: Psychoacoustics I: Internal noise model of the JND Riesz puretone intensity JND (1928) Masking: Weber's and Fechner's Law; Intensity JND and the nearmiss; 'Psychoacoustics II:' Introduction to loudness, Steven's Law; Loudness Lecture notes (Allen): pdf Read: Flanagan Chapter 4 (pdf), Allen Review (pages 2030) pdf, Fletcher and Munson (1933) (pp 8294) pdf HW5: LPC (Due 1 week 10/22) (pdf, Speech samples) HW4 due  
22  F  10/19  Lecture: Cochlear Physiology I: Middle ear and inner ear (Cochlear) anatomy, basilar membrane, 1D Models, Hair cells, Nonlinear basilar membrane; Frequency JND, semitone, Internal noise and Masking; relation between the intensity and frequency JND (Cochlear frequency response and the slope of the tuning curve); Read: Review of Cochlear Modeling: Part II (pp 1928): pdf; page 151(166)155(170) of Fletcher's 1929 Book djvu missing? MIT/HST725: The auditory system pdf  
23  43  M  10/22  Lecture: Cochlear Physiology II: traveling waves, neural tuning curves, critical bands, hair cells, neural masking, Upward spread of masking; Forward masking; Auditory Pathway I: Neural Tuning Read: Review of Cochlear Modeling (pp 119) (pdf) 
24  W  10/24  Lecture: Cochlear Physiology III: Micromechanics, OHC, IHC Lecture Notes (): Modeling the Cochlea and Organ of Corti, and Read: Wegel and Lane (1924), Part II (pdf) HW5 due HW6: Radial waves/STFT/OLA/Speech coding (Due in 2 weeks) (pdf, speech files)  
25  F  10/26  Lecture: Psychoacoustics III: Relations between Psychophysics and the cochlea; Greenwood's placemap function Read: Review of Cochlear Modeling (pp 119) (pdf), Part I; Fletcher and Munson (1933) (pp 8294) (pdf)  
26  44  M  10/29  Lecture: Cochlear Critical bands Read:'Harvey Fletcher's role ...' (not the section on the Articulation Index (AI)) pdf 
27  W  10/31  Lecture: Cochlear Physiology IV: The nonlinear Cochlea; The upward spread of masking (USM) and 2tone suppression (2TS) Supplement: The Auditory Nerve (pdf) Read: Features in speech Allen and Li, (2010) (pdf)  
28  F  11/2  Lecture: Prof. Wickesberg, Part I: The central auditory system; Neural signal and the Auditory Pathway)  
29  45  M  11/5  Lecture: Prof. Wickesberg, Auditory Pathway (AN+CN), Part II; KalluriJCNS (pdf, pdf) 
30  W  11/7  Lecture: Prof. Wickesberg, Auditory Pathway (AN+CN): Part III: (pdf, pptx)  
Part III: Human Speech Recognition/Information Processing (10 Lectures)  
31  F  11/10  Lecture: Information theory I: Information, Entropy, Relative Entropy; Channel Capacity Read: Pierce, JR ''Early days of Information Theory (pdf)  
32  46  M  11/12  Lecture: Information theory II: Morse code example Shannon Channel Read: pp 110 Shannon (1948) (pdf I, pdf III) 
33  W  11/14  Lecture: Information theory III:; Entropy, Relative Entropy, Markov models, State diagram; Read: Shannon (1950) (pdf)  
34  F  11/16  Lecture: EM algorithm: Example: Speech and noise separation Read: French and Steinberg (1947) (pdf); GoodTuring djvu  
  48      Thanksgiving Holiday (11/1711/26) Discussion of the EM alg. with examples 
35  47  M  11/26  Lecture: Articulation Index/Speech Transmission Index/Speech Intelligibility Index; Support material: RV Shannon et al (1995) pdfSmith et al. djvu HW7: Information processing (Due 1 Week); (pdf, files) 
36  W  11/28  Lecture: STI/SII Read: Steeneken & Houtgast (1980) Speech Transmission Index (djvu) and Houtgast (1989) Modulation detection (djvu)  
37  F  11/30  Lecture: Human speech recognition (HSR), Articulation Index (AI), average score: {$P'_c_'(AI)=1e'_[chance]_'e'_[min]_''^[AI]^'$}, The confusion matrix (CM), maximum entropy syllable error models, etc.; Allen notes (djvu) Read: Continue with French and Steinberg (1947) djvu; Miller Nicely (1955) djvu; Miller Nicely confusions as a function of the articulation index; entropy, grouping and chance pdf DUE: Preliminary version of your Final Exam, for review  
  49  M  12/3  Exam II: (NO CLASS) Psychoacoustics, Physiology, Speech, LPC, Tubes, AI, Historical items;Place: 3013 ECEB; Time: 79 PM Assignment: Work on Final exam 
38  W  12/5  Lecture: Effects of language and semantic context Miller (1962), Boothroyd (1988), Allen Notes: Events and the AI (pdf) Read: Miller, Heisen and Lichten (1951) (pdf)  
39  F  12/7  Lecture: Language context models, Boothroyd; Bronkhorst93  
40  50  M  12/10  Lecture: Language context models, cont., Bronkhorst93 Read: W. Li Random texts exhibit Zipf'sLaw djvu 
41  W  12/12  HW7 due; Course Review; Final paper Prep;  
  R  12/14  Reading Day  
  51  M  12/1419  [Final paper Due before or on Dec 19] FinalThe final is a 1525 page paper, written in the style and format (but single column) of a journal paper, that discusses everything that you have learned in this course. Writing style, spelling, figures, labels of figures, are all part of the grade. Allen's Grading of the finalThe final is graded based on a list of all the topics that are covered. If there is a paragraph that discusses each topic on my list, then you get at least 1 point, and if the discussion covers the topic effectively, you can get up to 5 points. There are at least 20 topics on the list. When you get to 100 points, you get an A+ on the exam. I expect that you draw on the homework as a starting point. Don't just dump the homework into the exam without modification, that wont get you points. Don't just dump a large number of unexplained figures (that you got from someone else for example) and expect to get points. I need words around each figure. I am looking for insightful comments that link the material together. Your comments on the relevance of each of the topics I covered in this course, homework problems, exams, etc., are welcome. No points will be taken off, nor given, for strong opinions on my teaching style, or lack thereof, organization, or lack thereof, etc. Please put all such comments in a discussion section at the end of the paper, isolated from the rest of the material. 
      Not proofed beyond here

Powered by PmWiki