Vision and audition are stand-off senses that allow animals to observe the world and make sense of it. Cameras and microphones allow computer systems to capture the world, analyze it, and possibly recreate it. However, the fields of computer vision and computational audition have developed along somewhat different lines. This course takes a fundamentals-based view (math, signal processing, psychophysics) of the formulation of algorithms for acoustical scene analysis, speech and speaker recognition, source separation, scene recreation, and other topics. The focus will be on both classical formulations and their possible implementation in a deep learning context.