Schedule of Classes

CMSC

Computer Science Department Site

CMSC848M

Selected Topics in Information Processing; Multimodal Computer Vision

Credits: 3

Grad Meth: Reg, Aud

The future of Artificial Intelligence demands a paradigm shift towards multimodal perception, enabling systems to interpret and fuse information from diverse sensory inputs. While we humans perceive the world by looking, listening, touching, smelling, and tasting, tradit form of machine intelligence has primarily focused on a single sensory modality, often vision. To truly understand the world around us, AI must learn to jointly interpret multimodal signals. This graduate-level seminar course explores computer vision from a multimodal perspective, focusing on learning algorithms that augment vision with other essentiamodalities, such as audio, touch, language, and more. The majority of the course will consist of student presentations, experiments, and paper discussions, and we will delve into the latest research and advancements in multimodal perception.

Hide Sections

0101

Ruohan Gao

Seats (Total: 26, Open: 0, Waitlist: 0 )

Tu 3:30pm - 6:00pm

CSI 2120

PJ01

Ruohan Gao

Seats (Total: 14, Open: 4, Waitlist: 0 )

Tu 3:30pm - 6:00pm

CSI 2120

Must be in the Computer Science (M.S.) program. Golden ID students are not eligible for this section.