Must be in the Graduate Program in Computer Science. All other graduate students must request permission.
Discusses recent foundation models proposed in the literature, with a focus on vision-language models. Topics include large language models, vision-language models, and vision-audio models.