Selected Topics in Information Processing: Multimodal Foundation Models
Credits: 3
Grad Meth: Reg, Aud
Discusses recent foundation models proposed in the literature, with a focus on vision-language models. Topics include large language models, vision-language models, and vision-audio models.