Automated generation of structured data from unstructured information sources such as text, images, audio, and video. Transformations include text representation, classification, topic and entity extraction, as well as captioning, transcription, and event description for other modalities. The course covers classical feature-based approaches and modern generative AI, especially large language models, with an emphasis on evaluation, limitations, and ethical considerations.