Courses - Spring 2025
CMSC
Open Seats as of 10/30/2024 at 10:30 PM
CMSC848O
Selected Topics in Information Processing; Long-Context Language Models
Credits: 3
Grad Meth: Reg, Aud
Restriction: Must be in the Computer Science Master's or Doctoral programs, or permission of instructor.

Focuses on recent developments in training, aligning, and evaluating long-context language models, which have allowed cutting-edge LLMs to process and generate millions of words. Topics include neural architectures (e.g., Transformers, Mamba), extended-context fine-tuning/upscaling, and tasks such as summarization and QA over books.