Hide Advanced Options
Courses - Spring 2023
DATA
Data Science and Analytics
DATA110
Applications of R for Data Science
Credits: 1
Grad Meth: Reg, P-F, Aud
Prerequisite: DATA100, STAT100, or MATH135; or any 400-level STAT course.
Cross-listed with: STAT110.
Credit only granted for: STAT110 or DATA110.
Intended to prepare students for subsequent courses requiring computation with R, providing powerful and easy to use tools for statistical data analysis. Covers basics of R and R Studio including file handling, data simulation, graphical displays, vector and function operations, probability distributions, and inferential techniques for data analysis.
DATA120
Python Programming for Data Science
Credits: 1
Grad Meth: Reg, P-F, Aud
Prerequisite: STAT100, MATH135, or any 400-level STAT course.
An introduction to programming in Python language, using Jupyter Notebooks and Python scripts. Covers variables, conditionals, loops, functions, lists, strings, tuples, sets, dictionaries, files and visualization.
DATA604
Data Representation and Modeling
Credits: 3
Grad Meth: Reg
Prerequisite: DATA601 or MSML601.
An introductory course connecting students to the most recent developments in the field of data science. It covers several fundamental mathematical concepts which form the foundations of Big Data theory. Among the topics included are Principal Component Analysis, metric learning and nearest neighbor search, elementary spectral graph theory, minimum and maximum graph cuts, graph partitions, Laplacian Eigenmaps, manifold learning and dimension reduction concepts, clustering and classification techniques such as k-means, kernel methods, Mercer's theorem, and Support Vector Machines. Some relevant concepts from geometry and topology will be also covered.
DATA605
Credits: 3
Grad Meth: Reg
Prerequisite: DATA602.
Restriction: Must be in the Data Science Post-Baccalaureate Certificate of Professional Studies or Master of Professional Studies in Data Science and Analytics program.
Credit only granted for: DATA605 or CMSC642.
Formerly: CMSC642.
An overview of data management systems for performing data science on large volumes of data, including relational databases, and NoSQL systems. The topics covered include: different types of data management systems, their pros and cons, how and when to use those systems, and best practices for data modeling.
DATA606
Algorithms for Data Science
Credits: 3
Grad Meth: Reg
Prerequisite: DATA602.
Restriction: Must be in the Data Science Post-Baccalaureate Certificate of Professional Studies or Master of Professional Studies in Data Science and Analytics program.
Credit only granted for: DATA606 or CMSC644.
Formerly: CMSC644.
Provides an in-depth understanding of some of the key data structures and algorithms essential for advanced data science. Topics include random sampling, graph algorithms, network science, data streams, and optimization.
DATA641
Natural Language Processing
Credits: 3
Grad Meth: Reg
Prerequisite: DATA603 or MSML603.
Cross-listed with: MSML641.
Credit only granted for: DATA641 or MSML641.
Introduces fundamental concepts and techniques involved in getting computers to deal more intelligently with human language. Focused primarily on text (as opposed to speech), the class will offer a grounding in core NLP methods for text processing (such as lexical analysis, sequential tagging, syntactic parsing, semantic representations, text classification, unsupervised discovery of latent structure), key ideas in the application of deep learning to language tasks, and consideration of the role of language technology in modern society.