Schedule of Classes

DATA100

Elementary Statistics and Probability

Credits: 3

Grad Meth: Reg, P-F, Aud

GenEd: FSAR, FSMA

Prerequisite: MATH107, MATH113, or MATH115 with a C- or better; or permission of CMNS-Mathematics department; or must have math eligibility of STAT100 or higher and math eligibility is based on the Math Placement Exam or the successful completion of Math 003 with appropriate eligibility.

Restriction: Must not have completed MATH111; or must not have completed any STAT course with a prerequisite of MATH141.

Cross-listed with: STAT100.

Credit only granted for: DATA100 or STAT100.

Simplest tests of statistical hypotheses; applications to before-and-after and matched pair studies. Events, probability, combinations, independence. Binomial probabilities, confidence limits. Random variables, expected values, median, variance. Tests based on ranks. Law of large numbers, normal approximation. Estimates of mean and variance.

Show Sections

DATA110

Applications of R for Data Science

Syllabus Repository (0)

Credits: 1

Grad Meth: Reg, P-F, Aud

Prerequisite: DATA100, STAT100, MATH135, or any 400-level STAT course.

Cross-listed with: STAT110.

Credit only granted for: STAT110 or DATA110.

Intended to prepare students for subsequent courses requiring computation with R, providing powerful and easy to use tools for statistical data analysis. Covers basics of R and R Studio including file handling, data simulation, graphical displays, vector and function operations, probability distributions, and inferential techniques for data analysis.

Show Sections

DATA120

Python Programming for Data Science

Syllabus Repository (0)

Credits: 1

Grad Meth: Reg

Prerequisite: DATA100, STAT100, MATH135, or any 400-level STAT course.

An introduction to programming in Python language, using Jupyter Notebooks and Python scripts. Covers variables, conditionals, loops, functions, lists, strings, tuples, sets, dictionaries, files and visualization.

Show Sections

DATA200

Knowledge in Society: Science, Data and Ethics

Syllabus Repository (0)

Credits: 3

Grad Meth: Reg, P-F, Aud

Prerequisite: DATA100, STAT100, MATH135, or any 400-level STAT course.

An introduction to the fundamental concepts and principles that govern ethical data collection, analysis, and usage. Students will explore various methods of data collection and gain an understanding of the ethical implications associated with each approach. Key topics include data ownership, data privacy, data anonymity, and data validity, providing students with a solid foundation in ethical data practices.

Show Sections

DATA250

Discrete Mathematics

Syllabus Repository (1)

Credits: 4

Grad Meth: Reg, P-F, Aud

Prerequisite: DATA110 or DATA120; and MATH141.

Introduction to basic discrete mathematical and linear algebraic structures and use of these mathematical structures to solve programming problems. Logic, set theory, formal proof methodology, functions, combinatorics, advanced counting techniques, and elements of linear algebra.

Show Sections

DATA320

Introduction to Data Science

Syllabus Repository (0)

Credits: 3

Grad Meth: Reg, P-F, Aud

Prerequisite: DATA110 or DATA120; and DATA200 and DATA250; or by permission of the DATA Program Director.

Restriction: Must not be a Computer Science major.

Jointly offered with: CMSC320.

Credit only granted for: CMSC320 or DATA320.

An introduction to data science i.e., the end-to-end process of going from unstructured, messy data to knowledge and actionable insights. Provides a broad overview of several topics including statistical data analysis, basic data mining and machine learning algorithms, large-scale data management, cloud computing, and information visualization.

Show Sections

DATA350

Data Visualization and Presentation

Syllabus Repository (0)

Credits: 3

Grad Meth: Reg, P-F, Aud

Prerequisite: DATA100, STAT100, MATH135, or any 400 level STAT course; and DATA110 or DATA120.

Introduction to effective and intuitive visual representations of data, including customizing graphics, plotting arrays, statistical graphics, and representing time series.

Show Sections

DATA400

Applied Probability and Statistics I

Syllabus Repository (0)

Credits: 3

Grad Meth: Reg, P-F, Aud

Prerequisite: 1 course with a minimum grade of C- from (MATH131, MATH141); or students who have taken courses with comparable content may contact the department.

Cross-listed with: STAT400.

Credit only granted for: DATA400, ENEE324, or STAT400.

Additional information: Not acceptable toward graduate degrees in MATH/STAT/AMSC.

Random variables, standard distributions, moments, law of large numbers and central limit theorem. Sampling methods, estimation of parameters, testing of hypotheses.

Show Sections

DATA602

Principles of Data Science

Syllabus Repository (0)

Credits: 3

Grad Meth: Reg, Aud

Restriction: Must be in one of the following programs: (Data Science Post-Baccalaureate Certificate, Master of Professional Studies in Data Science and Analytics, or Master of Professional Studies in Machine Learning).

Cross-listed with: BIOI602, MSAI602, MSML602.

Credit only granted for: BIOI602, DATA602, MSAI602, MSML602 or CMSC641.

Formerly: CMSC641.

An introduction to the data science pipeline, i.e., the end-to-end process of going from unstructured, messy data to knowledge and actionable insights. Provides a broad overview of what data science means and systems and tools commonly used for data science, and illustrates the principles of data science through several case studies.

Show Sections

DATA603

Principles of Machine Learning

Syllabus Repository (0)

Credits: 3

Grad Meth: Reg

Restriction: Must be in one of the following programs: (Data Science Post-Baccalaureate Certificate, Master of Professional Studies in Data Science and Analytics, or Master of Professional Studies in Machine Learning).

Cross-listed with: BIOI603, MSAI603, MSML603, MSQC603.

Credit only granted for: BIOI603, DATA603, MSAI603, MSML603, MSQC603 or CMSC643.

Formerly: CMSC643.

A broad introduction to machine learning and statistical pattern recognition. Topics include: Supervised learning: Bayes decision theory, discriminant functions, maximum likelihood estimation, nearest neighbor rule, linear discriminant analysis, support vector machines, neural networks, deep learning networks. Unsupervised learning: clustering, dimensionality reduction, PCA, auto-encoders. The course will also discuss recent applications of machine learning, such as computer vision, data mining, autonomous navigation, and speech recognition.

Show Sections

DATA604

Data Representation and Modeling

Syllabus Repository (0)

Credits: 3

Grad Meth: Reg

Prerequisite: DATA601 or MSML601.

An introductory course connecting students to the most recent developments in the field of data science. It covers several fundamental mathematical concepts which form the foundations of Big Data theory. Among the topics included are Principal Component Analysis, metric learning and nearest neighbor search, elementary spectral graph theory, minimum and maximum graph cuts, graph partitions, Laplacian Eigenmaps, manifold learning and dimension reduction concepts, clustering and classification techniques such as k-means, kernel methods, Mercer's theorem, and Support Vector Machines. Some relevant concepts from geometry and topology will be also covered.

Show Sections

DATA605

Big Data Systems

Syllabus Repository (0)

Credits: 3

Grad Meth: Reg

Prerequisite: DATA602.

Restriction: Must be in the Data Science Post-Baccalaureate Certificate of Professional Studies or Master of Professional Studies in Data Science and Analytics program.

Credit only granted for: DATA605 or CMSC642.

Formerly: CMSC642.

An overview of data management systems for performing data science on large volumes of data, including relational databases, and NoSQL systems. The topics covered include: different types of data management systems, their pros and cons, how and when to use those systems, and best practices for data modeling.

Show Sections

DATA606

Algorithms for Data Science

Syllabus Repository (0)

Credits: 3

Grad Meth: Reg

Prerequisite: DATA602.

Restriction: Must be in the Data Science Post-Baccalaureate Certificate of Professional Studies or Master of Professional Studies in Data Science and Analytics program.

Credit only granted for: DATA606 or CMSC644.

Formerly: CMSC644.

Provides an in-depth understanding of some of the key data structures and algorithms essential for advanced data science. Topics include random sampling, graph algorithms, network science, data streams, and optimization.

Show Sections

DATA641

Natural Language Processing

Syllabus Repository (0)

Credits: 3

Grad Meth: Reg

Prerequisite: DATA603 or MSML603.

Cross-listed with: MSAI641, MSML641.

Credit only granted for: DATA641, MSAI641 or MSML641.

Introduces fundamental concepts and techniques involved in getting computers to deal more intelligently with human language. Focused primarily on text (as opposed to speech), the class will offer a grounding in core NLP methods for text processing (such as lexical analysis, sequential tagging, syntactic parsing, semantic representations, text classification, unsupervised discovery of latent structure), key ideas in the application of deep learning to language tasks, and consideration of the role of language technology in modern society.

Show Sections