Hide Advanced Options
Courses - Spring 2025
BIOI
Bioinformatics and Computational Biology
Open Seats as of
11/20/2024 at 10:30 PM
BIOI601
Probability and Statistics
Credits: 3
Grad Meth: Reg
Prerequisite: Undergraduate courses in calculus and basic linear algebra.
Cross-listed with: DATA601, MSML601.
Credit only granted for: BIOI601, DATA601 or MSML601.
Provides a solid understanding of the fundamental concepts of probability theory and statistics. The course covers the basic probabilistic concepts such as probability space, random variables and vectors, expectation, covariance, correlation, probability distribution functions, etc. Important classes of discrete and continuous random variables, their inter-relation, and relevance to applications are discussed. Conditional probabilities, the Bayes formula, and properties of jointly distributed random variables are covered. Limit theorems, which investigate the behavior of a sum of a large number of random variables, are discussed. The main concepts random processes are then introduced. The latter part of the course concerns the basic problems of mathematical statistics, in particular, point and interval estimation and hypothesis testing.
BIOI602
Principles of Data Science
Credits: 3
Grad Meth: Reg, Aud
Restriction: Must be in one of the following programs: (Data Science Post-Baccalaureate Certificate, Master of Professional Studies in Data Science and Analytics, or Master of Professional Studies in Machine Learning).
Cross-listed with: DATA602, MSML602.
Credit only granted for: BIOI602, DATA602, MSML602 or CMSC641.
Formerly: CMSC641.
An introduction to the data science pipeline, i.e., the end-to-end process of going from unstructured, messy data to knowledge and actionable insights. Provides a broad overview of what data science means and systems and tools commonly used for data science, and illustrates the principles of data science through several case studies.
BIOI603
Principles of Machine Learning
Credits: 3
Grad Meth: Reg, Aud
Restriction: Must be in one of the following programs: (Data Science Post-Baccalaureate Certificate, Master of Professional Studies in Data Science and Analytics, or Master of Professional Studies in Machine Learning).
Cross-listed with: DATA603, MSML603, MSQC603.
Credit only granted for: BIOI603, DATA603, MSML603, MSQC603 or CMSC643.
Formerly: CMSC643.
A broad introduction to machine learning and statistical pattern recognition. Topics include: Supervised learning: Bayes decision theory, discriminant functions, maximum likelihood estimation, nearest neighbor rule, linear discriminant analysis, support vector machines, neural networks, deep learning networks. Unsupervised learning: clustering, dimensionality reduction, PCA, auto-encoders. The course will also discuss recent applications of machine learning, such as computer vision, data mining, autonomous navigation, and speech recognition.
BIOI605
Data Sources and Data Management in Bioinformatics
Credits: 3
Grad Meth: Reg, Aud
An introduction to the different types of data generated for bioinformatics analyses and data management principles required for scientific rigor and reproducibility. Data sources include, but are not limited to, sequencing data, 'omics data (e.g., proteomics, metabolomics, lipidomics), imaging data, and clinical data. Data organization will cover topics such as management and curation of metadata, downloading data from and submitting data to public repositories, and using databases versus spreadsheets and tables.
BIOI606
Sequence Alignment
Credits: 3
Grad Meth: Reg, Aud
In-depth coverage of biological sequence alignment including the following: definitions, algorithms, and statistics for local, global, pairwise, and multiple alignments; scoring schemes; BLAST, BLAST variants, and similar programs; motif finding; and related topics.
BIOI607
Data Structures and Algorithms for Bioinformatics
Credits: 3
Grad Meth: Reg, Aud
An introduction to the fundamental data structures and algorithms underlying many parts of Bioinformatics. Standard data structures for efficient indexing and sequence search will be covered, including the suffix array and the FM-index, as will alignment-free methods for sequence comparison. This course will also introduce the fundamental algorithms in computational phylogenomics and biological network analysis. Finally, bioinformatics oriented applications of classic unsupervised learning algorithms (e.g., clustering and dimensionality reduction) and database techniques (e.g., sorting, selection, joining) will be examined. The focus will be both on formal understanding of computational efficiency as well as the practical applications of these concepts.