2021-2022 Catalog

CS 479 Data Mining and Machine Learning

Data mining involves the processing, analysis, and presentation of data to gain valuable information. Machine learning refers to a broad set of algorithms for identifying patterns in data to build models that might then be possibly productized. Students completing this course will develop an understanding of data mining concepts such as proximity measurement, data preparation, cluster analysis, classification and regression and apply machine learning algorithms such as supervised learning, unsupervised learning, and deep learning.




  1. As a result of this course, students will know or be able to do the following:
  2. Understand the bias-variance tradeoff in supervised learning.
  3. Understand the importance of feature selection for clustering, classification, and regression.
  4. Apply clustering, classification, and regression algorithms to small and medium data sets.
  5. Analyze the performance of supervised and unsupervised algorithms using various metrics.
  6. Evaluate criteria that might lead to selection of one method over another in supervised learning.
  7. Create a data mining and machine learning deliverable using appropriate algorithms.