Machine Learning-3 Syllabus



Introduction: Brief Introduction to Machine Learning, Abstraction and Knowledge Representation, Types of Machine Learning Algorithms, Definition of learning systems, Goals and applications of machine learning, Aspects of developing a learning system, Data Types, training data, concept representation, function approximation.

Data Pre-processing: Definition, Steps involved in pre-processing, Techniques.


Performance measurement of models: Accuracy, Confusion matrix, TPR, FPR, FNR, TNR, Precision, Recall, F1-score, Receiver Operating Characteristic (ROC) curve and AUC.
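To show how the measures in this unit relate to one another, here is a minimal Python sketch that derives them from the four counts of a binary confusion matrix. The function name `binary_metrics` and the example counts are assumptions made for illustration; they are not part of the syllabus, and the sketch assumes every denominator is non-zero.

```python
def binary_metrics(tp, fp, fn, tn):
    """Derive common metrics from binary confusion-matrix counts.

    Assumes every denominator is non-zero (illustrative sketch only).
    """
    tpr = tp / (tp + fn)          # true positive rate (recall, sensitivity)
    fnr = fn / (tp + fn)          # false negative rate = 1 - TPR
    fpr = fp / (fp + tn)          # false positive rate
    tnr = tn / (fp + tn)          # true negative rate (specificity) = 1 - FPR
    precision = tp / (tp + fp)    # fraction of predicted positives that are correct
    accuracy = (tp + tn) / (tp + fp + fn + tn)
    f1 = 2 * precision * tpr / (precision + tpr)  # harmonic mean of precision and recall
    return {
        "accuracy": accuracy,
        "tpr": tpr,
        "fpr": fpr,
        "fnr": fnr,
        "tnr": tnr,
        "precision": precision,
        "recall": tpr,
        "f1": f1,
    }


# Hypothetical counts, chosen only to exercise the formulas.
metrics = binary_metrics(tp=40, fp=10, fn=20, tn=30)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

An ROC curve plots TPR against FPR as the decision threshold varies, and AUC is the area under that curve; the sketch covers a single threshold only.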

Supervised Learning 1: Linear Regression, Multiple Variable Linear Regression, Naïve Bayes Classifiers, Gradient Descent, Multicollinearity, Bias-Variance trade-off.


Supervised Learning 2: Regularization, Logistic Regression, Squashing function, KNN, Support Vector Machine.

Decision Tree Learning: Representing concepts as decision trees, Recursive induction of decision trees, Picking the best splitting attribute: entropy and information gain, Searching for simple trees and computational complexity, Occam's razor, Overfitting, noisy data, and pruning. Decision Trees: ID3, CART, Error bounds.
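The splitting criterion named in this unit can be sketched in a few lines of Python: entropy measures the impurity of a set of labels, and information gain is the drop in entropy after partitioning on an attribute. The function names and the four-row toy data set below are assumptions for illustration only, not material from the syllabus.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a sequence of class labels."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())


def information_gain(attribute_values, labels):
    """Entropy reduction obtained by splitting `labels` on one attribute."""
    n = len(labels)
    # Group the labels by the attribute value of the corresponding row.
    groups = {}
    for value, label in zip(attribute_values, labels):
        groups.setdefault(value, []).append(label)
    # Weighted average entropy of the partitions after the split.
    remainder = sum(len(g) / n * entropy(g) for g in groups.values())
    return entropy(labels) - remainder


# Hypothetical toy data: the attribute separates the two classes perfectly.
outlook = ["sunny", "sunny", "rain", "rain"]
play = ["no", "no", "yes", "yes"]
gain = information_gain(outlook, play)
print(f"entropy before split: {entropy(play):.3f}")
print(f"information gain:     {gain:.3f}")
```

An ID3-style learner would compute this gain for every candidate attribute and split on the one with the largest value.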



Unsupervised Learning: K-Means, Customer Segmentation, Hierarchical clustering, DBSCAN, Anomaly Detection, Local Outlier Factor, Isolation Forest, Dimensionality Reduction, PCA, GMM, Expectation Maximization.


Ensemble Models: Ensemble Definition, Bootstrapped Aggregation (Bagging) Intuition, Random Forests and their construction, Extremely randomized trees, Gradient Boosting, Regularization by Shrinkage, XGBoost, AdaBoost.



