Showing posts with label Clustering. Show all posts
Showing posts with label Clustering. Show all posts

Machine Learning 1: UNIT 5 (B) : Clustering Questions

                                                                         UNIT 5 (B)

                                                                     Clustering

5.     How does clustering work

6.     finding similarities using distances

7.     Euclidean distance and other distance metrics

k-Means Clustering:

8.     Plotting customers with their segments

9.     normalizing features

10.  cluster centres and interpreting the Clusters

11.  Hierarchical Clustering

 

UNIT-5 (B)

Long Answer Questions

1.    5.     Explain Hierarchical Clustering algorithm with example.

6.     What is the purpose of normalizing features?  How it can perform? Explain.

7.     How can you find the similarities using distances? List the different distance metrics.

8.     How can you be plotting the customers with their segments? Explain.

 

Short Answer Questions

1. 

4.     What is the purpose of cluster centre?

5.     How can you define cluster?

6.     What is the clustering algorithm?

7.     How can you identify the similarities?

8.     Euclidian Distance.

9.     Define Normalization features.

10.  List the techniques can be used for discovering the possible number of clusters?

11.  Explain Dendrogram.

12.  Explain Elbow Method

13.  Define Hierarchical Clustering Algorithm.

Machine Learning 1: UNIT 5 (B) PPTs: Clustering PPTs

  Unit 5 B

Clustering

How does clustering work

finding similarities using distances

Euclidean distance and other distance metrics

k-Means Clustering:

Plotting customers with their segments

normalizing features

cluster centres and interpreting the Clusters

 Hierarchical Clustering


------------------------------------------------------------------------------------------------------------------------


Machine Learning 1: UNIT 5(B) NOTEs: Clustering NOTES

 Unit 5 B

Clustering

5.     How does clustering work

6.     finding similarities using distances

7.     Euclidean distance and other distance metrics

k-Means Clustering:

8.     Plotting customers with their segments

9.     normalizing features

10.  cluster centres and interpreting the Clusters

11.  Hierarchical Clustering

-------------------------------------------------------------------------------------------------------------------------


About Machine Learning 1

 Machine Learning

  • The Machine Learning Landscape
  • Classification
  • Support Vector Machines
  • Decision Trees
  • Ensemble Learning and Random Forests
  • Dimensionality Reduction
  • Clustering

Unit I:

The Machine Learning Landscape: What Is Machine Learning? Why Use Machine Learning? Types of Machine Learning Systems, Supervised/Unsupervised Learning, Batch and Online Learning, Instance-Based Versus Model-Based Learning, Main Challenges of Machine Learning, Insufficient Quantity of Training Data, Nonrepresentative Training Data, Poor-Quality Data, Irrelevant Features, Overfitting the Training Data, Underfitting the Training Data, Stepping Back, Testing and Validating.

👉UNIT 1(A) NOTEs : The Machine Learning Landscape Notes

👉UNIT 1(A) PPTs: The Machine Learning Landscape

👉UNIT 1(B) NOTEs: The Machine Learning Landscape NOTEs

👉Machine Learning 1 : UNIT 1(B) PPTs: The Machine Learning Landscape PPTs

👉Machine Learning 1: UNIT 1 The Machine Learning Landscape Questions


Unit II:

Classification: Training a Binary Classifier, Performance Measures, Measuring Accuracy UsingCross-Validation, Confusion Matrix, Precision and Recall, Precision/RecallTradeoff , The ROC Curve, Multiclass Classification, Error Analysis, Multilabel Classification, Multi Output Classification. k-NN Classifier.

👉Machine Learning 1: UNIT 2 NOTEs: Classification Notes

👉Machine Learning 1 : UNIT 2: Classification PPTs

👉Machine Learning 1: UNIT 2: Classification MCQs

👉Machine Learning 1: UNIT 2: Classification Questions

Unit III:

Support Vector Machines: Linear SVM Classification, Soft Margin Classification, Nonlinear SVM Classification, Polynomial Kernel, Adding Similarity Features, Gaussian RBF Kernel, Computational Complexity, SVM Regression, Under the Hood, Decision Function and Predictions, Training Objective, Quadratic Programming, The Dual Problem, Kernelized SVM, Online SVMs.

👉Machine Learning 1: UNIT 3 (A) NOTES: Support Vector Machines NOTEs

👉Machine Learning 1: UNIT 3 (A) PPTs: Support Vector Machines PPTs

👉Machine Learning 1: UNIT 3 (B) NOTEs: Support Vector Machines NOTEs

👉Machine Learning 1: UNIT 3 (B) PPTs: Support Vector Machines PPTs

👉Machine Learning 1: UNIT 3 A & B : Support Vector Machines Questions

👉Machine Learning 1: UNIT 3 : Support Vector Machines MCQs

Unit IV:

Decision Trees: Training and Visualizing a Decision Tree, Making Predictions, Estimating Class Probabilities, The CART Training Algorithm, Computational Complexity, Gini Impurity or Entropy? Regularization Hyperparameters, Regression

👉Machine Learning 1: UNIT 4 (A) NOTEs: Decision Trees NOTEs

👉Machine Learning 1: UNIT 4 (A) PPTs: Decision Trees PPTs

👉Machine Learning 1: UNIT 4 (A): Decision Trees Questions

👉Machine Learning 1: UNIT 4 (A) : Decision Trees MCQs

Ensemble Learning and Random Forests: Voting Classifiers, Bagging and Pasting, Bagging and Pasting in Scikit-Learn, Out-of-Bag Evaluation, Random Patches and Random Subspaces, Random Forests, Extra-Trees, Feature Importance, Boosting, AdaBoost, Gradient Boosting, Stacking.

👉Machine Learning 1: UNIT 4 (B) NOTES: Ensemble Learning and Random Forests NOTES

👉Machine Learning 1: UNIT 4 (B) PPTs: Ensemble Learning and Random Forests PPTs

👉Machine Learning 1: UNIT 4 (B) : Ensemble Learning and Random Forests Questions

👉Machine Learning 1: UNIT 4 (B) : Ensemble Learning and Random Forests MCQs

Unit V:

Dimensionality Reduction: The Curse of Dimensionality, Main Approaches for Dimensionality Reduction, Projection, PCA.

👉Machine Learning 1: UNIT-5(A) NOTES: Dimensionality Reduction NOTES

👉Machine Learning 1: UNIT-5(A) PPTs: Dimensionality Reduction PPTs

👉Machine Learning 1: UNIT-5(A): Dimensionality Reduction Questions

Clustering: How does clustering work: finding similarities using distances, Euclidean distance and other distance metrics. k-Means Clustering: Plotting customers with their segments, normalizing features, cluster centres and interpreting the Clusters. Hierarchical Clustering.

👉Machine Learning 1: UNIT 5(B) NOTEs: Clustering NOTES

👉Machine Learning 1: UNIT 5 (B) PPTs: Clustering PPTs

👉Machine Learning 1: UNIT 5 (B) : Clustering Questions

Textbooks:

1. Géron, Aurélien. Hands-on machine learning with Scikit-Learn, Keras, and TensorFlow: Concepts, tools, and techniques to build intelligent systems. O'Reilly Media, 2019.
2. Pradhan, Manaranjan, and U. Dinesh Kumar. Machine Learning using Python. Wiley, IIM Bangalore, 2019.

References: 

1.     Introduction to Machine Learning, Ethem Alpaydin 2nd Edition, MIT Press 2000

2.     Machine Learning, Tom M. Mitchell, McGraw Hill, 1997, ISBN: 0-07-042807-7. 


Machine Learning 1 Syllabus

Machine Learning

Syllabus 

Unit I:

The Machine Learning Landscape: What Is Machine Learning? Why Use Machine Learning? Types of Machine Learning Systems, Supervised/Unsupervised Learning, Batch and Online Learning, Instance-Based Versus Model-Based Learning, Main Challenges of Machine Learning, Insufficient Quantity of Training Data, Nonrepresentative Training Data, Poor-Quality Data, Irrelevant Features, Overfitting the Training Data, Underfitting the Training Data, Stepping Back, Testing and Validating.

Unit II:

Classification: Training a Binary Classifier, Performance Measures, Measuring Accuracy Using Cross-Validation, Confusion Matrix, Precision and Recall, Precision/Recall Tradeoff, The ROC Curve, Multiclass Classification, Error Analysis, Multilabel Classification, Multi Output Classification. k-NN Classifier.

Unit III:

Support Vector Machines: Linear SVM Classification, Soft Margin Classification, Nonlinear SVM Classification, Polynomial Kernel, Adding Similarity Features, Gaussian RBF Kernel, Computational Complexity, SVM Regression, Under the Hood, Decision Function and Predictions, Training Objective, Quadratic Programming, The Dual Problem, Kernelized SVM, Online SVMs.

Unit IV:

Decision Trees: Training and Visualizing a Decision Tree, Making Predictions, Estimating Class Probabilities, The CART Training Algorithm, Computational Complexity, Gini Impurity or Entropy? Regularization Hyperparameters, Regression

Ensemble Learning and Random Forests: Voting Classifiers, Bagging and Pasting, Bagging and Pasting in Scikit-Learn, Out-of-Bag Evaluation, Random Patches and Random Subspaces, Random Forests, Extra-Trees, Feature Importance, Boosting, AdaBoost, Gradient Boosting, Stacking.

Unit V:

Dimensionality Reduction: The Curse of Dimensionality, Main Approaches for Dimensionality Reduction, Projection, PCA.

Clustering: How does clustering work: finding similarities using distances, Euclidean distance and other distance metrics. k-Means Clustering: Plotting customers with their segments, normalizing features, cluster centres and interpreting the Clusters. Hierarchical Clustering.

Textbooks:

1.     Géron, Aurélien. Hands-on machine learning with Scikit-Learn, Keras, and TensorFlow: Concepts, tools, and techniques to build intelligent systems. O'Reilly Media, 2019.

2.     Pradhan, Manaranjan, and U. Dinesh Kumar. Machine Learning using Python. Wiley, IIM Bangalore, 2019.

References:

 

1.     Introduction to Machine Learning, Ethem Alpaydin 2nd Edition, MIT Press 2000

2.     Machine Learning, Tom M. Mitchell, McGraw Hill, 1997, ISBN: 0-07-042807-7.

 

WEEK 7: • Apply EM algorithm to cluster a set of data stored in a .CSV file. Use the same data set for clustering using k-Means algorithm. Compare the results of these two algorithms and comment on the quality of clustering.

 WEEK 7: Apply EM algorithm to cluster a set of data stored in a .CSV file. Use the same data set for clustering using k-Means algorithm. Compare the results of these two algorithms and comment on the quality of clustering.












About Machine Learning

Welcome! Your Hub for AI, Machine Learning, and Emerging Technologies In today’s rapidly evolving tech landscape, staying updated with the ...