
Machine Learning Programs


πŸ‘‰Data Preprocessing in Machine Learning

πŸ‘‰Data Preprocessing in Machine learning (Handling Missing values )

πŸ‘‰Linear Regression - ML Program - Weight Prediction

πŸ‘‰NaΓ―ve Bayes Classifier - ML Program

πŸ‘‰LOGISTIC REGRESSION - PROGRAM

πŸ‘‰KNN Machine Learning Program

πŸ‘‰Support Vector Machine (SVM) - ML Program

πŸ‘‰Decision Tree Classifier on Iris Dataset

πŸ‘‰Classification of Iris flowers using Random Forest

πŸ‘‰DBSCAN

πŸ‘‰ Implement and demonstrate the FIND-S algorithm for finding the most specific hypothesis based on a given set of training data samples. Read the training data from a .CSV file

πŸ‘‰For a given set of training data examples stored in a .CSV file, implement and demonstrate the Candidate-Elimination algorithm to output a description of the set of all hypotheses consistent with the training examples.

πŸ‘‰Write a program to demonstrate the working of the decision tree based ID3 algorithm. Use an appropriate data set for building the decision tree and apply this knowledge to classify a new sample.

πŸ‘‰Build an Artificial Neural Network by implementing the Backpropagation algorithm and test the same using appropriate data sets.

πŸ‘‰Write a program to construct a Bayesian network considering medical data. Use this model to demonstrate the diagnosis of heart patients using a standard Heart Disease Data Set.

πŸ‘‰Write a program to implement k-Nearest Neighbors algorithm to classify the iris data set. Print both correct and wrong predictions.

πŸ‘‰Implement the non-parametric Locally Weighted Regression algorithm in order to fit data points. Select appropriate data set for your experiment and draw graphs.

πŸ‘‰Write a program to implement SVM algorithm to classify the iris data set. Print both correct and wrong predictions.

πŸ‘‰Apply EM algorithm to cluster a set of data stored in a .CSV file. Use the same data set for clustering using k-Means algorithm. Compare the results of these two algorithms and comment on the quality of clustering.

πŸ‘‰ Write a program using scikit-learn to implement K-means Clustering

πŸ‘‰Program to calculate the entropy and the information gain

πŸ‘‰Program to implement perceptron.

NaΓ―ve Bayes Classifier - ML Program

 NaΓ―ve Bayes Classifier

Steps:

  1. Understand the business problem
  2. Import the library files
  3. Load the dataset
  4. Data preprocessing
  5. Split the data into train and test
  6. Build the model (NaΓ―ve Bayes classifier)
  7. Test the model
  8. Performance Measures
  9. Predict the class label for new data.

1. Understand the business problem

Let’s build a classifier that predicts whether I should play tennis given the forecast. It takes four attributes to describe the forecast; namely, the outlook, the temperature, the humidity, and the presence or absence of wind. Furthermore, the values of the four attributes are qualitative (also known as categorical).

Under the naïve conditional-independence assumption, the posterior probability of class C_k given the attributes is proportional to the class prior times the product of the per-attribute likelihoods:

p(C_k | x_1, x_2, …, x_n) ∝ p(C_k) · ∏_{i=1}^{n} p(x_i | C_k)


2. Import the library files
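The code screenshots for this step are missing from the page, so here is a minimal sketch of the imports the later steps rely on (pandas plus scikit-learn; the exact modules the original used are assumed):

```python
import pandas as pd

from sklearn.preprocessing import LabelEncoder          # encode categorical columns
from sklearn.model_selection import train_test_split    # train/test split
from sklearn.naive_bayes import CategoricalNB           # Naive Bayes for categorical data
from sklearn.metrics import accuracy_score, confusion_matrix, classification_report
```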



3. Load the dataset
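The original post loaded the data from a CSV file that is not reproduced here, so this sketch builds the classic 14-row play-tennis table inline; with a real file, the same `df` would come from `pd.read_csv` on your dataset's path:

```python
import pandas as pd

# Classic 14-row play-tennis dataset, built inline so the example is
# self-contained; normally this would come from pd.read_csv(<your .csv file>).
cols = ["Outlook", "Temperature", "Humidity", "Wind", "PlayTennis"]
rows = [
    ("Sunny", "Hot", "High", "Weak", "No"),
    ("Sunny", "Hot", "High", "Strong", "No"),
    ("Overcast", "Hot", "High", "Weak", "Yes"),
    ("Rain", "Mild", "High", "Weak", "Yes"),
    ("Rain", "Cool", "Normal", "Weak", "Yes"),
    ("Rain", "Cool", "Normal", "Strong", "No"),
    ("Overcast", "Cool", "Normal", "Strong", "Yes"),
    ("Sunny", "Mild", "High", "Weak", "No"),
    ("Sunny", "Cool", "Normal", "Weak", "Yes"),
    ("Rain", "Mild", "Normal", "Weak", "Yes"),
    ("Sunny", "Mild", "Normal", "Strong", "Yes"),
    ("Overcast", "Mild", "High", "Strong", "Yes"),
    ("Overcast", "Hot", "Normal", "Weak", "Yes"),
    ("Rain", "Mild", "High", "Strong", "No"),
]
df = pd.DataFrame(rows, columns=cols)
print(df.shape)   # (14, 5)
```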






4. Data preprocessing
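Since every attribute is qualitative, a common preprocessing choice (assumed here; the original screenshots are missing) is to map each column to integer codes with one LabelEncoder per column, keeping the encoders around so predictions can be decoded back to "Yes"/"No" later:

```python
import pandas as pd
from sklearn.preprocessing import LabelEncoder

# Rebuild the 14-row play-tennis dataset from step 3.
cols = ["Outlook", "Temperature", "Humidity", "Wind", "PlayTennis"]
rows = [
    ("Sunny", "Hot", "High", "Weak", "No"), ("Sunny", "Hot", "High", "Strong", "No"),
    ("Overcast", "Hot", "High", "Weak", "Yes"), ("Rain", "Mild", "High", "Weak", "Yes"),
    ("Rain", "Cool", "Normal", "Weak", "Yes"), ("Rain", "Cool", "Normal", "Strong", "No"),
    ("Overcast", "Cool", "Normal", "Strong", "Yes"), ("Sunny", "Mild", "High", "Weak", "No"),
    ("Sunny", "Cool", "Normal", "Weak", "Yes"), ("Rain", "Mild", "Normal", "Weak", "Yes"),
    ("Sunny", "Mild", "Normal", "Strong", "Yes"), ("Overcast", "Mild", "High", "Strong", "Yes"),
    ("Overcast", "Hot", "Normal", "Weak", "Yes"), ("Rain", "Mild", "High", "Strong", "No"),
]
df = pd.DataFrame(rows, columns=cols)

# One encoder per column, so class labels can be decoded later.
encoders = {c: LabelEncoder().fit(df[c]) for c in cols}
encoded = pd.DataFrame({c: encoders[c].transform(df[c]) for c in cols})

X = encoded.drop(columns="PlayTennis")   # integer-coded features
y = encoded["PlayTennis"]                # target: No -> 0, Yes -> 1
```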













5. Split the data into train and test
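A conventional 70/30 split (the exact ratio in the original post is not recoverable from the page) separates training and test rows; `random_state` is fixed for reproducibility and `stratify` keeps both classes in each part. Continuing the sketch from the previous steps:

```python
import pandas as pd
from sklearn.preprocessing import LabelEncoder
from sklearn.model_selection import train_test_split

# Rebuild and encode the play-tennis data (steps 3-4).
cols = ["Outlook", "Temperature", "Humidity", "Wind", "PlayTennis"]
rows = [
    ("Sunny", "Hot", "High", "Weak", "No"), ("Sunny", "Hot", "High", "Strong", "No"),
    ("Overcast", "Hot", "High", "Weak", "Yes"), ("Rain", "Mild", "High", "Weak", "Yes"),
    ("Rain", "Cool", "Normal", "Weak", "Yes"), ("Rain", "Cool", "Normal", "Strong", "No"),
    ("Overcast", "Cool", "Normal", "Strong", "Yes"), ("Sunny", "Mild", "High", "Weak", "No"),
    ("Sunny", "Cool", "Normal", "Weak", "Yes"), ("Rain", "Mild", "Normal", "Weak", "Yes"),
    ("Sunny", "Mild", "Normal", "Strong", "Yes"), ("Overcast", "Mild", "High", "Strong", "Yes"),
    ("Overcast", "Hot", "Normal", "Weak", "Yes"), ("Rain", "Mild", "High", "Strong", "No"),
]
df = pd.DataFrame(rows, columns=cols)
encoders = {c: LabelEncoder().fit(df[c]) for c in cols}
encoded = pd.DataFrame({c: encoders[c].transform(df[c]) for c in cols})
X, y = encoded.drop(columns="PlayTennis"), encoded["PlayTennis"]

# 70/30 split: 9 training rows, 5 test rows.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0, stratify=y)
print(len(X_train), len(X_test))
```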








6. Build the model (Naïve Bayes classifier)
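Because every feature is categorical, `CategoricalNB` is a natural choice here (the post's own screenshots are missing, and it may equally have used `GaussianNB`); `min_categories=3` guards against a category value that happens to be absent from the small training split:

```python
import pandas as pd
from sklearn.preprocessing import LabelEncoder
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import CategoricalNB

# Rebuild, encode, and split the play-tennis data (steps 3-5).
cols = ["Outlook", "Temperature", "Humidity", "Wind", "PlayTennis"]
rows = [
    ("Sunny", "Hot", "High", "Weak", "No"), ("Sunny", "Hot", "High", "Strong", "No"),
    ("Overcast", "Hot", "High", "Weak", "Yes"), ("Rain", "Mild", "High", "Weak", "Yes"),
    ("Rain", "Cool", "Normal", "Weak", "Yes"), ("Rain", "Cool", "Normal", "Strong", "No"),
    ("Overcast", "Cool", "Normal", "Strong", "Yes"), ("Sunny", "Mild", "High", "Weak", "No"),
    ("Sunny", "Cool", "Normal", "Weak", "Yes"), ("Rain", "Mild", "Normal", "Weak", "Yes"),
    ("Sunny", "Mild", "Normal", "Strong", "Yes"), ("Overcast", "Mild", "High", "Strong", "Yes"),
    ("Overcast", "Hot", "Normal", "Weak", "Yes"), ("Rain", "Mild", "High", "Strong", "No"),
]
df = pd.DataFrame(rows, columns=cols)
encoders = {c: LabelEncoder().fit(df[c]) for c in cols}
encoded = pd.DataFrame({c: encoders[c].transform(df[c]) for c in cols})
X, y = encoded.drop(columns="PlayTennis"), encoded["PlayTennis"]
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0, stratify=y)

# min_categories=3 covers every feature's full value range (at most 3 levels),
# even if a level is missing from the training rows.
model = CategoricalNB(min_categories=3)
model.fit(X_train, y_train)
```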


7. Test the model
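Testing the model means predicting labels for the held-out rows and comparing them with the true ones; continuing the same assumed pipeline:

```python
import pandas as pd
from sklearn.preprocessing import LabelEncoder
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import CategoricalNB

# Rebuild, encode, split, and fit (steps 3-6).
cols = ["Outlook", "Temperature", "Humidity", "Wind", "PlayTennis"]
rows = [
    ("Sunny", "Hot", "High", "Weak", "No"), ("Sunny", "Hot", "High", "Strong", "No"),
    ("Overcast", "Hot", "High", "Weak", "Yes"), ("Rain", "Mild", "High", "Weak", "Yes"),
    ("Rain", "Cool", "Normal", "Weak", "Yes"), ("Rain", "Cool", "Normal", "Strong", "No"),
    ("Overcast", "Cool", "Normal", "Strong", "Yes"), ("Sunny", "Mild", "High", "Weak", "No"),
    ("Sunny", "Cool", "Normal", "Weak", "Yes"), ("Rain", "Mild", "Normal", "Weak", "Yes"),
    ("Sunny", "Mild", "Normal", "Strong", "Yes"), ("Overcast", "Mild", "High", "Strong", "Yes"),
    ("Overcast", "Hot", "Normal", "Weak", "Yes"), ("Rain", "Mild", "High", "Strong", "No"),
]
df = pd.DataFrame(rows, columns=cols)
encoders = {c: LabelEncoder().fit(df[c]) for c in cols}
encoded = pd.DataFrame({c: encoders[c].transform(df[c]) for c in cols})
X, y = encoded.drop(columns="PlayTennis"), encoded["PlayTennis"]
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0, stratify=y)
model = CategoricalNB(min_categories=3).fit(X_train, y_train)

# Predict the test rows and compare against the true labels.
y_pred = model.predict(X_test)
print("predicted:", list(y_pred))
print("actual:   ", list(y_test))
```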



8. Performance Measures
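The usual scikit-learn metrics (accuracy, the confusion matrix, and a per-class report) summarize performance on the test split; again a sketch, since the original output screenshots are missing:

```python
import pandas as pd
from sklearn.preprocessing import LabelEncoder
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import CategoricalNB
from sklearn.metrics import accuracy_score, confusion_matrix, classification_report

# Rebuild, encode, split, fit, and predict (steps 3-7).
cols = ["Outlook", "Temperature", "Humidity", "Wind", "PlayTennis"]
rows = [
    ("Sunny", "Hot", "High", "Weak", "No"), ("Sunny", "Hot", "High", "Strong", "No"),
    ("Overcast", "Hot", "High", "Weak", "Yes"), ("Rain", "Mild", "High", "Weak", "Yes"),
    ("Rain", "Cool", "Normal", "Weak", "Yes"), ("Rain", "Cool", "Normal", "Strong", "No"),
    ("Overcast", "Cool", "Normal", "Strong", "Yes"), ("Sunny", "Mild", "High", "Weak", "No"),
    ("Sunny", "Cool", "Normal", "Weak", "Yes"), ("Rain", "Mild", "Normal", "Weak", "Yes"),
    ("Sunny", "Mild", "Normal", "Strong", "Yes"), ("Overcast", "Mild", "High", "Strong", "Yes"),
    ("Overcast", "Hot", "Normal", "Weak", "Yes"), ("Rain", "Mild", "High", "Strong", "No"),
]
df = pd.DataFrame(rows, columns=cols)
encoders = {c: LabelEncoder().fit(df[c]) for c in cols}
encoded = pd.DataFrame({c: encoders[c].transform(df[c]) for c in cols})
X, y = encoded.drop(columns="PlayTennis"), encoded["PlayTennis"]
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0, stratify=y)
model = CategoricalNB(min_categories=3).fit(X_train, y_train)
y_pred = model.predict(X_test)

# Accuracy, confusion matrix, and per-class precision/recall/F1.
acc = accuracy_score(y_test, y_pred)
cm = confusion_matrix(y_test, y_pred)
print("accuracy:", acc)
print(cm)
print(classification_report(y_test, y_pred, zero_division=0))
```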




9. Predict the class label for new data.
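A new forecast must be encoded with the same per-column encoders before it reaches the model; the sample below (Sunny, Cool, High humidity, Strong wind) is illustrative, not taken from the original post:

```python
import pandas as pd
from sklearn.preprocessing import LabelEncoder
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import CategoricalNB

# Rebuild, encode, split, and fit (steps 3-6).
cols = ["Outlook", "Temperature", "Humidity", "Wind", "PlayTennis"]
rows = [
    ("Sunny", "Hot", "High", "Weak", "No"), ("Sunny", "Hot", "High", "Strong", "No"),
    ("Overcast", "Hot", "High", "Weak", "Yes"), ("Rain", "Mild", "High", "Weak", "Yes"),
    ("Rain", "Cool", "Normal", "Weak", "Yes"), ("Rain", "Cool", "Normal", "Strong", "No"),
    ("Overcast", "Cool", "Normal", "Strong", "Yes"), ("Sunny", "Mild", "High", "Weak", "No"),
    ("Sunny", "Cool", "Normal", "Weak", "Yes"), ("Rain", "Mild", "Normal", "Weak", "Yes"),
    ("Sunny", "Mild", "Normal", "Strong", "Yes"), ("Overcast", "Mild", "High", "Strong", "Yes"),
    ("Overcast", "Hot", "Normal", "Weak", "Yes"), ("Rain", "Mild", "High", "Strong", "No"),
]
df = pd.DataFrame(rows, columns=cols)
encoders = {c: LabelEncoder().fit(df[c]) for c in cols}
encoded = pd.DataFrame({c: encoders[c].transform(df[c]) for c in cols})
X, y = encoded.drop(columns="PlayTennis"), encoded["PlayTennis"]
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0, stratify=y)
model = CategoricalNB(min_categories=3).fit(X_train, y_train)

# Encode a new, unseen forecast with the encoders fitted above, then
# decode the numeric prediction back to a "Yes"/"No" label.
new = pd.DataFrame([("Sunny", "Cool", "High", "Strong")], columns=cols[:-1])
new_encoded = pd.DataFrame({c: encoders[c].transform(new[c]) for c in cols[:-1]})
pred = model.predict(new_encoded)[0]
label = encoders["PlayTennis"].inverse_transform([pred])[0]
print("Play tennis?", label)
```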




Machine Learning MCQs-2 (Performance Metrics, Linear Regression, Naïve Bayes Classifier)

                       Machine Learning MCQs- 2
Performance Metrics, Linear Regression, NaΓ―ve Bayes Classifier 


1. The greater the ROC AUC value, the better the model:
  1. True
  2. False
Ans: 1
 
2. A set of data points are all close to each other, and they are close to the actual value. This set of data can be described as:
  1. Accurate
  2. Precise
  3. both Precise and accurate
  4. None of the above
Ans: 3

3. The maximum value of the ROC AUC is:
  1. 0.8
  2. 0.9
  3. 1
  4. 0
Ans: 3

4. Recall can be increased by increasing the decision threshold. True or False?
  1. True
  2. False
Ans: 2

5. Which of these is a good measure to decide which threshold to use?
  1. Confusion matrix
  2. F1 score
  3. ROC curve
  4. Precision & Recall versus Threshold Curve
Ans: 4

6. Which of these may have to be performed before analyzing and training the dataset?
  1. Shuffling
  2. Cross-Validation
  3. F1 Score
  4. None
Ans: 1

7. For the confusion matrix below, what is the total number of instances?

                 Predicted: Not 5    Predicted: 5
Actual: Not 5          53272              1307
Actual: 5               1077              4344

  1. 50000
  2. 60000
  3. 70000
  4. 80000
Ans: 2

8. For the below confusion matrix, what is the accuracy?

 

                 Predicted: Not 5    Predicted: 5
Actual: Not 5          53272              1307
Actual: 5               1077              4344

  1. 95%
  2. 90%
  3. 96%
  4. 98%
Ans: 3

9.  For the below confusion matrix, what is the recall?

 

                 Predicted: Not 5    Predicted: 5
Actual: Not 5          53272              1307
Actual: 5               1077              4344

  1. 0.7
  2. 0.8
  3. 0.9
  4. 0.95
Ans: 2

10. For the below confusion matrix, what is the precision?

 

                 Predicted: Not 5    Predicted: 5
Actual: Not 5          53272              1307
Actual: 5               1077              4344

  1. 0.73
  2. 0.76
  3. 0.78
  4. 0.82
Ans: 2

11. F1 score is:
  1. absolute mean of precision and recall
  2. harmonic mean of precision and recall
  3. squared mean of precision and recall
  4. None
Ans: 2

12. For the below confusion matrix, what is the F1 score?

 

                 Predicted: Not 5    Predicted: 5
Actual: Not 5          53272              1307
Actual: 5               1077              4344

  1. 0.72
  2. 0.784
  3. 0.82
  4. 0.84
Ans: 2

13. For a model to detect videos that are unsafe for kids, we need (safe video = positive class)
  1. High precision, low recall
  2. High recall, low precision
  3. High Precision, High Recall
  4. Low Precision, Low Recall
Ans: 1

14. For a model to detect shoplifters in surveillance images, we need (shoplifter is positive class)
  1. High precision, low recall
  2. High recall, low precision
  3. High Precision, High Recall
  4. Low Precision, Low Recall
Ans: 2 

15. Which of these provides out-of-core support for Linear Regression?
  1. Normal Equation
  2. SGD
  3. Batch Gradient Descent
  4. None
Ans: 2

16. The NormalEquation class in scikit-learn solves Linear Regression using:
  1. Normal Equation
  2. SGD
  3. There is no NormalEquation class in scikit-learn
  4. Mini-Batch Gradient Descent
Ans: 3

17. Training a Linear Regression model using the Normal Equation is linear in both the number of training instances and the number of features:
  1. True
  2. False
Ans: 2

18. Prediction using a Linear Regression model is linear in both the number of predictions that have to be made and the number of features:
  1. True
  2. False
Ans: 1

19.Which of these is more prone to overfitting?
  1. Linear Regression
  2. Polynomial Regression
Ans: 2

20. Naïve Bayes assumes:
  1. Conditional Independence
  2. Conditional Dependence
  3. Both 1 and 2
  4. None of the above
Ans:1


21. Naïve Bayes, in its basic form, requires:
  1. Categorical Values
  2. Numerical Values
  3. Either 1 or 2
  4. Both 1 and 2
Ans: 1


22. A probabilistic model of the data within each class is called:
  1. Discriminative classification
  2. Generative classification
  3. Probabilistic classification
  4. Both 2 and 3
Ans: 4

23. Modeling the classification rule directly is called:
  1. Discriminative classification
  2. Generative classification
  3. Probabilistic classification
  4. Both 1 and 3
Ans: 4

24. Spam classification is a classic example application of:
  1. Naive Bayes
  2. Probabilistic condition
  3. Random Forest
  4. All the Above
Ans: 1

25. NaΓ―ve Bayes Algorithm is a ________ learning algorithm.
  1. Supervised
  2. Reinforcement
  3. Unsupervised
  4. None of these
Ans: 1

26. Which of the following is correct about the Naive Bayes?
  1. Assumes that all the features in a dataset are independent
  2. Assumes that all the features in a dataset are equally important
  3. Both 1 and 2
  4. None of the above
Ans: 3

27. Types of NaΓ―ve Bayes Model:
  1. Gaussian
  2. Multinomial
  3. Bernoulli
  4. All of the above
Ans: 4

28. Which of the following is a disadvantage of the Naïve Bayes classifier?
  1. NaΓ―ve Bayes assumes that all features are independent or unrelated, so it cannot learn the relationship between features.
  2. It performs well in Multi-class predictions as compared to the other Algorithms.
  3. NaΓ―ve Bayes is one of the fast and easy ML algorithms to predict a class of datasets
  4. It is the most popular choice for text classification problems
Ans: 1

29. Benefits of the Naïve Bayes classifier:
  1. NaΓ―ve Bayes is one of the fast and easy ML algorithms to predict a class of datasets.
  2. It is the most popular choice for text classification problems.
  3. It can be used for Binary as well as Multi-class Classifications.
  4. All of the above
Ans: 4

30. TPR =
  1. TP/(FN+TP)
  2. TN/(TN+FP)
  3. FP/(FP+TN)
  4. FN/(FN+TP)
Ans: 1

31. TNR=
  1. TP/(FN+TP)
  2. TN/(TN+FP)
  3. FP/(FP+TN)
  4. FN/(FN+TP)
Ans: 2

32. FPR=
  1. TP/(FN+TP)
  2. TN/(TN+FP)
  3. FP/(FP+TN)
  4. FN/(FN+TP)
Ans: 3

33. FNR=
  1. TP/(FN+TP)
  2. TN/(TN+FP)
  3. FP/(FP+TN)
  4. FN/(FN+TP)
Ans: 4

34. Precision =

  1. TP/(TP+FP)
  2. TP/(FN+TP)
  3. FP/(TP+FP)
  4. FP/(FN+TP)
Ans: 1

35. Recall=
  1. TP/(TP+FP)
  2. TP/(FN+TP)
  3. FP/(TP+FP)
  4. FP/(FN+TP)
Ans: 2

