4. Introduction to Supervised, Unsupervised and Reinforcement Learning

4. Introduction to Supervised, Unsupervised and Reinforcement Learning

ü The amount of data generated in the world today is very huge. This data is generated not only by humans but also by smartphones, computers and other devices. Based on the kind of data available and a motive present, certainly, a programmer will choose how to train an algorithm using a specific learning model.

· Machine Learning is a part of Computer Science where the efficiency of a system improves itself by repeatedly performing the tasks by using data instead of explicitly programmed by programmers. Further let us understand the difference between three techniques of Machine Learning- Supervised, Unsupervised and Reinforcement Learning.

How Machine Learning Works?

Types of Learning Algorithms

i. Supervised learning

ii. Unsupervised learning

iii. Reinforcement learning

Supervised Learning

· In Supervised learning, an AI system is presented with data which is labeled, which means that each data tagged with the correct label.

· The goal is to approximate the mapping function so well that when you have new input data (x) that you can predict the output variables (Y) for that data.

· As shown in the example, we have initially taken some data and marked them as ‘Spam’ or ‘Not Spam’. This labeled data is used by the training supervised model, this data is used to train the model.

· Once it is trained we can test our model by testing it with some test new mails and checking of the model is able to predict the right output.

Types of Supervised learning

i. Classification: A classification problem is when the output variable is a category, such as “red” or “blue” or “disease” and “no disease”.

ii. Regression: A regression problem is when the output variable is a real value, such as “dollars” or “weight”.

Unsupervised Learning Algorithm

· In unsupervised learning, an AI system is presented with unlabeled, uncategorized data and the system’s algorithms act on the data without prior training.

· In the example, we have given some characters to our model which are ‘Ducks’ and ‘Not Ducks’.

· In our training data, we don’t provide any label to the corresponding data.

· The unsupervised model is able to separate both the characters by looking at the type of data and models the underlying structure or distribution in the data to learn more about it.

can

Types of Unsupervised Learning

i. Clustering: A clustering problem is where you want to discover the inherent groupings in the data, such as grouping customers by purchasing behavior.

ii. Association: An association rule learning problem is where you want to discover rules that describe large portions of your data, such as people who buy X also tend to buy Y.

Reinforcement Learning

· A reinforcement learning algorithm, or agent, learns by interacting with its environment.

· The agent receives rewards by performing correctly and penalties for performing incorrectly.

· The agent learns without intervention from a human by maximizing its reward and minimizing its penalty.

· It is a type of dynamic programming that trains algorithms using a system of reward and punishment.

In the example, we can see that the agent is given 2 options i.e. a path with water or a path with fire.

· A reinforcement algorithm works on reward a system i.e. if the agent uses the fire path then the rewards are subtracted and agent tries to learn that it should avoid the fire path.

· If it had chosen the water path or the safe path then some points would have been added to the reward points, the agent then would try to learn what path is safe and what path isn’t.

· It is basically leveraging the rewards obtained, the agent improves its environment knowledge to select the next action.

Applications of Machine Learning

•Recognizing patterns:

ü Facial identities or facial expressions

ü Handwritten or spoken words

ü Medical images

•Generating patterns:

ü Generating images or motion sequences

•Recognizing anomalies:

ü Unusual sequences of credit card transactions

ü Unusual patterns of sensor readings in a nuclear power plant or unusual sound in your car engine.

•Prediction:

ü Future stock prices or currency exchange rates

Influence of Disciplines on Machine Learning

i. Artificial intelligence

Learning symbolic representations of concepts. Machine learning as a search problem. Learning as an approach to improving problem solving. Using prior knowledge together with training data to guide learning.

ii. Bayesian methods

Bayes' theorem as the basis for calculating probabilities of hypotheses. The naive Bayes classifier. Algorithms for estimating values of unobserved variables.

iii. Computational complexity theory

Theoretical bounds on the inherent complexity of different learning tasks, measured in terms of the computational effort, number of training examples, number of mistakes, etc. required in order to learn.

iv. Control theory

Procedures that learn to control processes in order to optimize predefined objectives and that learn to predict the next state of the process they are controlling.

v. Information theory

Measures of entropy and information content. Minimum description length approaches to learning. Optimal codes and their relationship to optimal training sequences for encoding a hypothesis.

vi. Philosophy

Occam's razor, suggesting that the simplest hypothesis is the best. Analysis of the justification for generalizing beyond observed data.

vii. Psychology and neurobiology

The power law of practice, which states that over a very broad range of learning problems, people's response time improves with practice according to a power law. Neurobiological studies motivating artificial neural network models of learning.

viii. Statistics

Characterization of errors (e.g., bias and variance) that occur when estimating the accuracy of a hypothesis based on a limited sample of data. Confidence intervals, statistical tests.

	Supervised learning	Unsupervised learning	Reinforcement learning
Definition	Makes predictions from data	Segments and groups data	Reward-punishment system and interactive environment
Types of data	Labelled data	Unlabeled data	Acts according to a policy with a final goal to reach (No or predefined data)
Commercial value	High commercial and business value	Medium commercial and business value	Little commercial use yet
Types of problems	Regression and classification	Association and Clustering	Exploitation or Exploration
Supervision	Extra supervision	No	No supervision
Algorithms	Linear Regression, Logistic Regression, SVM, KNN and so forth	K – Means clustering, C – Means, Apriori	Q – Learning, SARSA
Aim	Calculate outcomes	Discover underlying patterns	Learn a series of action
Application	Risk Evaluation, Forecast Sales	Recommendation System, Anomaly Detection	Self-Driving Cars, Gaming, Healthcare

Which is the better Machine Learning technique?

ü We learned about the three main members of the machine learning family essential for deep learning. Other kinds of learning are also available such as semi-supervised learning, or self-supervised learning.

ü Supervised, unsupervised, and reinforcement learning, are all used for different to complete diverse kinds of tasks. No single algorithm exists that can solve every problem, as problems of different natures require different approaches to resolve them.

ü Despite the many differences between the three types of learning, all of these can be used to build efficient and high-value machine learning and Artificial Intelligence applications. All techniques are used in different areas of research and development to help solve complex tasks and resolve challenges.

Search This Blog

Dr. Rambabu Pemula

4. Introduction to Supervised, Unsupervised and Reinforcement Learning

Comments

Post a Comment

Popular posts from this blog

VERSION SPACES AND THE CANDIDATE-ELIMINATION ALGORITHM

Support Vector Machine (SVM) - ML Program

Support Vector Machines