Data Wrangling Syllabus

 DATA WRANGLING

UNIT-1

Introduction to Data Wrangling: What Is Data Wrangling?- Importance of Data Wrangling -How is Data Wrangling performed?- Tasks of Data Wrangling-Data Wrangling Tools-Introduction to Python-Python Basics-Data Meant to Be Read by Machines-CSV Data-JSON Data-XML Data.

UNIT-2

Working with Excel Files and PDFs: Installing Python Packages-Parsing Excel Files-Parsing Excel Files -Getting Started with Parsing-PDFs and Problem Solving in Python-Programmatic Approaches to PDF Parsing-Converting PDF to Text-Parsing PDFs Using pdf miner-Acquiring and Storing Data-Databases, A Brief Introduction-Relational Databases, MySQL and PostgreSQL-Non-Relational Databases, NoSQL-When to Use a Simple File-Alternative Data Storage.

UNIT-3

Data Cleanup: Why Clean Data? - Data Cleanup Basics-Identifying Values for Data Cleanup-Formatting Data-Finding Outliers and Bad Data-Finding Duplicates-Fuzzy Matching-RegEx Matching-Normalizing and Standardizing the Data-Saving the Data-Determining suitable Data Cleanup-Scripting the Cleanup- Testing with New Data.

UNIT-4

Data Exploration and Analysis: Exploring Data-Importing Data-Exploring Table Functions-Joining Numerous Datasets-Identifying Correlations-Identifying Outliers-Creating Groupings-Analyzing Data-Separating and Focusing the Data-Presenting Data-Visualizing the Data-Charts-Time-Related Data-Maps- Interactives -Words-Images, Video, and Illustrations-Presentation Tools-Publishing the Data-Open Source Platforms.

UNIT-5

Web Scraping: What to Scrape and How-Analyzing a Web Page-Network/Timeline-Interacting with JavaScript-In-Depth Analysis of a Page-Getting Pages-Reading a Web Page-Reading a Web Page with LXML-XPath-Advanced Web Scraping-Browser-Based Parsing-Screen Reading with Selenium-Screen Reading with Ghost.Py, Spidering the Web-Building a Spider with Scrapy-Crawling Whole Websites with Scrapy.


TEXT BOOKS:

1. Jacqueline Kazil& Katharine Jarmul, “Data Wrangling with Python”, O’Reilly Media, Inc,2016
2. McKinney, William, Python for Data Analysis: Data Wrangling with Pandas, NumPy, and IPython, 2 nd Edition, O’Reilly, 2017

Machine Learning Syllabus

 MACHINE LEARNING

UNIT - I

Introduction- Well-Posed Learning Problems, Designing a Learning System, Perspectives and Issues in Machine Learning, Introduction to Supervised, Unsupervised and Reinforcement Learning.

Concept Learning and the General to Specific Ordering – Introduction, A Concept Learning Task, Concept Learning as Search, Find-S: Finding a Maximally Specific Hypothesis, Version Spaces and the Candidate Elimination Algorithm.

 UNIT - II

Decision Tree Learning Introduction, Decision Tree Representation, Appropriate Problems for Decision Tree Learning, The Basic Decision Tree Learning Algorithm, Issues In Decision Tree Learning.

Artificial Neural Networks- Introduction, Neural Network Representation, Appropriate Problems for Neural Network Learning, Perceptrons, Multilayer Networks and the Back-Propagation Algorithm.

 

UNIT - III

Bayesian Learning – Introduction, Bayes Theorem, Bayes Theorem and Concept Learning, Bayes Optimal Classifier, Naive Bayes Classifier, Bayesian Belief Networks, EM Algorithm.

Instance-Based Learning- Introduction, K-Nearest Neighbor Algorithm, Locally Weighted Regression, Remarks on Lazy and Eager Learning.

 

UNIT -IV

Genetic Algorithms – Motivation, Genetic Algorithms, An Illustrative Example, Genetic Programming, Models of Evolution and Learning, Parallelizing Genetic Algorithms.

Learning Sets of Rules – Introduction, Sequential Covering Algorithms, Learning Rule Sets: Summary, Learning First-Order Rules, Learning Sets Of First-Order Rules: FOIL

           

UNIT - V

Analytical Learning- Introduction, Learning With Perfect Domain Theories: PROLOG-EBG, Explanation-Based Learning Of Search Control Knowledge.

Reinforcement Learning – Introduction, The learning task, Q–learning, Nondeterministic, Rewards and Actions, Temporal Difference Learning, Generalizing from Examples, Relationship to Dynamic Programming.


Text Books:

1.     Machine Learning – Tom M. Mitchell, – MGH

2.     Machine Learning: An Algorithmic Perspective, Stephen Marsland, Taylor & Francis (CRC)

 


 


About

This blog provides information for the following subjects

👉Artificial Intelligence
 ðŸ‘‰Machine Learning
👉Machine Learning Programs 

About Machine Learning

Welcome! Your Hub for AI, Machine Learning, and Emerging Technologies In today’s rapidly evolving tech landscape, staying updated with the ...