5910 Breckenridge Pkwy Suite B, Tampa, FL. 33610
(800) 272-0707

SkillSoft Explore Course

Invisible Assets

Classification, regression, and clustering are some of the most commonly used machine learning (ML) techniques and there are various algorithms available for these tasks. In this 10-video course, learners can explore their application in Pandas ML. First, examine how to load data from a CSV (comma-separated values) file into a Pandas data frame and prepare the data for training a classification model. Then use the scikit-learn library to build and train a LinearSVC classification model and evaluate its performance with available model evaluation functions. You will explore how to install Pandas ML and define and configure a ModelFrame, then compare training and evaluation in Pandas ML with equivalent tasks in scikit-learn. Learn how to build a linear regression model by using Pandas ML. Then evaluate a regression model by using metrics such as r-square and mean squared error, and visualize its performance with Matplotlib. Work with ModelFrames for feature extraction and encoding, and configure and build a clustering model with the K-Means algorithm, analyzing data clusters to determine unique characteristics. Finally, complete an exercise on regression, classification, and clustering.



Objectives

Machine & Deep Learning Algorithms: Data Preperation in Pandas ML

  • Course Overview
  • load data from a CSV file into a Pandas dataframe and prepare the data for training a classification model
  • use the scikit-learn library to build and train a LinearSVC classification model and then evaluate its performance using the available model evaluation functions
  • install Pandas ML and then define and configure a ModelFrame
  • compare training and evaluation in Pandas ML with the equivalent tasks in scikit-learn
  • use Pandas for feature extraction and one-hot encoding, load its contents into a ModelFrame, and initialize and train a linear regression model
  • evaluate a regression model using metrics such as r-square and mean squared error and visualize its performance using Matplotlib
  • work with ModelFrames for feature extraction and label encoding
  • configure and build a clustering model using the K-Means algorithm and analyze data clusters to determine characteristics that are unique to them
  • distinguish between the use of scikit-learn and Pandas ML when training a model and identify some of the metrics used to evaluate a model