
SkillSoft Explore Course

Data is one of the most valuable assets a business has, but it's only as valuable as the methods used to interpret it. Data science, which at its core includes statistics and sampling, is the key to data interpretation. In this course, practice using the pandas library in Python to work with statistics and sampling.
Practice loading data from a CSV file into a pandas DataFrame. Compute a variety of statistics on the data, including the mean, median, mode, standard deviation, and variance. While doing so, see how to visualize the relationship between the data and the computed statistics.
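As a rough illustration of the loading and statistics steps described above, here is a minimal sketch. The CSV content, the column names, and the helper functions `manual_mean` and `manual_median` are all hypothetical and not taken from the course; an in-memory string stands in for a file on disk.

```python
import io

import pandas as pd

# Hypothetical CSV content standing in for a real file.
csv_text = """name,score
a,10
b,20
c,20
d,30
e,70
"""

# read_csv accepts a file path or any file-like object.
df = pd.read_csv(io.StringIO(csv_text))


def manual_mean(values):
    """Arithmetic mean: sum of the values divided by their count."""
    values = list(values)
    return sum(values) / len(values)


def manual_median(values):
    """Middle value of the sorted data (average of the two middle values if even)."""
    ordered = sorted(values)
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        return ordered[mid]
    return (ordered[mid - 1] + ordered[mid]) / 2


# Built-in pandas statistics for comparison.
stats = {
    "mean": df["score"].mean(),
    "median": df["score"].median(),
    "mode": df["score"].mode().iloc[0],  # mode() returns a Series; take the first value
    "std": df["score"].std(),            # sample standard deviation (ddof=1)
    "var": df["score"].var(),            # sample variance (ddof=1)
}

print(stats)
print(manual_mean(df["score"]), manual_median(df["score"]))
```

In this toy data the single large value pulls the mean above the median, which is the kind of relationship a plot of the distribution makes visible.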
Moving along, implement several sampling techniques, such as stratified sampling and cluster sampling. Then, explore how a balanced sample can be created from an imbalanced dataset using the imblearn module in Python.
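The sampling techniques named above can be sketched with plain pandas as follows. The DataFrame, its `region` and `value` columns, and the sample sizes are made up for illustration; the course may implement these techniques differently.

```python
import pandas as pd

# Hypothetical data: 12 rows spread unevenly over three regions.
df = pd.DataFrame({
    "region": ["north"] * 6 + ["south"] * 4 + ["west"] * 2,
    "value": range(12),
})

# Simple random sampling: every row has the same chance of selection.
simple = df.sample(n=4, random_state=0)

# Stratified sampling: draw the same fraction from each region,
# so the sample keeps the regional proportions of the full data.
stratified = df.groupby("region").sample(frac=0.5, random_state=0)

# Cluster sampling: pick whole regions at random and keep all their rows.
chosen = pd.Series(df["region"].unique()).sample(n=2, random_state=0)
cluster = df[df["region"].isin(chosen)]

# Systematic sampling: take every 3rd row, starting from the first.
systematic = df.iloc[::3]

print(stratified["region"].value_counts())
print(systematic)
```

Fixing `random_state` makes the draws repeatable, which helps when comparing techniques on the same data.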
Upon completion, you'll be able to generate samples and compute statistics using various tools and methods.

Objectives

Core Statistical Concepts: Statistics & Sampling with Python

  • discover the key concepts covered in this course
  • install the latest versions of pandas and visualization modules used to analyze data
  • load data from a CSV file into a pandas DataFrame and perform some initial analysis
  • calculate the mean and median of a distribution using your own functions and compare the results with the built-in pandas functions
  • use Seaborn and Matplotlib to visualize a distribution and show where the mean, median, and mode fall within it
  • compute and visualize the standard deviation and variance of a distribution
  • implement simple random and stratified sampling on a data frame
  • use pandas to generate a sample using cluster and systematic sampling
  • create a balanced sample using random undersampling and oversampling
  • generate synthetic data in order to create a balanced sample using the Synthetic Minority Over-sampling Technique (SMOTE)
  • summarize the key concepts covered in this course
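
The balancing objectives above can be approximated without extra dependencies. The sketch below uses plain pandas for random undersampling and oversampling rather than the imblearn module the course mentions, and it does not cover SMOTE. The DataFrame and its `label` column are hypothetical.

```python
import pandas as pd

# Hypothetical imbalanced dataset: 8 majority rows and 2 minority rows.
df = pd.DataFrame({
    "feature": range(10),
    "label": ["majority"] * 8 + ["minority"] * 2,
})

counts = df["label"].value_counts()
smallest = int(counts.min())  # size of the rarest class
largest = int(counts.max())   # size of the most common class

# Random undersampling: shrink every class to the size of the rarest one.
under = df.groupby("label").sample(n=smallest, random_state=0)

# Random oversampling: grow every class to the size of the largest one,
# drawing with replacement so minority rows are duplicated.
over = df.groupby("label").sample(n=largest, replace=True, random_state=0)

print(under["label"].value_counts())
print(over["label"].value_counts())
```

Undersampling discards majority rows, while oversampling repeats minority rows; neither creates new data points, which is what distinguishes them from synthetic approaches such as SMOTE.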