5910 Breckenridge Pkwy Suite B, Tampa, FL. 33610
(800) 272-0707

SkillSoft Explore Course

IT Professional Curricula     Enterprise Database Systems Solution Area     Data Science     Analyzing Data Using Python

For data analysis to be useful and accurate, the analyzed data needs to be cleaned and curated. There are copious methods to achieve this in pandas. In this course, you'll learn how to identify and eliminate duplicates in pandas.


You'll start by using the pandas cut method to discretize data into bins, using bins to plot histograms and identify outliers using box-and-whisker plots. You'll parse and work with datetime objects read in from strings and convert string columns to datetime using the dateutils python library.


Moving on, you'll master different pandas methods for aggregating data - including the groupby, pivot, and pivot_table methods. Lastly, you'll perform various joins - inner, left outer, right outer, and full outer - using both the merge and join methods.



Objectives

Analyzing Data Using Python: Cleaning & Analyzing Data in Pandas

  • discover the key concepts covered in this course
  • identify and deal with duplicate records
  • summarize records into bins or categories
  • compute aggregations on data
  • perform common grouping and aggregation operations
  • use pivot tables to explore data
  • use pivot tables to summarize data
  • combine and merge records
  • perform inner join operations using the merge() method
  • perform left and right join operations using the merge() method
  • implement joins using the join() method
  • manipulate and analyze time series data
  • summarize the key concepts covered in this course