Data Preparation and Manipulation
Details
Start: April 6, 2022, 1 p.m.
End: April 6, 2022, 5 p.m.
Location:
Objectives: At the end of the workshop you will be able to retrieve data from different sources into a tabular data structure – data frame or data series. You will learn how to preprocess the data into a format that will enable accurate and fast analysis. You will be able to implement optimized data processing tools with emphasis on vectorization techniques.
material: Python, Pandas, NumpyEnd: April 6, 2022, 5 p.m.
Location:
Background: description of the data frame and series data structures.
Data loading: importing data from different sources, with emphasis on CSV files into a data frame.
Data cleaning: learning typical cases of data corruption and how to test and clean the data.
Data wrangling: reshaping and merging data from different sources.
Data operations: efficient transformations and aggregation of the data using array operations.
Solve a work example.