This online day school aims to provide learners with a comprehensive understanding of data manipulation using the Pandas library.
The day starts by teaching you how to read and save data from and to different file formats, such as CSV files, Excel sheets, and JSON files. You will also learn how to clean up data by dealing with missing values, duplicate values, sorting based on specific columns, and replacing specific values. In addition, you will gain an understanding of index, multi- and hierarchical index as well as multi-row headers.
The day also covers different ways to select data based on row or column values, which is equivalent to SQL select statements with various filtering conditions. We will explore how to transpose, join, concatenate, merge and reshape tables, with various important concepts and configurations to perform these operations. Additionally, you will learn how to create pivot tables and apply the GroupBy operator, explaining what these concepts are, why they are useful and how to apply them and obtain their results.
Furthermore, you will understand how to create summaries, binning and aggregations of data by applying existing or user-defined functions. Finally, the day will cover how to generate basic plots and visualisations. By the end of the day, you will have gained the necessary skills to work with data using Pandas, a widely used library in the field of data science.
Basic knowledge of Python programming and familiarity with Python data types and data structures, such as dictionaries and lists, is expected to benefit from this day.
Please note: this event will close to enrolments at 23:59 BST on 1 May 2024.