The R part of course (first day) will be focused on presenting the basics of data analysis in R and visualization of data. The course will cover the introduction to the R statistical language introducing the basic data types and workflow. Afterwards, packages from the “tidyverse” collection will be presented. These includes packages for the loading of data, preprocessing data, basic data exploration, and visualization.
The Python oriented part (second day) will introduce essential data-scientific packages and will be complemented with hands-on exercises that will demonstrate their usage with real world data analytic problems, and showing how to tackle such problems.
The course will be up to 50% hands-on exercises covering all topics to practice the techniques, and patterns gained.
Purpose of the course (benefits for the attendees)
Target audience: Users that want to use Python and/or R for data analysis and prototyping. The participants will learn basic and intermediate skills for exploratory data analysis and visualization in the programming languages of R and Python.
About the tutor(s)
Tomáš Martinovič obtained his PhD in computational sciences at IT4Innovations, VSB - Technical University of Ostrava in 2018. From 2015 to 2018 he worked in a team focused on analysis of complex dynamical systems, where he worked on scalable implementations of algorithms from the field of nonlinear time series analysis. Since the start of 2019 he has been working in a team focused on high performance data analysis with the defined objective of research and transfer of knowledge in cooperation with industry.
Stanislav Böhm has a PhD in computer science, and is a researcher at IT4Innovations. He is interested in distributed systems, verification, and scheduling.
|9:30 - 10:00||Registration|
|10:00 - 11:30|
Data import in R
Tidying data in R
|11:30 - 13:00||lunch|
|13:00 - 14:30||Exploratory analysis with tidyverse in R|
|14:30 - 15:00||coffee|
|15:00 - 16:30|
Advanced data visualization and analysis with ggplot2 and trelliscopejs in R
Q & A
|9:45 - 10:45||Introduction to Pandas|
|10:45 - 11:00||coffee|
|11:00 - 12:00||Hands-on|
12:00 - 13:00
|13:00 - 14:30||Exploratory analysis in Pandas|
Capacity and Fees
- See the links below for how to get to the campus of VŠB - Technical University Ostrava, and to the IT4Innovations building.
- Documentation for IT4Innovations' computer systems is available at https://docs.it4i.cz/.
This event was partially supported by The Ministry of Education, Youth and Sports from the National Programme of Sustainability (NPU II) project „IT4Innovations excellence in science - LQ1602“ and partially by the PRACE-5IP project - the European Union’s Horizon 2020 research and innovation programme under grant agreement No 730913.