SylabUZ

Generate PDF for this page

Methods and Tools for Data Processing - course description

General information
Course name Methods and Tools for Data Processing
Course ID 11.3-WK-DEED-MTDP-S23
Faculty Faculty of Exact and Natural Sciences
Field of study Data Engineering
Education profile academic
Level of studies Second-cycle studies leading to MS degree
Beginning semester summer term 2023/2024
Course information
Semester 2
ECTS credits to win 5
Available in specialities Modelling and data analysis
Course type optional
Teaching language english
Author of syllabus
  • dr Maciej Niedziela, prof. UZ
Classes forms
The class form Hours per semester (full-time) Hours per week (full-time) Hours per semester (part-time) Hours per week (part-time) Form of assignment
Lecture 30 2 - - Credit with grade
Laboratory 30 2 - - Credit with grade

Aim of the course

The aim of the course is to familiarize the student with the application of data processing methods and tools using the R program. After completing this course, the student should be prepared to use independently the studied methods and tools to solve practical problems specific to the field of data analysis.

Prerequisites

Fundamentals of programming.

Scope

Lecture/Laboratory:

1. Data preprocessing.

  • Data cleaning.
  • Handling missing data.
  • Improving wrong data.
  • Numerical and categorical variables.
  • Time series.

2. Data presentation and visualization methods.
3. Regression modeling. Linear and logistic regression.
4. Selected association rules and data clustering methods.
5. Examples of applications of data processing methods and tools.

Teaching methods

Lecture: traditional and problem-based. 
Laboratory: solving data mining problems using R program. Discussion.

Learning outcomes and methods of theirs verification

Outcome description Outcome symbols Methods of verification The class form

Assignment conditions

Laboratory evaluation is based on tests (80%) with tasks of varying difficulty, allowing to assess whether the student has achieved the learning outcomes to a minimum degree, and activity in class (20%). The lecture ends with an exam in the form of a test.

The final course evaluation consists of the grade from the laboratory (70%) and the grade from the lecture (30%).

The condition for passing the course is positive grades from the laboratory and the exam.

Recommended reading

  1. D.T. Larose, Data Mining Methods and Models, Wiley, 2006.
  2. M. Bramer, Principles of Data Mining, Springer, 2007.
  3. C.C. Aggarwal, Data Mining, Springer, 2017.

Further reading

  1. D.T. Larose, Discovering Knowledge in Data, Wiley, 2014.

Notes


Modified by dr Maciej Niedziela, prof. UZ (last modification: 11-04-2024 16:04)