SylabUZ
Faculty of Mathematics, Computer Science and Econometrics
summer term 2023/2024
Data Engineering - Second-cycle studies leading to MS degree
Practical Applications of Data Mining Systems
Practical Applications of Data Mining Systems - course description
General information
Course name
Practical Applications of Data Mining Systems
Course ID
11.3-WK-DEED-PADMS-S22
Faculty
Faculty of Mathematics, Computer Science and Econometrics
Field of study
Data Engineering
Education profile
academic
Level of studies
Second-cycle studies leading to MS degree
Beginning semester
summer term 2023/2024
Course information
Semester
3
ECTS credits to win
3
Available in specialities
Data exploration systems
Course type
optional
Teaching language
english
Author of syllabus
Classes forms
The class form
Hours per semester (full-time)
Hours per week (full-time)
Hours per semester (part-time)
Hours per week (part-time)
Form of assignment
Lecture
15
1
-
-
Credit with grade
Laboratory
30
2
-
-
Credit with grade
Aim of the course
Acquire the modeling skills required to understand and store big data in big data sets.
Using skills to make decisions such as cancer detection, fraud detection, customer segmentation and machine downtime prediction.
Learning about the data mining process and modeling techniques using one IBM SPSS Modeler program.
Creating models based on selected data, testing models with historical data, using current data.
Prerequisites
Save changes
Fundamental of statistics.
Scope
Introduction to data mining
CRISP-DM methodology
Introduction to SPSS Modeler - a predictive data mining workshop
SPSS Modeler interface
Data retrieval process
Understanding the business
Understanding data
Data preparation
Modeling techniques
Introduction to modeling techniques
Cluster analysis (unsupervised learning)
Classification and prediction (supervised learning)
Classification, training and testing
Sampling in classification
Predictive Modeling Algorithms in SPSS Modeler
Automatic selection of algorithms
Model evaluation
Performance evaluation data
Accuracy as a performance evaluation tool
Overcoming accuracy limits
ROC Curves
Implementation on IBM Bluemix
Evaluating new data
Implementation of a predictive model
What is IBM Bluemix?
Predictive Modeling: Cloud Deployment
SPSS Collaboration and Implementation Services
Teaching methods
Conventional lecture, problem-based lecture. Laboratory exercises. Discussion.
Learning outcomes and methods of theirs verification
Outcome description
Outcome symbols
Methods of verification
The class form
Assignment conditions
The grade for the laboratory will be based on the results from the colloquium and/or projects (80%) and activity in classes (20%).
Recommended reading
1. Axel Buecker, Theresa Morelli, Colin Shearer IBM SPSS predictive analytics: Optimizing decisions at the point of impact An IBM Redguide publication 2010
Further reading
Notes
Modified by dr Maciej Niedziela (last modification: 11-04-2024 16:06)