Web-based training
Contains PDF course guide, as well as a lab environment where students can work through demonstrations and exercises at their own pace.
This course presents advanced models to predict categorical and continuous targets. Before reviewing the models, data preparation issues are addressed such as partitioning, detecting anomalies, and balancing data. The participant is first introduced to a technique named PCA/Factor, to reduce the number of fields to a number of core fields, referred to as components or factors. The next units focus on supervised models, including Decision List, Support Vector Machines, Random Trees, and XGBoost. Methods are reviewed to combine supervised models and execute them in a single run, both for categorical and continuous targets.
If you are enrolling in a Self Paced Virtual Classroom or Web Based Training course, before you enroll, please review the Self-Paced Virtual Classes and Web-Based Training Classes on our Terms and Conditions page, as well as the system requirements, to ensure that your system meets the minimum requirements for this course. http://www.ibm.com/training/terms
1. Preparing data for modeling
Address general data quality issues
Handle anomalies
Select important predictors
Partition the data to better evaluate models
Balance the data to build better models
2. Reducing data with PCA/Factor
Explain the idea behind PCA/Factor
Determine the number of components/factors
Explain the principle of rotating a solution
3. Creating rulesets for flag targets with Decision List
Explain how Decision List builds a ruleset
Use Decision List interactively
Create rulesets directly with Decision List
4. Exploring advanced supervised models
Explain the principles of Support Vector Machine (SVM)
Explain the principles of Random Trees
Explain the principles of XGBoost
5. Combining models
Use the Ensemble node to combine model predictions
Improve model performance by meta-level modeling
6. Finding the best supervised model
Use the Auto Classifier node to find the best model for categorical targets
Use the Auto Numeric node to find the best model for continuous targets
Please refer to course overview
Familiarity with the IBM SPSS Modeler environment (creating, editing, opening, and saving streams).
Familiarity with basic modeling techniques, either through completion of the courses Predictive Modeling for Categorical Targets Using IBM SPSS Modeler and/or Predictive Modeling for Continuous Targets Using IBM SPSS Modeler, or by experience with predictive models in IBM SPSS Modeler.
Business Analysts
Data Scientists
Users of IBM SPSS Modeler responsible for building predictive models