Chapter 6

Title: Naive Bayes Classification II

Summary

Chapters 3 to 6 describe classification use cases and introduce the k-nearest neighbors (k-NN) and Naive Bayes learning algorithms. Chapter 3 applies k-NN to the evaluation of teaching assistants. In Chapter 4, k-NN is used to classify glass types based on their chemical components, and the RapidMiner process is extended with Principal Component Analysis (PCA) to pre-process the data and improve classification accuracy. Chapter 5 explains Naive Bayes as an algorithm for generating classification models and uses it to build a credit approval model that decides whether a loan application from a potential or existing customer should be approved, i.e., whether the customer is likely to repay the loan as agreed. Chapter 6 uses Naive Bayes to rank applications for nursery schools, introduces the RapidMiner operator for importing Excel sheets, and provides further explanations of Naive Bayes.
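For readers who want a feel for what Naive Bayes does with purely nominal attributes before opening RapidMiner, the following is a minimal sketch in plain Python. It is not the RapidMiner implementation: the function names, the toy rows, the attribute values, and the use of Laplace (add-one) smoothing are all assumptions made for illustration, and none of the data is taken from the Nursery dataset.

```python
from collections import Counter, defaultdict
import math


def train_naive_bayes(rows, labels):
    """Count class frequencies and per-class attribute-value frequencies."""
    class_counts = Counter(labels)
    # value_counts[(attribute_index, label)][value] -> number of occurrences
    value_counts = defaultdict(Counter)
    # distinct values seen per attribute, needed for Laplace smoothing
    domains = defaultdict(set)
    for row, label in zip(rows, labels):
        for i, value in enumerate(row):
            value_counts[(i, label)][value] += 1
            domains[i].add(value)
    return class_counts, value_counts, domains


def predict(model, row):
    """Return the class with the highest log posterior score for one row."""
    class_counts, value_counts, domains = model
    total = sum(class_counts.values())
    scores = {}
    for label, count in class_counts.items():
        score = math.log(count / total)  # log prior of the class
        for i, value in enumerate(row):
            # Laplace (add-one) smoothing avoids zero probabilities
            numerator = value_counts[(i, label)][value] + 1
            denominator = count + len(domains[i])
            score += math.log(numerator / denominator)
        scores[label] = score
    return max(scores, key=scores.get)


# Hypothetical toy rows, invented for illustration (not Nursery data)
rows = [
    ("stable", "good"),
    ("stable", "average"),
    ("unstable", "poor"),
    ("mixed", "poor"),
]
labels = ["accept", "accept", "reject", "reject"]

model = train_naive_bayes(rows, labels)
prediction = predict(model, ("stable", "good"))
print(prediction)
```

The sketch treats every attribute as conditionally independent given the class, which is the "naive" assumption; working in log space simply avoids multiplying many small probabilities.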

Table of Contents

6.1 Dataset
6.1.1 Nursery Dataset
6.1.2 Basic Information
6.1.3 Examples
6.1.4 Attributes
6.2 Operators in this Use Case
6.2.1 Read Excel Operator
6.2.2 Select Attributes Operator
6.3 Use Case
6.3.1 Data Import
6.3.2 Pre-processing
6.3.3 Model Training, Testing, and Performance Evaluation
6.3.4 A Deeper Look into the Naive Bayes Algorithm

Dataset: Please download the dataset from the following location: http://archive.ics.uci.edu/ml/datasets/Nursery

Processes (Chapters 3-6): Click here to download