Chapter 5

Title: Naive Bayes Classification I 


The Chapters 3 to 6 describe classification use cases and introduce the k-nearest neighbors (k-NN) and Naive Bayes learning algorithms. Chapter 3 applies k-NN for the evaluation of teaching assistants. In Chapter 4 k-NN is used to classify different glass types based on chemical components and the RapidMiner process is extended by Principal Component Analysis (PCA) to better pre-process the data and to improve the classification accuracy. Chapter 5 explains Naive Bayes as an algorithm for generating classification models and uses this modeling technique to generate a credit approval model to decide whether a credit loan for which a potential or existing customer applies should be approved or not, i.e. whether it is likely that the customer will pay back the credit loan as desired or not. Chapter 6 uses Naive Bayes to rank applications for nursery schools, introduces the RapidMiner operator for importing Excel sheets, and provides further explanations of Naive Bayes.

Table of Contents

5.1 Introduction
5.2 Dataset
5.2.1 Credit Approval Dataset
5.2.2 Examples
5.2.3 Attributes
5.3 Operators in This Use Case
5.3.1 Rename by Replacing Operator
5.3.2 Filter Examples Operator
5.3.3 Discretize by Binning Operator
5.3.4 X-Validation Operator
5.3.5 Performance (Binominal Classification) Operator
5.4 Use Case
5.4.1 Data Import
5.4.2 Pre-processing
5.4.3 Model Training, Testing, and Performance Evaluation

Dataset: Please download the dataset from the following location:

Processes (Chapters 3-6): Click here to download