An Analysis on Better Testing than Training Performances on the Iris Dataset

Marten Schutten, Marco Wiering

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

139 Downloads (Pure)

Abstract

The Iris dataset is a well known dataset containing information on three different types of Iris flowers. A typical and popular method for solving classification problems on datasets such as the Iris set is the support vector machine (SVM). In order to do so the dataset is separated in a set used for training and a set used for testing. The error rate, after training, for the training set should be lower than the error rate on the test set. However, in this paper we show that when solving the classification problem for the Iris dataset with SVMs this is not the case. Therefore, we provide an analysis of the Iris dataset and the classification models in order to find the origin of this interesting observation.
Original languageEnglish
Title of host publicationBelgian Dutch Artificial Intelligence Conference
Publication statusPublished - 10-Nov-2016

Keywords

  • Supervised learning
  • Machine learning
  • Support Vector Machine

Cite this