The article “Modeling wine preferences by data mining from physicochemical properties” by Paulo Cortez, António Cerdeira, Fernando Almeida, Telmo Matos and José Reis, published on sciencedirect.com in 2009, reviews and proposes data mining approaches to predict wine taste quality evaluations. The analysis is based on a dataset that is large compared to others used in this domain.
The article reviews three techniques used for the predictions: support vector machines, multiple regression, and neural networks.
This work replicates the support vector machine (SVM), which the article reports as the most accurate method for this prediction. A naive Bayes classifier is applied as an additional, alternative classification method alongside the SVM.
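To make the naive Bayes idea more concrete, the following is a minimal sketch of a Gaussian naive Bayes classifier in plain Python. The function names, the `var_floor` parameter, the two-feature toy samples and the "low"/"high" labels are all assumptions made for illustration. None of them come from the article or from the wine dataset, and the implementation used later in this work may differ.

```python
import math
from collections import defaultdict


def fit_gaussian_nb(X, y, var_floor=1e-9):
    """Estimate a log prior plus per-feature mean and variance for each class."""
    by_class = defaultdict(list)
    for row, label in zip(X, y):
        by_class[label].append(row)

    model = {}
    for label, rows in by_class.items():
        n = len(rows)
        columns = list(zip(*rows))
        means = [sum(col) / n for col in columns]
        # var_floor (hypothetical parameter) avoids division by zero
        # when a feature is constant within a class.
        variances = [
            sum((v - m) ** 2 for v in col) / n + var_floor
            for col, m in zip(columns, means)
        ]
        model[label] = (math.log(n / len(X)), means, variances)
    return model


def predict_gaussian_nb(model, row):
    """Return the class with the highest log posterior for one sample."""
    def log_posterior(params):
        log_prior, means, variances = params
        # "Naive" assumption: features are independent given the class,
        # so the per-feature Gaussian log densities are simply summed.
        log_likelihood = sum(
            -0.5 * math.log(2 * math.pi * var) - (x - m) ** 2 / (2 * var)
            for x, m, var in zip(row, means, variances)
        )
        return log_prior + log_likelihood

    return max(model, key=lambda label: log_posterior(model[label]))


# Made-up toy samples, NOT taken from the wine dataset: two features per row.
X_toy = [[9.0, 0.30], [9.4, 0.35], [9.2, 0.28],
         [12.1, 0.60], [12.5, 0.66], [11.9, 0.58]]
y_toy = ["low", "low", "low", "high", "high", "high"]

model = fit_gaussian_nb(X_toy, y_toy)
print(predict_gaussian_nb(model, [9.1, 0.31]))
print(predict_gaussian_nb(model, [12.2, 0.62]))
```

The sketch works with log probabilities rather than raw probabilities so that multiplying many small densities does not underflow. In practice a library implementation would likely be preferable to hand-written code.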
The dataset is split into two separate CSV files: one contains samples of white wines and the other samples of red wines. The data