BreastCancerClassification-master

所属分类:matlab编程
开发工具:matlab
文件大小:11KB
下载次数:3
上传日期:2020-08-06 17:46:01
上 传 者容安
说明:  这是一个可以运行成功的决策树分类代码,运行主程序即可。
(The decision tree classification code can run successfully.)

文件列表:
BreastCancerData.csv (19889, 2019-07-25)
LICENSE (1067, 2019-07-25)
dataSetInfo.txt (5657, 2019-07-25)
main.m (3592, 2019-07-25)

# Breast Cancer Classification ## Introduction A small and compact Matlab script that solves a breast cancer classification with KNN, SVM, Naive Bayes and Decision Tree. The dataset used is the breast-cancer-dataset-wisconsin ([http://mlr.cs.umass.edu/ml/datasets/Breast+Cancer+Wisconsin+(Original)](http://mlr.cs.umass.edu/ml/datasets/Breast+Cancer+Wisconsin+(Original))) Info about where this dataset has been used and which are the characteristics can be found in dataSetInfo.txt. The aim of the project was to have a first touch with solving classification problems on Matlab, so I have not spent much time optimizing the parameters or in general taking a lot of metrics into account. ## Steps * We import the .csv with our characteristics. We get rid of the first column which has the id and we create two matrixes. The first one has our data (10xnumbOfRows) and the second one our expected labels (1xnumbOfRows). * We normalise the values of our data on the range [0, 1]. * We then perform KNN, SVM, Naive Bayes and Decision Tree achieving a max of 97% accuracy with correctly parametrised Naive Bayes (kfold cross validation with k = 10). The metrics used where: Accuracy, Sensitivity and Specificity.

近期下载者

相关文件


收藏者