疾病检测分析:用于乳腺癌和糖尿病发病率决策的简单线性凸规划算法

Disease Detection Analytics: A Simple Linear Convex Programming Algorithm for Breast Cancer and Diabetes Incidence Decisions

DECISION SCIENCES · 2018
被引 4
人大 AABS 3

中文导读

提出一种基于线性凸规划的分类算法,用于检测乳腺癌、糖尿病等疾病,并在多个数据集上与其他机器学习方法比较,证明其在假阳性和假阴性测试中表现更优。

Abstract

ABSTRACT In the last couple of decades, data analytics‐based pattern classification methods for disease detection have gained much traction in healthcare research and applications. The current study builds linear programming (LP) models for detecting disease incidence. We propose sequential steps of a convex programming algorithm to construct decision boundary functions to classify patterns in disease detection data. We compare the performance of our LP‐based classifier with others (neural network, decision tree, k ‐nearest‐neighbor, logistic regression, naïve‐Bayes, and support‐vector‐machine) on four datasets: two different ones for breast cancer, and one each for diabetes and diabetic retinopathy. Statistical tests reveal that the LP classifier did significantly better than the other methods in five out of eight false‐positive and false‐negative test cases. There is not a statistically significant difference in performance in the remaining three tests between the LP classifier and the best alternative method. Most importantly, the LP classifier has significantly superior performance in both diabetes detection and diabetic retinopathy data. The success of the proposed LP classifier results from avoiding “modeling noise” and “memorization of training data.” We recommend that our proposed LP classifier be among the set of classifiers for use in disease detection analytics.

疾病检测线性规划分类器机器学习医疗数据分析