Intelligent system Ior the classiIication oI diabetic diseases using
artiIicial neural network
Madhu Bala 1 and Rajeev Kumar 2
1 Student, DAV Institute of Engineering. & Tech. 2 Assistant Professor, IEEE Member, DAV Institute of Engineering. &Tech. Abstract Diabetes Mellitus has become a serious and major issue Iaced by majority oI population. It happens when our body is not able to generate enough insulin. Due to diabetes, major risks arise Ior the development oI many chronic and acute diseases like cardiovascular diseases, nephropathy, neuropathy, stroke, eye, gangrene problem and nerve damage etc. In this paper, an intelligent system is proposed to detect the chronic diabetic diseases, generally Iaced by many people using an artiIicial neural network (ANN). From the literature review, it is Iound that researchers have not proposed a single system that cover all complications oI diabetes based on the symptoms oI diseases that mostly arise due to diabetes. As most oI the diseases can have common symptoms, So, the main goal oI this study is the classiIication oI diabetic diseases based on the symptoms. A total oI Iive chronic diseases are classiIied in this paper that are cardiovascular diseases, nephropathy, gangrene, eye and stroke problem. The dataset considered Ior the training oI an ANN is collected Irom the various physicians and the best validation perIormance is achieved to 12 epochs i.e.,75.95. 2013 Elsevier Science. All rights reserved. Keywords: ANN, Diabetic diseases, Expert system. 1. Introduction This paper deals with designing oI an intelligent system to classiIy the chronic Diabetic diseases that are very common in now-a-days and many people are suIIered Irom these diseases. It is one oI the world`s most growing disease at a large scale. Diabetes is a state associated with very high glucose level in the blood. Approximately, 346 million people are Iacing the problem oI diabetes |1| in which 20 million people are Type-1 diabetic patient i.e., is a state in which pancreas is not able to produce enough insulin to control the regulatory cycle oI glucose. The Irequent Iorm oI diabetes mellitus is Type-2. Due to Type-2 diabetes, mortality rate happens 5-10 years early than people having no diabetes symptoms. In this stage, cells are unable to utilize the insulin produced by pancreas. II any oI the person is diagnosed as diabetic patient, Even that person has not made any discussion with specialists/doctors Ior treatment then the chances are more increased to be suIIered Irom the chronic diseases originated due to diabetes like Cardiovascular diseases, Nephropathy, Neuropathy, Eye problems, Gangrene, Stroke problem, Blindness etc. All these chronic diseases are very harmIul and day-by-day, it makes a home in the human body and leads to the mortality aIter some passage oI time. So, the early and timely detection oI all these diseases is very important to make control over them. Moreover, the diagnostic procedures oI all these diseases are also very cumbersome, time-consuming and expensive. Also, many Iactors are included in their diagnostic process that makes a harder job Ior a physician to take an early decision. Hence, a step is made towards to design an intelligent system to classiIy all the chronic diseases that arises due to diabetes without consuming time. It can also helpIul Ior a physician along with the containment oI the patient's history. Intelligent computational techniques like ArtiIicial Neural Network (ANN), Fuzzy InIerence System (FIS) and Adaptive Neuro-Fuzzy InIerence System (ANFIS) are play a major role to represent the human knowledge. Also, these techniques are widely used in the implementation oI intelligent and expert systems Ior the diagnosis oI many disease like as cancer, liver, lung diseases etc. ANN provides a number oI advantages like 1) It is able to discover all possible interactions between promising variable. 2) Opportunity to use multiple training algorithms. Proceedings oI International ConIerence on Computing Sciences WILKES100 ICCS 2013 ISBN: 978-93-5107-172-3 45 Elsevier Publications, 2013 Madhu Bala and Rajeev Kumar 3) Needs less statistical training. 4) Detection oI complex relationships. They have been used as a substitute to schematic statistical technique like regression and discriminate analysis. The main objective oI this paper is the Iabrication oI an intelligent system Ior the classiIication oI chronic diabetic diseases arises only due to diabetes. For this, an artiIicial neural network is proposed Ior the classiIication oI chronic diabetic diseases. The most important Iactors considered in this study is diagnostic process that is the evaluation oI data gathered Irom patients and determination oI experts. The knowledge/data is acquired Irom various physicians. This system will help those person that are already Iacing the problem oI diabetes and can be suIIered Irom other major diseases and also can be used to control the rapid growth oI diseases. The conclusion is drawn on the basis oI collected symptoms The rest oI paper is incorporated as Iollows. Section 2 deals with previous studies. The methodology adopted is discussed in Section 3. Consequently, Section 4 includes the summarization oI results. Section 5 concerns with the conclusion and Iuture scope and Section 6 includes all the reIerences oI research paper and the other resource considered in this work. 2. Related Work Many strategies have been implemented on Pima Indian diabetes dataset Ior the diagnosis oI diabetes and Ior many other diseases. Several studies and results are provided by many researchers on this dataset by applying numerous intelligent methods and techniques Ior the diagnosis oI diabetes and achieved a very high classiIication accuracy rate. In |2|, authors have described a diagnosis system using general regression neural network (GRNN) by using Pima Indian diabetes dataset. The classiIication accuracy achieved was 80.21 with GRNN. They have also implemented multilayer neural network with Levenberg-Marquardt algorithm and the classiIication accuracy achieved was 77.08, which was lesser than GRNN. Ince et.al. |3|, elaborated two neural network training methods such as conventional back-propagation (BP) and particle swarm optimization (PSO) to deIine the architecture space over Ieed-Iorward and Iully-connected artiIicial neural networks (ANNs). Both techniques were evaluated on the basis oI classiIication over three medical diagnosis problems (breast cancer, heart disease and diabetes. It is Iound that the PSO algorithm has usually a better generalization ability across the architecture space whereas BP can occasionally provide better training and/or test classiIication perIormance Ior some network conIigurations but PSO is capable oI achieving minimum test classiIication errors regardless oI the training depth. In terms oI computational complexity, BP is in general superior to PSO Ior the entire architecture space used. Shanthi et al. |4|, proposed a hybrid model Ior the prediction oI stroke diseases. The back-propagation algorithm was used to train the ANN. The perIormance oI the system was tested by using 50 patients who had some signs oI stroke disease. The results were validated Irom physician aIter the successIul training oI ANN by 50 patients. The accuracy obtained Irom neural network was 88. For the enhancement oI weight optimization and Ieature selection, ANN was combined with genetic algorithm. Due to integration, accuracy was increased to 90 and in case oI weight optimization, it was 99. Kavitha et al. |5|, reported a evolutionary architecture Ior the detection oI heart disease using integration oI Ieed Iorward neural network architecture and genetic algorithm in which neural network was trained by providing the back propagation learning algorithm. To enhance the resultant value, neural network was integrated with genetic algorithm and the perIormance was good as well as reliable that was missing in gradient decent algorithm. Priya et al. |6|, examined two models probabilistic neural network (PNN) and support vector machine (SVM) Ior the diagnosis oI diabetic retinopathy. The operations oI both models were compared. The accuracy obtained Irom PNN and SVM was was 89.60 and 97.608 respectively. The sensitivity obtained Irom both oI models was 90. Sut et al.|7| studied six diIIerent algorithms Ior the prediction oI mortality in stroke patients using six diIIerent MLP neural networks |quick propagation (QP), Levenberg-Marquardt (LM), back propagation (BP), quasi- newton (QN), delta bar delta (DBD) and conjugate gradient descent (CGD)|. The perIormances were compared using the receiver operating characteristic (ROC) curve method. The highest accuracy was achieved by QP with speciIicity 81.3, sensitivity 78.4, accuracy 80.7 and area under curve (AUC) was 0.869 values and CGD achieved the lowest speciIicity 61.5, sensitivity 58.7, accuracy 60.8 and AUC 0.636 values. The multilayer neural networks (MLNN) have been eIIectively used Ior the diagnosis oI various diseases |8-10|. The back-propagation (BP) algorithm |11| is mostly considered as a commanding tool Ior providing training to MLNNs. But, its disadvantages is the slow convergence rate that leads to suboptimal 46 Elsevier Publications, 2013 Intelligent System Ior the ClassiIication oI Diabetic Diseases using ArtiIicial Neural Network solutions |12-13|. LevenbergMarquardt (LM) algorithm |14| provides Iaster convergence and better estimation results than other training algorithms|15|. The probabilistic neural network (PNN) employed a supervised training set to Iormulate distribution Iunctions. Its structure is very useIul Ior the classiIication oI problems and disease diagnosis systems |10,16- 17|.In |18|, authors have oIIered a recurrent neural networks (RNN) model Ior the blood glucose metabolism oI diabetes. In |19|, authors have introduced a Radial Basis Function (RBF) neural networks model Ior the diagnosis oI diabetes mellitus. Ucman et al. have used both Multilayer Perceptron (MLP) and RBF Ior classiIication oI MCA stenosis in diabetes. In |10, 20|, a comparative pima-diabetes disease diagnosis was suggested. They have used a MLP neural network model which is trained by LevenbergMarquardt (LM) algorithm and a probabilistic neural network. In |21|, an MLP model was presented to diagnose the level oI sugar in diabetics Irom bad breath using electronic nose. 3. Proposed System The proposed intelligent system is decomposed into two modules. In the Iirst module, Iour basic Ieatures are collected Irom the patients like name, age, gender and address. In second stage, artiIicial neural network is implemented to diagnose the diabetic disease by inspecting the symptoms entered by patients. In this work, 43 symptoms are undertaken Ior the observation. All these symptoms are passed as an input to ANN Ior the classiIication oI diabetic diseases. The proposed model is shown in Fig.1. 3.1. Reduction of dimensions (input variable) As in this study, total 43 symptoms are considered as input variables, hence, to reduce the total dimensions oI inputs, Iollowing Iormula is used based on the index value (Ii) given to each symptom according to the importance in the respective disease and the binary vector oI symptoms(Si). Disease D i _ (S i * I i ) (1) Where, S i Values oI symptoms, I i Index number oI symptoms. The multiplication oI S i * I i will produce diIIerent values according to the diIIerent diseases. The Iinal diagnosis oI the disease or result will depend on yielding the maximum value. Result Max ( _ D i ) (2) Fig.1. Block Diagram oI proposed system Collection oI Diabetic Disease Dataset Diabetic diseases dataset including 43 Ieatures Implementation oI ArtiIicial Neural Network ClassiIication oI Diabetic diseases Reduce the dimensions oI input variables 47 Elsevier Publications, 2013 Madhu Bala and Rajeev Kumar
3.2. Implementation of ANN ArtiIicial neural network has a set oI connected input and output units through hidden layer units in which each connection has a weight associated with it as shown in Iollowing Fig.2. The goal oI artiIicial neural network is to train a system according to the human prediction. The neurons present in all the layers oI neural network are working like as neurons present in human brain. The main advantage oI ANN is that it provides a variety oI algorithm Ior the training purpose. In this study, the Iollowing type oI neural network is used.
Network Type: Feed-Forward backpropagation Training Function: Trainlm (Levenberg-Marquardt) Adaptation Learning Function: learngdm (Gradient Descent) PerIormance Function: MSE (Mean Squared Error) Number oI Layers: 2
Fig.2. ArtiIicial Neural Network |22|
4. Results The dataset considered in this work contains a total oI 400 cases collected Irom various physicians. Out oI 400, 300 cases are passed as an input to train the neural network and the output is shown in the Iollowing Fig.3 that contains the Iour Iigures. The Iirst Iigure is related with the training oI ANN in which 83 accuracy is achieved by the system. Similarly, the accuracy is achieved by testing and validation is 64 and 73 respectively. The overall perIormance obatined Irom the system is 79 aIter training.
48 Elsevier Publications, 2013 Intelligent System Ior the ClassiIication oI Diabetic Diseases using ArtiIicial Neural Network
Fig.3. Results oI ANN
The Iollowing Fig.4 represents the validation perIormance having the best validation perIormance at epoch 7,12,15 and 6 that is 84, 75, 96 and 76 respectively.
a) at epoch 7 b) at epoch 12
49 Elsevier Publications, 2013 Madhu Bala and Rajeev Kumar
c) at epoch 15 d) at epoch 6
Fig 4: Validation perIormance oI ANN at diIIerent Epochs
The Sensitivity and SpeciIicity oI the proposed system is Iound using the conIusion matrix. To compute the sensitivity and speciIicity, total 50 and 30 cases are tested and validated respectively. From the below Figure 5, it is clear that the sensitivity oI the proposed system is 78 and speciIicity is equal to 90. The accuracy oI proposed system is dependent on the sensitivity and speciIicity that is computed as:
50 Elsevier Publications, 2013 Intelligent System Ior the ClassiIication oI Diabetic Diseases using ArtiIicial Neural Network
5. Discussion and Future Work This study is based on the artiIicial neural network that is used Ior designing an intelligent system Ior the classiIication oI complications oI chronic diabetic diseases like cardiovascular diseases, nephropathy, gangrene, eye and stroke problem. The system is implemented with multilayer Ieed Iorward back-propagation neural network. The best validation perIormance is achieved only to 12 epochs. The proposed system can help an individual to diagnose itselI against the chronic diseases by entering the symptoms without any expense. Besides, advantages oI ANN, it has also some limitations like its black box nature. Due to which, the knowledge is not explicitly represented. To remove this limitation, the system can be upgraded by using another intelligent computing techniques. The Iuture work is concerned with the enhancement oI accuracy. Also, it is hoped that some more interesting results will be Iound out by Iurther exploiting the techniques. References |1| World Health Organization: Diabetes, http : // www . who.int/mediacentre/Iactsheets/Is312/en/index.html, accessed July, 2013. |2| Kamer Kayaer, Tulay Yildirim, Medical diagnosis on Pima Indian diabetes using general regression neural networks, in: Proceedings oI the international conIerence on artiIicial neural networks and neural inIormation processing (ICANN/ICONIP), pp. 181184, 2003. |3| Turker Ince, Serkan Kiranyaz, Jenni Pulkkinen, MonceI Gabbouj, "Evaluation oI global and local training techniques over Ieed-Iorward neural network architecture spaces Ior computer-aided medical diagnosis", Elsevier-Expert Systems with Applications Vol. 37, pp. 8450- 8461, 2010. |4| D. Shanthi, Dr. Sahoo. G, Dr.Saravanan.N, 'Evolving Connection Weights oI ArtiIicial Neural Network Using Genetic Algorithm With Application to the Prediction Stroke Diseases, International Journal oI SoIt Computing, Vol. 2, pp. 95-101, 2009. |5| K.S.Kavitha, K.V.Ramakrishnan, Manoj Kumar Singh, "Modeling and design oI evolutionary neural network Ior heart disease detection", International Journal oI Computer Science Issues, Vol.-7, Issue-5, pp. 272-283,Sep.2010. |6| R.Priya, P. Aruna, SVM and Neural Network based diagnosis oI Diabetic Retinopathy, International Journal oI Computer Application,Vol.-41,Issue-1,March2012. |7| Necdet Sut, Yahya elik, "Prediction oI mortality in stroke patients using multilayer perceptron neural networks", Turk J Med Sci, Vol.- 42, Issue-5, pp.886-893,2012. |8| Delen, D., Walker, G., & Kadam, A,"Predicting breast cancer survivability: A comparison oI three data mining methods", ArtiIicial Intelligence in Medicine ArtiIicial Intelligence in Medicine, Vol.-34, Issue-2,pp.113127, 2005. |9| Kayaer, K., & Yildirim, T.,"Medical diagnosis on Pima Indian diabetes using general regression neural networks", In Proceedings oI the international conIerence on artiIicial neural networks and neural inIormation processing, (ICANN/ ICONIP), pp. 181184, 2003. |10| Temurtas, F.,"A comparative study on thyroid disease diagnosis using neural networks", Expert Systems with Applications, Vol.36, pp.944949,2009. |11| Rumelhart, D. E., Hinton, G. E., & Williams, R. J,"Learning internal representations by error propagation. In D. E. Rumelhart & J. L. McClelland (Eds.)", Parallel distributed processing: Explorations in the microstructure oI cognition,Vol. 1, pp. 318362,. Cambridge, MA: MIT Press.,1986. |12| Brent, R. P.," Fast training algorithms Ior multi-layer neural nets", IEEE Transactions on Neural Networks, Vol. 2, pp.346354.,1991. |13| Gori, M., & Tesi, A.," On the problem oI local minima in backpropagation",. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol.14, pp.7685.,1992. |14| Hagan, M. T., & Menhaj, M.,"Training Ieed Iorward networks with the Marquardt algorithm.",IEEE Transactions on Neural Networks, Vol.5, pp.989993, 1994. |15| Gulbag, A., & Temurtas, F.," A study on quantitative classiIication oI binary gas mixture using neural networks and adaptive neuro Iuzzy inIerence systems", Sensors and Actuators B, Vol. 115, pp.252262,2006. |16| Gulbag, A., Temurtas, F., & Yusubov, I,"Quantitative discrimination oI the binary gas mixtures using a combinational structure oI the probabilistic and multilayer neural networks", Sensors and Actuators B: Chemical, Vol. 131, pp.196204,2007. |17| Specht, D. F.,"Probabilistic neural networks. Neural Networks", Vol. 3, pp.109118.,1990. |18| Tresp, V., Briegel, T., & Moody, J.,"Neural network models Ior the blood glucose metabolism oI a diabetic", IEEE Transactions on Neural Networks, Vol.10, Issue 5, pp.12041213,1999. |19| Venkatesan, P., & Anitha, S.," Application oI a radial basis Iunction neural network Ior diagnosis oI diabetes mellitus. Current Science", Vol.91, Issue 9,2006. |20| Ucman, E., Barisci, N., Ozan, A. T., Serhatlioglu, S., Ogur, E., Hardalac, F.,"ClassiIication oI MCA stenosis in diabetes by MLP and RBF neural network", Journal oI Medical Systems, 28(5), 47548,2004. |21| Karlik, B., & Al-Bastaki, Y.,"Bad breathe diagnosis system using OMX-GR sensor and neural network Ior telemedicine", Clinical InIormatics and Telemedicine, Vol. 2, pp.237239,2004. |22| http://www.webpages.ttu.edu/dleverin/neuralnetwork/neuralnetworks.html 51 Elsevier Publications, 2013 Index
A Adaptive neuro-fuzzy inference system (ANFIS), 45 ANFIS. see Adaptive neuro-fuzzy inference system (ANFIS) ANN. see Artificial neural network (ANN) Artificial neural network (ANN), 45 confusion matrix of, 50 Epochs, validation performance of, 50 proposed intelligent system, 4748 results of, 4850
B Back-propagation (BP), 46
F FIS. see Fuzzy inference system (FIS) Fuzzy inference system (FIS), 45
G General regression neural network (GRNN), 46 GRNN. see General regression neural network (GRNN)
L Levenberg-Marquardt (LM) algorithm, 47
M MLNN. see Multilayer neural networks (MLNN) Multilayer neural networks (MLNN), 4647 Multilayer perceptron (MLP), 47
P Particle swarm optimization (PSO), 46 PNN. see Probabilistic neural network (PNN) Probabilistic neural network (PNN), 4647 Proposed intelligent system for ANN implementation of, 48 input variable, 47 PSO. see Particle swarm optimization (PSO)
R Radial basis function (RBF) neural networks model, 47 RBF. see Radial basis function (RBF) Recurrent neural networks (RNN) model, 47