Sunteți pe pagina 1din 7

1/20/2017 UCIMachineLearningRepository:HeartDiseaseDataSet

AboutCitationPolicyDonateaDataSetContact

Search
Repository Web

ViewALLDataSets
CenterforMachineLearningandIntelligentSystems

HeartDiseaseDataSet
Download:DataFolder,DataSetDescription
Abstract:4databases:Cleveland,Hungary,Switzerland,andtheVALongBeach

DataSet Numberof
Multivariate 303 Area: Life
Characteristics: Instances:

Attribute Categorical,Integer, Numberof 198807


75 DateDonated
Characteristics: Real Attributes: 01

NumberofWeb
AssociatedTasks: Classification MissingValues? Yes 410475
Hits:

Source:

Creators:

1.HungarianInstituteofCardiology.Budapest:AndrasJanosi,M.D.
2.UniversityHospital,Zurich,Switzerland:WilliamSteinbrunn,M.D.
3.UniversityHospital,Basel,Switzerland:MatthiasPfisterer,M.D.
4.V.A.MedicalCenter,LongBeachandClevelandClinicFoundation:RobertDetrano,M.D.,Ph.D.

Donor:

DavidW.Aha(aha'@'ics.uci.edu)(714)8568779

DataSetInformation:

Thisdatabasecontains76attributes,butallpublishedexperimentsrefertousingasubsetof14ofthem.Inparticular,
theClevelanddatabaseistheonlyonethathasbeenusedbyMLresearchersto
thisdate.The"goal"fieldreferstothepresenceofheartdiseaseinthepatient.Itisintegervaluedfrom0(nopresence)
to4.ExperimentswiththeClevelanddatabasehaveconcentratedonsimplyattemptingtodistinguishpresence(values
1,2,3,4)fromabsence(value0).

Thenamesandsocialsecuritynumbersofthepatientswererecentlyremovedfromthedatabase,replacedwith
dummyvalues.

Onefilehasbeen"processed",thatonecontainingtheClevelanddatabase.Allfourunprocessedfilesalsoexistinthis
directory.

ToseeTestCosts(donatedbyPeterTurney),pleaseseethefolder"Costs"

https://archive.ics.uci.edu/ml/datasets/Heart+Disease 1/7
1/20/2017 UCIMachineLearningRepository:HeartDiseaseDataSet

AttributeInformation:
Only14attributesused:
1.#3(age)
2.#4(sex)
3.#9(cp)
4.#10(trestbps)
5.#12(chol)
6.#16(fbs)
7.#19(restecg)
8.#32(thalach)
9.#38(exang)
10.#40(oldpeak)
11.#41(slope)
12.#44(ca)
13.#51(thal)
14.#58(num)(thepredictedattribute)

Completeattributedocumentation:
1id:patientidentificationnumber
2ccf:socialsecuritynumber(Ireplacedthiswithadummyvalueof0)
3age:ageinyears
4sex:sex(1=male0=female)
5painloc:chestpainlocation(1=substernal0=otherwise)
6painexer(1=provokedbyexertion0=otherwise)
7relrest(1=relievedafterrest0=otherwise)
8pncaden(sumof5,6,and7)
9cp:chestpaintype
Value1:typicalangina
Value2:atypicalangina
Value3:nonanginalpain
Value4:asymptomatic
10trestbps:restingbloodpressure(inmmHgonadmissiontothehospital)
11htn
12chol:serumcholestoralinmg/dl
13smoke:Ibelievethisis1=yes0=no(isorisnotasmoker)
14cigs(cigarettesperday)
15years(numberofyearsasasmoker)
16fbs:(fastingbloodsugar>120mg/dl)(1=true0=false)
17dm(1=historyofdiabetes0=nosuchhistory)
18famhist:familyhistoryofcoronaryarterydisease(1=yes0=no)
19restecg:restingelectrocardiographicresults
Value0:normal
Value1:havingSTTwaveabnormality(Twaveinversionsand/orSTelevationordepressionof>0.05mV)
Value2:showingprobableordefiniteleftventricularhypertrophybyEstes'criteria
20ekgmo(monthofexerciseECGreading)
21ekgday(dayofexerciseECGreading)
22ekgyr(yearofexerciseECGreading)
23dig(digitalisusedfuringexerciseECG:1=yes0=no)
24prop(BetablockerusedduringexerciseECG:1=yes0=no)
25nitr(nitratesusedduringexerciseECG:1=yes0=no)
26pro(calciumchannelblockerusedduringexerciseECG:1=yes0=no)
27diuretic(diureticusedusedduringexerciseECG:1=yes0=no)
28proto:exerciseprotocol
1=Bruce
2=Kottus
3=McHenry
4=fastBalke
5=Balke
6=Noughton
7=bike150kpamin/min(Notsureif"kpamin/min"iswhatwaswritten!)
8=bike125kpamin/min
9=bike100kpamin/min
10=bike75kpamin/min
11=bike50kpamin/min
12=armergometer
29thaldur:durationofexercisetestinminutes
30thaltime:timewhenSTmeasuredepressionwasnoted
31met:metsachieved
https://archive.ics.uci.edu/ml/datasets/Heart+Disease 2/7
1/20/2017 UCIMachineLearningRepository:HeartDiseaseDataSet

32thalach:maximumheartrateachieved
33thalrest:restingheartrate
34tpeakbps:peakexercisebloodpressure(firstof2parts)
35tpeakbpd:peakexercisebloodpressure(secondof2parts)
36dummy
37trestbpd:restingbloodpressure
38exang:exerciseinducedangina(1=yes0=no)
39xhypo:(1=yes0=no)
40oldpeak=STdepressioninducedbyexerciserelativetorest
41slope:theslopeofthepeakexerciseSTsegment
Value1:upsloping
Value2:flat
Value3:downsloping
42rldv5:heightatrest
43rldv5e:heightatpeakexercise
44ca:numberofmajorvessels(03)coloredbyflourosopy
45restckm:irrelevant
46exerckm:irrelevant
47restef:restraidonuclid(sp?)ejectionfraction
48restwm:restwall(sp?)motionabnormality
0=none
1=mildormoderate
2=moderateorsevere
3=akinesisordyskmem(sp?)
49exeref:exerciseradinalid(sp?)ejectionfraction
50exerwm:exercisewall(sp?)motion
51thal:3=normal6=fixeddefect7=reversabledefect
52thalsev:notused
53thalpul:notused
54earlobe:notused
55cmo:monthofcardiaccath(sp?)(perhaps"call")
56cday:dayofcardiaccath(sp?)
57cyr:yearofcardiaccath(sp?)
58num:diagnosisofheartdisease(angiographicdiseasestatus)
Value0:<50%diameternarrowing
Value1:>50%diameternarrowing
(inanymajorvessel:attributes59through68arevessels)
59lmt
60ladprox
61laddist
62diag
63cxmain
64ramus
65om1
66om2
67rcaprox
68rcadist
69lvx1:notused
70lvx2:notused
71lvx3:notused
72lvx4:notused
73lvf:notused
74cathef:notused
75junk:notused
76name:lastnameofpatient(Ireplacedthiswiththedummystring"name")

RelevantPapers:
Detrano,R.,Janosi,A.,Steinbrunn,W.,Pfisterer,M.,Schmid,J.,Sandhu,S.,Guppy,K.,Lee,S.,&Froelicher,V.
(1989).Internationalapplicationofanewprobabilityalgorithmforthediagnosisofcoronaryarterydisease.American
JournalofCardiology,64,304310.
[WebLink]

DavidW.Aha&DennisKibler."InstancebasedpredictionofheartdiseasepresencewiththeClevelanddatabase."
[WebLink]

Gennari,J.H.,Langley,P,&Fisher,D.(1989).Modelsofincrementalconceptformation.ArtificialIntelligence,40,11
https://archive.ics.uci.edu/ml/datasets/Heart+Disease 3/7
1/20/2017 UCIMachineLearningRepository:HeartDiseaseDataSet
61.
[WebLink]

PapersThatCiteThisDataSet1:

ZhiHuaZhouandYuanJiang.NeC4.5:NeuralEnsembleBasedC4.5.IEEETrans.Knowl.DataEng,16.2004.[View
Context].

RemcoR.BouckaertandEibeFrank.EvaluatingtheReplicabilityofSignificanceTestsforComparingLearning
Algorithms.PAKDD.2004.[ViewContext].

XiaoyongChaiandLiDengandQiangYangandCharlesX.Ling.TestCostSensitiveNaiveBayesClassification.
ICDM.2004.[ViewContext].

GavinBrown.DiversityinNeuralNetworkEnsembles.TheUniversityofBirmingham.2004.[ViewContext].

KaizhuHuangandHaiqinYangandIrwinKingandMichaelR.LyuandLaiwanChan.BiasedMinimaxProbability
MachineforMedicalDiagnosis.AMAI.2004.[ViewContext].

JeroenEggermontandJoostN.KokandWalterA.Kosters.GeneticProgrammingfordataclassification:partitioning
thesearchspace.SAC.2004.[ViewContext].

DavidPageandSoumyaRay.Skewing:AnEfficientAlternativetoLookaheadforDecisionTreeInduction.IJCAI.
2003.[ViewContext].

JinyanLiandLimsoonWong.UsingRulestoAnalyseBiomedicalData:AComparisonbetweenC4.5andPCL.
WAIM.2003.[ViewContext].

YuanJiangZhiandHuaZhouandZhaoqianChen.RuleLearningbasedonNeuralNetworkEnsemble.Proceedingsof
theInternationalJointConferenceonNeuralNetworks.2002.[ViewContext].

BabackMoghaddamandGregoryShakhnarovich.BoostedDyadicKernelDiscriminants.NIPS.2002.[ViewContext].

ThomasMelluishandCraigSaundersandIliaNouretdinovandVolodyaVovkandCarolS.SaundersandI.
NouretdinovV..Thetypicalnessframework:acomparisonwiththeBayesianapproach.DepartmentofComputer
Science.2001.[ViewContext].

RobertBurbidgeandMatthewTrotterandBernardF.BuxtonandSeanB.Holden.STARSparsitythroughAutomated
Rejection.IWANN(1).2001.[ViewContext].

PeterL.HammerandAlexanderKoganandBrunoSimeoneandSandorSzedm'ak.RutcorResearchReport.
RutgersCenterforOperationsResearchRutgersUniversity.2001.[ViewContext].

RudySetionoandWeeKhengLeow.FERNN:AnAlgorithmforFastExtractionofRulesfromNeuralNetworks.Appl.
Intell,12.2000.[ViewContext].

KristinP.BennettandAyhanDemirizandJohnShaweTaylor.AColumnGenerationAlgorithmForBoosting.ICML.
2000.[ViewContext].

ThomasG.Dietterich.AnExperimentalComparisonofThreeMethodsforConstructingEnsemblesofDecisionTrees:
Bagging,Boosting,andRandomization.MachineLearning,40.2000.[ViewContext].

LorneMasonandPeterL.BartlettandJonathanBaxter.ImprovedGeneralizationThroughExplicitOptimizationof
Margins.MachineLearning,38.2000.[ViewContext].

EndreBorosandPeterHammerandToshihideIbarakiandAlexanderKoganandEddyMayorazandIlyaB.Muchnik.
AnImplementationofLogicalAnalysisofData.IEEETrans.Knowl.DataEng,12.2000.[ViewContext].

PetriKontkanenandPetriMyllymandTomiSilanderandHenryTirriandPeterGr.Onpredictivedistributionsand
Bayesiannetworks.DepartmentofComputerScience,StanfordUniversity.2000.[ViewContext].

IakiInzaandPedroLarraagaandBasilioSierraandRamonEtxeberriaandJoseAntonioLozanoandJosManuel
Pea.RepresentingthebehaviourofsupervisedclassificationlearningalgorithmsbyBayesiannetworks.Pattern
https://archive.ics.uci.edu/ml/datasets/Heart+Disease 4/7
1/20/2017 UCIMachineLearningRepository:HeartDiseaseDataSet

RecognitionLetters,20.1999.[ViewContext].

YoavFreundandLorneMason.TheAlternatingDecisionTreeLearningAlgorithm.ICML.1999.[ViewContext].

JinyanLiandXiuzhenZhangandGuozhuDongandKotagiriRamamohanaraoandQunSun.EfficientMiningofHigh
ConfidienceAssociationRuleswithoutSupportThresholds.PKDD.1999.[ViewContext].

ChunNanHsuandHilmarSchuschelandYaTingYang.TheANNIGMAWrapperApproachtoNeuralNetsFeature
SelectionforKnowledgeDiscoveryandDataMining.InstituteofInformationScience.1999.[ViewContext].

KaiMingTingandIanH.Witten.IssuesinStackedGeneralization.J.Artif.Intell.Res.(JAIR,10.1999.[View
Context].

RudySetionoandHuanLiu.NeuroLinear:Fromneuralnetworkstoobliquedecisionrules.Neurocomputing,17.1997.
[ViewContext].

.PrototypeSelectionforCompositeNearestNeighborClassifiers.DepartmentofComputerScienceUniversityof
Massachusetts.1997.[ViewContext].

IgorKononenkoandEdvardSimecandMarkoRobnikSikonja.OvercomingtheMyopiaofInductiveLearning
AlgorithmswithRELIEFF.Appl.Intell,7.1997.[ViewContext].

JanC.BiochandD.MeerandRobPotharst.BivariateDecisionTrees.PKDD.1997.[ViewContext].

D.RandallWilsonandRoelMartinez.MachineLearning:ProceedingsoftheFourteenthInternationalConference,
Morgan.InFisher.1997.[ViewContext].

PedroDomingos.ControlSensitiveFeatureSelectionforLazyLearners.Artif.Intell.Rev,11.1997.[ViewContext].

FlorianaEspositoandDonatoMalerbaandGiovanniSemeraro.AComparativeAnalysisofMethodsforPruning
DecisionTrees.IEEETrans.PatternAnal.Mach.Intell,19.1997.[ViewContext].

KamalAliandMichaelJ.Pazzani.ErrorReductionthroughLearningMultipleDescriptions.MachineLearning,24.1996.
[ViewContext].

RonKohavi.ThePowerofDecisionTables.ECML.1995.[ViewContext].

RonKohaviandDanSommerfield.FeatureSubsetSelectionUsingtheWrapperMethod:OverfittingandDynamic
SearchSpaceTopology.KDD.1995.[ViewContext].

PeterD.Turney.CostSensitiveClassification:EmpiricalEvaluationofaHybridGeneticDecisionTreeInduction
Algorithm.CoRR,csAI/9503102.1995.[ViewContext].

GaborMelli.ALazyModelBasedApproachtoOnLineClassification.UniversityofBritishColumbia.1989.[View
Context].

WlodzislandRafalAdamczakandKrzysztofGrabczewskiandGrzegorzZal.Ahybridmethodforextractionoflogical
rulesfromdata.DepartmentofComputerMethods,NicholasCopernicusUniversity.[ViewContext].

Wlodzisl/awDuchandKarolGrudzinski.Searchandglobalminimizationinsimilaritybasedmethods.Departmentof
ComputerMethods,NicholasCopernicusUniversity.[ViewContext].

RudySetionoandWeeKhengLeow.Generatingrulesfromtrainednetworkusingfastpruning.SchoolofComputing
NationalUniversityofSingapore.[ViewContext].

ElenaSmirnovaandIdaG.SprinkhuizenKuyperandI.Nalbantisandb.ERIMandUniversiteitRotterdam.Unanimous
VotingusingSupportVectorMachines.IKAT,UniversiteitMaastricht.[ViewContext].

KristaLagusandEsaAlhoniemiandJeremiasSeppaandAnttiHonkelaandArnoWagner.INDEPENDENT
VARIABLEGROUPANALYSISINLEARNINGCOMPACTREPRESENTATIONSFORDATA.NeuralNetworks
ResearchCentre,HelsinkiUniversityofTechnology.[ViewContext].

ChiranjibBhattacharyyaandPannagadattaK.SandAlexanderJ.Smola.ASecondorderConeProgramming
FormulationforClassifyingMissingData.DepartmentofComputerScienceandAutomationIndianInstituteof
Science.[ViewContext].

AyhanDemirizandKristinP.Bennett.Chapter1OPTIMIZATIONAPPROACHESTOSEMISUPERVISEDLEARNING.
DepartmentofDecisionSciencesandEngineeringSystems&DepartmentofMathematicalSciences,Rensselaer
PolytechnicInstitute.[ViewContext].

https://archive.ics.uci.edu/ml/datasets/Heart+Disease 5/7
1/20/2017 UCIMachineLearningRepository:HeartDiseaseDataSet

AdilM.BagirovandJohnYearwood.Anewnonsmoothoptimizationalgorithmforclustering.CentreforInformaticsand
AppliedOptimization,SchoolofInformationTechnologyandMathematicalSciences,UniversityofBallarat.[View
Context].

AdilM.BagirovandAlexRubinovandA.N.SoukhojakandJohnYearwood.Unsupervisedandsuperviseddata
classificationvianonsmoothandglobaloptimization.SchoolofInformationTechnologyandMathematicalSciences,
TheUniversityofBallarat.[ViewContext].

KristinP.BennettandErinJ.Bredensteiner.GeometryinLearning.DepartmentofMathematicalSciencesRensselaer
PolytechnicInstitute.[ViewContext].

BruceH.Edmonds.UsingLocalised`Gossip'toStructureDistributedLearning.CentreforPolicyModelling.[View
Context].

RafaelS.ParpinelliandHeitorS.LopesandAlexAlvesFreitas.PARTFOUR:ANTCOLONYOPTIMIZATIONAND
IMMUNESYSTEMSChapterXAnAntColonyAlgorithmforClassificationRuleDiscovery.CEFETPR,Curitiba.
[ViewContext].

Wl/odzisl/awDuchandKarolGrudzinskiandGeerdH.FDiercksen.Minimaldistanceneuralmethods.Departmentof
ComputerMethods,NicholasCopernicusUniversity.[ViewContext].

JohnG.ClearyandLeonardE.Trigg.ExperienceswithOB1,AnOptimalBayesDecisionTreeLearner.Departmentof
ComputerScienceUniversityofWaikato.[ViewContext].

GlennFungandSathyakamaSandilyaandR.BharatRao.RuleextractionfromLinearSupportVectorMachines.
ComputerAidedDiagnosis&Therapy,SiemensMedicalSolutions,Inc.[ViewContext].

AyhanDemirizandKristinP.BennettandJohnShaweandI.NouretdinovV..LinearProgrammingBoostingviaColumn
Generation.Dept.ofDecisionSciencesandEng.Systems,RensselaerPolytechnicInstitute.[ViewContext].

ZhiHuaZhouandXuYingLiu.TrainingCostSensitiveNeuralNetworkswithMethodsAddressingtheClass
ImbalanceProblem.[ViewContext].

LipingWeiandRussB.Altman.AnAutomatedSystemforGeneratingComparativeDiseaseProfilesandMaking
Diagnoses.SectiononMedicalInformaticsStanfordUniversitySchoolofMedicine,MSOBX215.[ViewContext].

FedericoDivinaandElenaMarchiori.HandlingContinuousAttributesinanEvolutionaryInductiveLearner.Department
ofComputerScienceVrijeUniversiteit.[ViewContext].

RonKohaviandGeorgeH.John.AutomaticParameterSelectionbyMinimizingEstimatedError.ComputerScience
Dept.StanfordUniversity.[ViewContext].

H.TLinandC.JLin.AStudyonSigmoidKernelsforSVMandtheTrainingofnonPSDKernelsbySMOtype
Methods.DepartmentofComputerScienceandInformationEngineeringNationalTaiwanUniversity.[ViewContext].

AlexanderK.Seewald.DissertationTowardsUnderstandingStackingStudiesofaGeneralEnsembleLearningScheme
ausgefuhrtzumZweckederErlangungdesakademischenGradeseinesDoktorsdertechnischen
Naturwissenschaften.[ViewContext].

CitationRequest:

Theauthorsofthedatabaseshaverequestedthatanypublicationsresultingfromtheuseofthedataincludethenames
oftheprincipalinvestigatorresponsibleforthedatacollectionateachinstitution.Theywouldbe:
1.HungarianInstituteofCardiology.Budapest:AndrasJanosi,M.D.
2.UniversityHospital,Zurich,Switzerland:WilliamSteinbrunn,M.D.
3.UniversityHospital,Basel,Switzerland:MatthiasPfisterer,M.D.
4.V.A.MedicalCenter,LongBeachandClevelandClinicFoundation:RobertDetrano,M.D.,Ph.D.

[1]Paperswereautomaticallyharvestedandassociatedwiththisdataset,incollaborationwithRexa.info

SupportedBy: InCollaborationWith:

https://archive.ics.uci.edu/ml/datasets/Heart+Disease 6/7
1/20/2017 UCIMachineLearningRepository:HeartDiseaseDataSet

About||CitationPolicy||DonationPolicy||Contact||CML

https://archive.ics.uci.edu/ml/datasets/Heart+Disease 7/7

S-ar putea să vă placă și