1. Abstract:
This report explores the current use of data mining and machine learning in the field of
software engineering. It draws on several papers published in ACM and IEEE journals and
conference proceedings.
2. Introduction:
Data mining (DM) and machine learning (ML) are widely used in software engineering. They
appear in every phase of the software life cycle: requirements gathering, design,
development, testing and maintenance. More broadly, DM and ML have many applications,
including natural language processing, syntactic pattern recognition, search engines,
speech and handwriting recognition, object recognition in computer vision, and game
playing.
3. Discussion:
This section discusses some of the roles that data mining and machine learning play in
software engineering. The roles of DM and ML are treated separately.
4. Conclusion:
We have seen how ML is used in prediction and estimation, property and model discovery,
transformation, generation and synthesis, reuse, requirements acquisition, and management
of development knowledge. We have also seen that data mining plays an important role in
defect detection and correction, mining reuse patterns, bug finding, and classifying
software failures.