Academia.eduAcademia.edu

Outline

An investigation of machine learning based prediction systems

2000, Journal of Systems and …

Abstract
sparkles

AI

This research investigates the effectiveness of various machine learning (ML) techniques in predicting software development effort, specifically focusing on accuracy, explanatory value, and configurability. The study finds that artificial neural networks (ANNs) offer the highest accuracy but require significant configuration effort, while rule induction (RI) methods are the least accurate and also less configurable. A comparative analysis of ANNs, case-based reasoning (CBR), and regression methods reveals substantial variations in performance contingent on the data used, indicating the need for improved models that can adapt to heterogeneous software environments.

FAQs

sparkles

AI

What is the comparative accuracy of ML techniques for software effort prediction?add

The study finds that Artificial Neural Networks (ANNs) outperform Case-Based Reasoning (CBR) and Least Squares Regression (LSR), achieving the highest accuracy, whereas Rule Induction (RI) is consistently the least accurate.

How does feature pruning impact prediction accuracy in ML models?add

Pruning significantly enhances the performance of both CBR and RI techniques; for instance, RI shows a marked improvement from 86% to 41% accuracy when redundant features are removed.

What advantages do Case-Based Reasoning systems have over other techniques?add

CBR systems, such as ANGEL, demonstrate effective functionality even with small datasets and provide greater user collaboration, aiding in better estimation precision compared to algorithmic methods.

What challenges exist in configuring Artificial Neural Networks for effort predictions?add

Configurations of ANNs can hinder their superior accuracy benefits; misconfigured networks showed reduced prediction performance due to complexities in determining appropriate heuristics for activation functions.

How do ML prediction techniques engage end-users during software project estimation?add

The paper reveals that user involvement with prediction systems may improve both accuracy and user confidence in estimations, highlighting the importance of interactive feedback mechanisms.

References (27)

  1. Aarmodt, A. and Plaza, E. (1994). Case-Based Reasoning:Foundational issues, Methodical Variations and System Approaches. AI Communications, vol 7(1).
  2. Aha, W.D. (1991). Case-Based Learning Algorithms. in 1991 DARPA Case-Based Reasoning Workshop. Morgan Kaufmann.
  3. Bisio, R. and Malabocchia, F. (1995). Cost Estimation of Software Projects Through Case Based Reasoning. in International Conference on Case Based Reasoning. Sesimbra, Portugal.
  4. Boehm, B.W. (1981). Software Engineering Economics. New York: Prentice Hall.
  5. Conte, S., Dunsmore, H.E., and Shen, V.Y. (1986). Software Engineering Metrics and Models. Benjamin/Cummings.
  6. Debuse, J.C.W. and V.J. Rayward-Smith (1997), 'Feature subset selection within a simulated annealing data mining algorithm', J. of Intelligent Information Systems, 9, pp57-81, 1997.
  7. Desharnais, J.M., (1989). Analyse statistique de la productivitie des projets informatique a partie de la technique des point des fonction. Unpublished Masters Thesis, University of Montreal.
  8. Gray, A.R. and MacDonell, S.G. (1997). 'A comparison of techniques for developing predictive models of software metrics', Information & Softw. Technol., 39(6), pp425-437, 1997.
  9. Hegazy, T. and Moselhi, O. (1994). Analogy Based Solution To Markup Estimation Problem. Journal of Computing in Civil Engineering, vol 8(1), pp72-87.
  10. Jorgensen, M. (1995). Experience With the Accuracy of Software Maintenance Task Effort Prediction Models. IEEE Transactions on Software Engineering, vol 21(8), pp674-681.
  11. Karunanithi, N., Whitley,. and Malaiya, Y.K. (1992). Using Neural Networks in Reliabilty Prediction. IEEE Software, vol 9(4), pp53 -59.
  12. Kemerer, C.F. (1987). An Empirical Validation of Cost Estimation Models. Comms. Of the ACM, 30 (5): 416-429.
  13. Kennedy, H.C., Chinniah, C., Bradbeer, P. and Morss, L. (1997). The Construction and Evaluation of Decision Trees: A Comparison of Evolutionary and Concept Learning Methods in Corne, D. and Shapiro, J. L. (Eds.). (1997) Evolutionary Computing. AISB International Workshop, Manchester, UK April 7-8, 1997. Springer-Verlag, LNCS 1305.
  14. Kok, P., B.A. Kitchenham, and J. Kirakowski. (1990) The MERMAID approach to software cost estimation, in Proc. Esprit Technical Week.
  15. Leake, D. (1996). Case-Based Reasoning: Experiences Lessons and Future Directions. AAAI Press: Menlo Park.
  16. Narendra, K.S. and Parthasarathy, K. (1987). Identification and control of dynamical systems using neural networks. IEEE Transactions on Neural Networks, vol. 1. pp4-27.
  17. Rich, E. and Knight, K. (1995). Artificial Intelligence. McGraw-Hill.
  18. Schank, R. (1982). Dynamic Memory: A theory of reminding and learning in computers and people. Cambridge University Press.
  19. Sejnowski, T.J. and Rosenberg, C.R. (1987). Parallel networks that learn to pronounce English text. Complex Systems, vol 1pp145-168.
  20. Shepperd, M.J., and Schofield, C. (1997). Estimating software project effort using analogies. IEEE Transactions on Software Engineering, 23(11), 736-743.
  21. Srinivasan, K. and Fisher,D. (1995). Machine Learning Approaches to Estimating SoftwareDevelopment Effort. IEEE Transactions on Software Engineering, vol. 21(2), pp126-136.
  22. Stensrud, E. and Myrtveit. I. (1998). Human performance estimating with analogy and regression models: an empirical validation, in Proc. 5th Intl. Metrics Symp. Bethesda, MD: IEEE Computer Society.
  23. Venkatachalam, A.R. (1993). Software Cost Estimation Using Artificial Neural Networks. In International Joint Conference on Neural Networks. Nagoya: IEEE.
  24. Vincinanza, S. and Prietula, M.J. (1990). Case Based Reasoning In Software Effort Estimation. In Proceddings 11th Int Conf on Information Systems.
  25. Walczak, S. and N. Cerpa, (1999) 'Heuristic principles for the design of artificial neural networks', Information & Softw. Technol., 41(2), pp107-117, 1999.
  26. Watson, I. and Marir, F. (1994). Case-Based Reasoning: A Review. The Knowledge Engineering Review, vol 9(4), pp327-354.
  27. Wittig, G. and Finnie, G. (1997). Estimating Software Development Effort with Connectionist Models. Information and Software Technology, vol 39pp469 -476.