Abstract
Abstract Software compiles and therefore is characterized by a parseable grammar. Natural language text rarely conforms to prescriptive grammars and therefore is much harder to parse. Mining parseable structures is easier than mining less structured entities. Therefore, most work on mining repositories focuses on software, not natural language text.
References (21)
- R. Baeza-Yates and B. Ribeiro-Neto. Modern Infor mation Retrieval. ACM Press, Addison-Wesley, 1999.
- B. Boehm. Software Engineering Economics. Prentice- Hall, Englewood Cliffs, NJ, 1981.
- P. Clark and T. Ng. The cn2 induction algorithm. Machine Learning, 3:261-283, 1989.
- S. Deerwester, S. T. Dumais, T. K. Landauer, G. W. Furnas, and R. A. Harshman. Indexing by latent se mantic analysis. Journal of the Society for Information Science, 41(6):391-407, 1990.
- G. Di Lucca, M. Di Penta, and S. Gradara. An ap proach to classify software maintenance requests. In Proc., International Conference on Software Mainte nance (ICSM), 2002.
- S. Easterbrook, R. R. Lutz, R. Covington, J. Kelly, Y. Ampo, and D. Hamilton. Experiences using lightweight formal methods for requirements modeling. IEEE Transactions on Software Engineering, pages 4- 14, 1998.
- M. Halstead. Elements of Software Science. Elsevier, 1977.
- J. Hayes. Risk reduction through requirements trac ing. In The Conference Proceedings of Software Quality Week, 1990.
- J. H. Hayes. Input validation testing: A system level, early lifecycle technique. In ICSE '97 Doctoral Con sortium, published in the Proceedings of the Seven teenth International Conference on Software Engineer ing Doctoral Consortium, May 1997.
- J. H. Hayes, A. Dekhtyar, and J. Osbourne. Improving requirements tracing via information retrieval. In In ternational Conference on Requirements Engineering, Monterey, California, pages 151-161, 2003.
- J. H. Hayes, A. Dekhtyar, S. Sundaram, and S. Howard. Helping analysts trace requirements: An objective look. In International Conference on Re quirements Engineering (RE'2004), 2004.
- J. H. Hayes and J. Offutt. Input validation testing: A requirements-driven, system level, early lifecycle tech nique. In Proceedings of the 11th International Con ference on Software Engineering and its Applications, October 1998.
- J. H. Hayes and J. Offutt. Increased software reliabil ity through input validation analysis and testing. In Proceedings of The Tenth IEEE International Sympo sium on Software Reliability Engineering, pages 199- 209, 1999.
- W. Hersh and P. Over. The trec-9 interactive track report. In Proc. Text Retrieval Conference (TREC-9), pages 41-50, 2000.
- G. Holzmann. The model checker SPIN. IEEE Trans actions on Software Engineering, 23(5):279-295, May 1997.
- T. Joachims. Text categorization with suport vector machines: Learning with many relevant features. In Proc. ECML, pages 137-142, 1998.
- B. Lee and B. Bryant. Contextual knowledge repre sentation for requirements documents in natural lan guage. In Proceedings of FLAIRS, the 15th Interna tional Florida Artificial Intelligence Research Sympo sium, 2002.
- T. McCabe. A complexity measure. IEEE Trans actions on Software Engineering, 2(4):308-320, Dec. 1976.
- T. Menzies, J. S. D. Stefano, C. Cunanan, and R. M. Chapman. Mining repositories to assist in project planning and resource allocation. In International Workshop on Mining Software Repositories (submit ted), 2004. Available from http://menzies.us/pdf/ 04msrdefects.pdf.
- T. Mundie and F. Hallsworth. Requirements analy sis using supertrace pc. In Proc. American Society of Mechanical Engineers (ASME) for Computers in Engi neering Symposium at the Energy and Environmental Expo, 1995.
- D. R. Smith. KIDS: A semiautomatic program devel opment system. IEEE Transactions on Software En gineering, 16(9):1024-1043, 1990.