Incorporating Wikis into software repository mining
2007
Abstract
Software engineering projects produce large volumes of information that are collected in specialized, decentralized repositories. Combining information from these sources may aid in gaining accurate knowledge about the arguments and alternatives (e.g. rationale) that factor into the decision making and design process. This information is captured in synchronized systems: version control systems (such as CVS or Subversion), defect tracking systems (such as Bugzilla or Trac), and Wikis and in unsynchronized systems: email discussion lists and Web-based discussion forums. Our proposed research path involves combining these synchronized and unsynchronized sources of information in order to advance the field of mining software repositories (MSR). In this paper we discuss plans for the integration of these five sources of information and how unsynchronized information might be used to reason about actions captured in the synchronized systems.
References (3)
- Klaus Berberich, Srikanta Bedathur, Thomas Neumann, and Gerhard Weikum. A time ma- chine for text search. In SIGIR '07: Proceed- ings of the 30th annual international ACM SIGIR conference on Research and development in infor- mation retrieval, pages 519-526, New York, NY, USA, 2007. ACM Press.
- Daniel M. German, Davor Cubranić, and Margaret-Anne D. Storey. A framework for de- scribing and understanding mining tools in soft- ware development. In MSR '05: Proceedings of the 2005 international workshop on Mining software repositories, pages 1-5, New York, NY, USA, 2005. ACM Press.
- J. Lee. Design rationale systems: understanding the issues. Expert, IEEE [see also IEEE Intelli- gent Systems and Their Applications], 12(3):78- 85, 1997.