A Survey On Parallelization Of Data Mining Techniques
2013
Abstract
This paper contains the overview of various parallelization techniques to improve the performance of existing data mining algorithms and make the capable of handling large amount of data. There are variety of techniques to achieve the parallelization in data mining field, in this paper a brief introduction to few of the popular techniques is presented. The second part of this paper contains information regarding various data algorithms that are proposed by various authors based on these techniques. In Introduction various results corresponding to a survey are provided.
References (11)
- "5th Annual Data Miner Survey -2011 Survey Summary Report", Rexer Analytics, Karl Rexer, PhD krexer@RexerAnalytics.com www.RexerAnalytics.com
- "Data mining on grids" Maarten Altorf, maltorf@yahoo.com, Universiteit Leiden - August 2007.
- D. Talia, P. Trunfio, O. Verta. Weka4WS: a WSRF-enabled Weka Toolkit for Distributed Data Mining on Grids. Proc. of the 9th European Conference on Principles and Practice of Knowledge Discovery in Databases (PKDD 2005), Porto, Portugal, October 2005, LNAI vol. 3721, pp. 309- 320, Springer-Verlag, 2005.
- Juan Li, Pallavi Roy, Samee U. Khan, Lizhe Wang, Yan Bai "Data Mining Using Clouds: An Experimental Implementation of Apriori over MapReduce".
- "Mining Association Rules In Cloud" By Pallavi Roy
- Y. Yorozu, M. Hirano, K. Oka, and Y. Tagawa, "Electron spectroscopy studies on magneto-optical media and plastic substrate interface," IEEE Transl. J. Magn. Japan, vol. 2, pp. 740-741, August 1987 [Digests 9th Annual Conf. Magnetics Japan, p. 301, 1982].
- M. Young, The Technical Writer's Handbook. Mill Valley, CA: University Science, 1989.
- D. Kornack and P. Rakic, "Cell Proliferation without Neurogenesis in Adult Primate Neocortex," Science, vol. 294, Dec. 2001, pp. 2127-2130, doi:10.1126/science.1065467.
- H. Goto, Y. Hasegawa, and M. Tanaka, "Efficient Scheduling Focusing on the Duality of MPL Representatives," Proc. IEEE Symp. Computational Intelligence in Scheduling (SCIS 07), IEEE Press, Dec. 2007, pp. 57-64, doi:10.1109/SCIS.2007.357670.
- "Data Mining Using Clouds: An Experimental Implementation of Apriori over MapReduce" Juan Li, Pallavi Roy, Samee U. Khan, Lizhe Wang, Yan Bai,North Dakota State University, Fargo, USA
- Wenbin Fang, Ka Keung Lau, Mian Lu, Xiangye Xiao, Chi Kit Lam, Philip Yang Yang, "Parallel Data Mining on Graphics Processors", Technical Report HKUSTCS0807,Oct 2008.