LSA Based Text Summarization
2020, International Journal of Recent Technology and Engineering (IJRTE)
https://doi.org/10.35940/IJRTE.B3288.079220Abstract
In this study we propose an automatic single document text summarization technique using Latent Semantic Analysis (LSA) and diversity constraint in combination. The proposed technique uses the query based sentence ranking. Here we are not considering the concept of IR (Information Retrieval) so we generate the query by using the TF-IDF(Term Frequency-Inverse Document Frequency). For producing the query vector, we identify the terms having the high IDF. We know that LSA utilizes the vectorial semantics to analyze the relationships between documents in a corpus or between sentences within a document and key terms they carry by producing a list of ideas interconnected to the documents and terms. LSA helps to represent the latent structure of documents. For selecting the sentences from the document Latent Semantic Indexing (LSI) is used. LSI helps to arrange the sentences with its score. Traditionally the highest score sentences have been chosen for summary but here we calculate the div...
References (23)
- A. Bellaachia, A. Mahajan, "Text Summary Using Latent Semantic Indexing and Information Retrieval Technique: Comparison of Four Strategies", In EGC 2004, vol. RNTI-E-2, pp.453-464.
- B. Baldwin and T.S. Morton. Dynamic coreference based summarization. In Proceedings of The Third Conference on Empirical Methods in Natural Language Processing (EMNLP3), Granada, Spain, June 1998.
- C. Buckley and et al.. The smart/empire tipster ir system. In Proceedings of TIPSTER Phase III Workshop. 1999.
- D. J. Gillick "The Elements of Automatic Summarization" Electrical Engineering and Computer Sciences, University of California, May 2011.
- D. McDonald et al., "Using Sentence-Sentence Heuristics to Rank Text Segments in TXTRACTOR, " MIS Dept., U. of Arizona, Tucson, AZ.
- D. Oluwajana," Single-Document summarization using Latent Semantic Analysis", DOI: 10.13140/RG.2.1.4075.6320.
- E. Hovy and C-Y. Lin, "automated text summarization and the SUMMARIST system" ,Information Sciences Institute of the University of Southern California, 1998.
- E.Hovy and C. Lin. Automated text summarization in summarist. In Proceedings of the TIPSTER Workshop, Baltimore, MD, 1998.
- G. Murray, S. Renals, J. Carletta "Extractive Summarization of Meeting Recordings", Centre for Speech Technology Research, University of Edinburgh, Scotland, 2005.
- G. Salton and C. Buckley" Term Weighting Approaches in automatic Text Retrieval", Department of Computer Science, Cornell University, New York, November 1987.
- G.W. Furnas, S.C. Deerwester, S.T , Dumais, T.K. Landauer, R.A. Harshman, L.A.Streeter, K.E. Lochbaum, Information retrieval using a singular value decomposition model of latent semantic structure. SIGIR Forum 51(2), 90-105 (2017).
- H. Daume III and D. Marcu "A Tree-Position Kernel for Document Compression Proceedings of the Document Understanding Conference", Boston, MA. May 6-7, 2004.
- H. P. Edmundson "New Methods in Automatic Extracting" Journal of the Association for Computing Machinery, Vol. 16, No. 2, April 1969.
- H.P. Luhn, The Automatic Creation of Literature Abstracts. in Maybury, M.T. ed. Advances in Automatic Text Summarization. The MIT Press, Cambridge, 1958, 15-22.
- J. Goldstain, M. Kantrowitz, V. Mittal and J. Carbonell. Summarizing text documents: Sentence selection and evaluation metrics. In Proceedings of ACM SIGIR ' 99, Berkeley, CA, Aug 1999.
- Jezek and Steinberger "Automatic Text Summarization" The State of Art 2008 and the challenges, Bratislava. 2008.
- K. Knight and M. Daniel "Statistical-Based Summarization One step: Sentence compression", Information sciences Institute and Department of Computer Sciences University of Southern California, 2002.
- M. Gülçin Özsoy "Text Summarization Using Latent Semantic Analysis" Graduate School of Natural and Applied Sciences, Middle East Technical University, Ankara.2011.
- R. Barzilay and M. Elhadad. Using lexical chains for text summarization", in Proceedings of the Workshop on Intelligent Scalable Text Summarization, Madrid, Spain, Aug. 1997.
- S. Mandal,G. K. Singh,A. Pal," PSO Based Text Summarization Approach Using Sentiment Analysis", Advances in Intelligent Systems and Computing ,Springer ,Vol 810,p.p.-845-854, 2019, https://doi.org/10.1007/978-981-13-1513-8_86.
- S. Mandal,G. K. Singh, A. Pal," Text Summarization Technique by Sentiment Analysis and Cuckoo Search Algorithm",Advances in Intelligent Systems and Computing, Springer , Vol 1025 ,p.p.- 357-366, 2020.
- T. Firmin, and M.J. Chrzanowski, An Evaluation of Automatic ,1999.
- Y. Gong and X. Liu. Generic Text Summarization Using Relevance Measure and latent Semantic Analysis. In Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval, pp. 19 -25, 2001.