Academia.eduAcademia.edu

Outline

Overview of the CL-SciSumm 2016 Shared Task

2016

Abstract

The CL-SciSumm 2016 Shared Task is the first medium-scale shared task on scientific document summarization in the computational linguistics (CL) domain. The task built off of the experience and training data set created in its namesake pilot task, which was conducted in 2014 by the same organizing committee. The track included three tasks involving: (1A) identifying relationships between citing documents and the referred document, (1B) classifying the discourse facets, and (2) generating the abstractive summary. The dataset comprised 30 annotated sets of citing and reference papers from the open access research papers in the CL domain. This overview paper describes the participation and the official results of the second CL-SciSumm Shared Task, organized as a part of the Joint Workshop onBibliometric-enhanced Information Retrieval and Natural Language Processing for Digital Libraries (BIRNDL 2016), held in New Jersey,USA in June, 2016. The annotated dataset used for this shared task...

References (25)

  1. Aggarwal, P., Sharma, R.: Lexical and Syntactic cues to identify Reference Scope of Citance. In: Proc. of the Joint Workshop on Bibliometric-enhanced Information Retrieval and Natural Language Processing for Digital Libraries (BIRNDL2016). pp. 103-112. Newark, NJ, USA (June 2016)
  2. Cao, Z., Li, W., Wu, D.: PolyU at CL-SciSumm 2016. In: Proc. of the Joint Work- shop on Bibliometric-enhanced Information Retrieval and Natural Language Pro- cessing for Digital Libraries (BIRNDL2016). pp. 132-138. Newark, NJ, USA (June 2016)
  3. Carbonell, J., Goldstein, J.: The use of MMR, diversity-based reranking for re- ordering documents and producing summaries. In: 21st annual international ACM SIGIR conference on Research and development in information retrieval. pp. 335- 336. Association of Computational Linguistics (1998)
  4. Conroy, J., Davis, S.: Vector space and language models for scientific document summarization. In: NAACL-HLT. pp. 186-191. Association of Computational Lin- guistics, Newark, NJ, USA (2015)
  5. Drouin, P.: Extracting a bilingual transdisciplinary scientific lexicon. In: eLexicog- raphy in the 21st century: new challenges, new applications. pp. 43-53. Louvain- la-Neuve: Presses Universitaires de Louvain (2010)
  6. Hoang, C., Kan, M.: Towards automated related work summarization. In: Proc. of COLING: Posters. pp. 427-435. ACL (2010)
  7. Jaidka, K., Chandrasekaran, M.K., Elizalde, B.F., Jha, R., Jones, C., Kan, M.Y., Khanna, A., Molla-Aliod, D., Radev, D.R., Ronzano, F., et al.: The Computational Linguistics Summarization Pilot Task. In: Proceedings of Text Analysis Confer- ence. Gaithersburg, USA (2014)
  8. Jaidka, K., Khoo, C.S., Na, J.C.: Deconstructing human literature reviews-a frame- work for multi-document summarization. In: Proc. of ENLG. pp. 125-135 (2013)
  9. Jones, K.S.: Automatic summarising: The state of the art. Information Processing and Management 43(6), 1449-1481 (2007)
  10. Klampfl, S., Rexha, A., Kern, R.: Identifying Referenced Text in Scientific Pub- lications by Summarisation and Classification Techniques. In: Proc. of the Joint Workshop on Bibliometric-enhanced Information Retrieval and Natural Language Processing for Digital Libraries (BIRNDL2016). pp. 122-131. Newark, NJ, USA (June 2016)
  11. Li, L., Mao, L., Zhang, Y., Chi, J., Huang, T., Cong, X., Peng, H.: CIST System for CL-SciSumm 2016 Shared Task. In: Proc. of the Joint Workshop on Bibliometric- enhanced Information Retrieval and Natural Language Processing for Digital Li- braries (BIRNDL2016). pp. 156-167. Newark, NJ, USA (June 2016)
  12. Lin, C.Y.: Rouge: A package for automatic evaluation of summaries. Text summa- rization branches out: Proceedings of the ACL-04 workshop 8 (2004)
  13. Lu, K., Mao, J., Li, G., Xu, J.: Recognizing reference spans and classifying their discourse facets. In: Proc. of the Joint Workshop on Bibliometric-enhanced Information Retrieval and Natural Language Processing for Digital Libraries (BIRNDL2016). pp. 139-145. Newark, NJ, USA (June 2016)
  14. Malenfant, B., Lapalme, G.: RALI System Description for CL-SciSumm 2016
  15. Shared Task. In: Proc. of the Joint Workshop on Bibliometric-enhanced In- formation Retrieval and Natural Language Processing for Digital Libraries (BIRNDL2016). pp. 146-155. Newark, NJ, USA (June 2016)
  16. Mayr, P., Frommholz, I., Cabanac, G., Wolfram, D.: Editorial for the Joint Work- shop on Bibliometric-enhanced Information Retrieval and Natural Language Pro- cessing for Digital Libraries (BIRNDL) at JCDL 2016. In: Proc. of the Joint Work- shop on Bibliometric-enhanced Information Retrieval and Natural Language Pro- cessing for Digital Libraries (BIRNDL2016). pp. 1-5. Newark, NJ, USA (June 2016)
  17. Mihalcea, R., Corley, C., Strapparava, C.: Corpus-based and knowledge-based mea- sures of text semantic similarity. In: 21st national conference on Artificial Intelli- gence. pp. 775-780. AAAI (2006)
  18. Mohammad, S., Dorr, B., Egan, M., Hassan, A., Muthukrishan, P., Qazvinian, V., Radev, D.R., Zajic, D.: Using citations to generate surveys of scientific paradigms. In: Proc. of NAACL. pp. 584-592. ACL (2009)
  19. Moraes, L., Baki, S., Verma, R., Lee, D.: University of Houston at CL-SciSumm 2016: SVMs with tree kernels and Sentence Similarity. In: Proc. of the Joint Work- shop on Bibliometric-enhanced Information Retrieval and Natural Language Pro- cessing for Digital Libraries (BIRNDL2016). pp. 113-121. Newark, NJ, USA (June 2016)
  20. Nakov, P.I., Schwartz, A.S., Hearst, M.: Citances: Citation sentences for semantic analysis of bioscience text. In: Proceedings of the SIGIR'04 workshop on Search and Discovery in Bioinformatics. pp. 81-88 (2004)
  21. Nomoto, T.: NEAL: A neurally enhanced approach to linking citation and ref- erence. In: Proc. of the Joint Workshop on Bibliometric-enhanced Information Retrieval and Natural Language Processing for Digital Libraries (BIRNDL2016). pp. 168-174. Newark, NJ, USA (June 2016)
  22. Qazvinian, V., Radev, D.: Scientific paper summarization using citation summary networks. In: Proceedings of the 22nd International Conference on Computational Linguistics-Volume 1. pp. 689-696. ACL (2008)
  23. Saggion, H.: SUMMA: A Robust and Adaptable Summarization Tool. Traitement Automatique des Langues 49(2), 103-125 (2002)
  24. Saggion, H., AbuRa'Ed, A., Ronzano, F.: Trainable Citation-enhanced Summa- rization of Scientific Articles. In: Proc. of the Joint Workshop on Bibliometric- enhanced Information Retrieval and Natural Language Processing for Digital Li- braries (BIRNDL2016). pp. 175-186. Newark, NJ, USA (June 2016)
  25. Teufel, S., Moens, M.: Summarizing scientific articles: experiments with relevance and rhetorical status. Computational Linguistics 28(4), 4099-445 (2002)