Journal of Machine and Computing


Experimental Evaluation and Approach of Enhancement in Generation of Automatic Unsupervised Extractive Text Summarization of Marathi Text By Using Machine Learning Algorithm



Journal of Machine and Computing

Received On : 10 December 2021

Revised On : 25 December 2021

Accepted On : 30 December 2021

Published On : 05 January 2022

Volume 02, Issue 01

Pages : 026-032


Abstract


The Text summarization has immense importance in the arena of Natural Language Processing. The summarization can be done in two ways: abstractive and extractive text summarization. Here, the language is a vast query, because we have large number of national and international languages. By studying the literature, the Marathi language is chosen for this work and we are trying to make a framework where all the students giving competent examinations can have summarized Marathi e-news articles by using extractive text summarization. For this objective we have used TextRank algorithm which has proven very effective for different languages. This paper demonstrates the summarization techniques on Marathi e-news articles using Gensim Library of TextRank algorithm and comparative analysis of summaries generated by using both TextRank and ROUGH method.


Keywords


Machine Learning approach, unsupervised method, Gensim, Extractive text summarization, TextRank algorithm, Ratio, Wordcount


  1. Apurva D. Dhawale, Sonali B. Kulkarni, Vaishali M. Kumbhakarna, “Automatic Pre-Processing of Marathi Text for Summarization”, International Journal of Engineering and Advanced Technology (IJEAT) ISSN: 2249-8958, Volume-10 Issue-1, October 2020.
  2. Apurva D. Dhawale, Sonali B. Kulkarni, Vaishali M. Kumbhakarna, “Automatic Unsupervised Extractive Summarization of Marathi Text Using Natural Language Processing”, IOSR Journal of Computer Engineering (IOSR-JCE) e-ISSN: 2278-0661,p-ISSN: 2278-8727, Volume 22, Issue 6, Ser. II, PP 21-25, Nov. – Dec. 2020.
  3. Dhawale A.D., Kulkarni S.B., Kumbhakarna V.M. (2021)” A Survey of Distinctive Prominence of Automatic Text Summarization Techniques Using Natural Language Processing”. In: Raj J.S. (eds) International Conference on Mobile Computing and Sustainable Informatics. ICMCSI 2020.
  4. Divyanshu Kakwani, Anoop Kunchukuttan, Satish Golla, Gokul N.C., Avik Bhattacharyya, Mitesh M. Khapra, Pratyush Kumar, “IndicNLPSuite: Monolingual Corpora, Evaluation Benchmarks and Pre-trained Multilingual Language Models for Indian Languages”, Findings of the Association for Computational Linguistics: EMNLP 2020, pages 4948–4961. Association for Computational Linguistics, November 16 - 20, 2020.
  5. Dhawale A.D., Kulkarni S.B., Kumbhakarna V.M., “Survey of Progressive Era of Text Summarization for Indian and Foreign Languages Using Natural Language Processing”, In: Raj J., Bashar A., Ramson S. (eds) Innovative Data Communication Technologies and Application. ICIDCA 2019. Lecture Notes on Data Engineering and Communications Technologies, vol 46. Springer, Cham.
  6. https://github.com/chiragsanghvi/TextSummarizer
  7. Y. Chen and Q. Song, "News Text Summarization Method based on BART-TextRank Model," 2021 IEEE 5th Advanced Information Technology, Electronic and Automation Control Conference (IAEAC), Chongqing, China, 2021, pp. 2005-2010, doi: 10.1109/IAEAC50856.2021.9390683.
  8. Anishka Chaudhari, Akash Dole, Deepali Kadam, “Marathi text summarization using neural networks”, International Journal of Advance Research and Development(IJARnD), Volume 4, Issue 11,2019.
  9. Prafulla B. Bafna, Jatinderkumar R. Saini, “Marathi Text Analysis using Unsupervised Learning and Word Cloud”, International Journal of Engineering and Advanced Technology (IJEAT), ISSN: 2249 – 8958, Volume-9 Issue-3, February, 2020.
  10. Abdelkrime Aries Djamel eddine Zegour Walid Khaled Hidouci, “Automatic text summarization: What has been done and what has to be done”, ArXiv Journal, volume abs/1904.00688, 1st April 2019
  11. Pradeepika Verma, Anshul Verma, “Accountability of NLP Tools in Text Summarization for Indian Languages”, Journal of Scientific Research, Institute of Science, Banaras Hindu University, Varanasi, India, Volume 64, Issue 1, 2020.
  12. https://tedboy.github.io/nlps/generated/generated/gensim.summarization.summarize.html#:~:text=summarize(),-gensim.summarization.&text=Returns%20a%20summarized%20version%20of,be%20given%20as%20a%20string.
  13. https://www.tutorialspoint.com/gensim/gensim_introduction.htm
  14. Christopher D. Manning, Prabhakar Raghavan, H.S.: Introduction to Information Retrieval. Cambridge University Press (2008).
  15. Apurva D. Dhawale, Sonali B. Kulkarni, Vaishali M. Kumbhakarna, “A Machine Learning Approach for Automatic Unsupervised Extractive Summarization of Marathi Text”, International Journal of Creative Research Thoughts (IJCRT), Volume 8, Issue 11 | ISSN: 2320-2882, November 2020.

Acknowledgements


The author(s) received no financial support for the research, authorship, and/or publication of this article.


Funding


Authors thanks to Department of Computer Science and Information Technology for this research support.


Ethics declarations


Conflict of interest

The authors have no conflicts of interest to declare that are relevant to the content of this article.


Availability of data and materials


No data available for above study.


Author information


Contributions

All authors have equal contribution in the paper and all authors have read and agreed to the published version of the manuscript.


Corresponding author


Rights and permissions


Open Access This article is licensed under a Creative Commons Attribution NoDerivs is a more restrictive license. It allows you to redistribute the material commercially or non-commercially but the user cannot make any changes whatsoever to the original, i.e. no derivatives of the original work. To view a copy of this license, visit https://creativecommons.org/licenses/by-nc-nd/4.0/


Cite this article


Apurva D. Dhawale, Sonali B. Kulkarni, Vaishali M. Kumbhakarna, “Experimental Evaluation and Approach of Enhancement in Generation of Automatic Unsupervised Extractive Text Summarization of Marathi Text By Using Machine Learning Algorithm”, Journal of Machine and Computing, vol.2, no.1, pp. 026-032, January 2022. doi: 10.53759/7669/jmc202202004.


Copyright


© 2022 Apurva D. Dhawale, Sonali B. Kulkarni, Vaishali M. Kumbhakarna. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.