Journal of Computing and Natural Science


A Review of Textual and Voice Processing Algorithms in the Field of Natural Language Processing



Journal of Computing and Natural Science

Received On : 02 November 2022

Revised On : 25 January 2023

Accepted On : 20 April 2023

Published On : 05 October 2023

Volume 03, Issue 04

Pages : 194-203


Abstract


Currently, there is a significant focus on natural language processing (NLP) within academic circles. As one of the initial domains of inquiry in the domain of machine learning, it has been utilized in a variety of significant sub-disciplines, such as text processing, speech recognition, and machine translation. Natural language processing has contributed to notable progress in computing and artificial intelligence. The recurrent neural network serves as a fundamental component for numerous techniques in domain of NLP. The present article conducts a comprehensive evaluation of various algorithms for processing textual and voice data, accompanied by illustrative instances of their functionality. Various algorithmic outcomes exhibit the advancements achieved in this field during the preceding decade. Our endeavor involved the classification of algorithms based on their respective types and expounding on the scope for future research in this domain. Furthermore, the study elucidates the potential applications of these heterogeneous algorithms and also evaluates the disparities among them through an analysis of the findings. Despite the fact that natural language processing has not yet achieved its ultimate objective of flawlessness, it is plausible that with sufficient exertion, the field will eventually attain it. Currently, a wide variety of artificial intelligence systems use natural language processing algorithms to comprehend human-spoken directions.


Keywords


Natural Language Processing, Machine Learning, Speech Recognition, Machine Translation, Text Processing.


  1. K. Ashok, M. Ashraf, J. Thimmia Raja, M. Z. Hussain, D. K. Singh, and A. Haldorai, “Collaborative analysis of audio-visual speech synthesis with sensor measurements for regulating human–robot interaction,” International Journal of System Assurance Engineering and Management, Aug. 2022, doi: 10.1007/s13198-022-01709-y.
  2. H and A. R, “Artificial Intelligence and Machine Learning for Enterprise Management,” 2019 International Conference on Smart Systems and Inventive Technology (ICSSIT), Nov. 2019, doi: 10.1109/icssit46314.2019.8987964.
  3. L. Jin, Z. Li, and J. Tang, “Deep semantic multimodal hashing network for scalable image-text and video-text retrievals,” IEEE Trans. Neural Netw. Learn. Syst., vol. 34, no. 4, pp. 1838–1851, 2023.
  4. W. Wang, R. Ma, T. Luo, C. Long, Q. Ye, and D. Chen, “X-12 seasonal adjustment combined with long and short-term memory neural networks for monthly electricity sales forecasting,” in 2022 International Symposium on Advances in Informatics, Electronics and Education (ISAIEE), 2022.
  5. N. Rajput and S. K. Verma, “Speech recognition using the epochwise back propagation through time algorithm,” Int. J. Comput. Appl., vol. 95, no. 21, pp. 17–21, 2014.
  6. J.-J. Xiong, G.-B. Zhang, J.-X. Wang, and T.-H. Yan, “Improved sliding mode control for finite-time synchronization of nonidentical delayed recurrent neural networks,” IEEE Trans. Neural Netw. Learn. Syst., vol. 31, no. 6, pp. 2209–2216, 2020.
  7. W. Luan, R. Zhang, B. Liu, B. Zhao, and Y. Yu, “Leveraging sequence-to-sequence learning for online non-intrusive load monitoring in edge device,” Int. J. Electr. Power Energy Syst., vol. 148, no. 108910, p. 108910, 2023.
  8. Haldorai and U. Kandaswamy, “Energy Efficient Network Selection for Cognitive Spectrum Handovers,” EAI/Springer Innovations in Communication and Computing, pp. 41–64, 2019, doi: 10.1007/978-3-030-15416-5_3.
  9. H. Cao, T. Zhao, W. Wang, and W. Peng, “Bilingual word embedding fusion for robust unsupervised bilingual lexicon induction,” Inf. Fusion, vol. 97, no. 101818, p. 101818, 2023.
  10. G. Qiu, “Parallel algorithm of hierarchical phrase machine translation based on distributed network memory,” Int. J. Inf. Syst. Supply Chain Manag., vol. 15, no. 1, pp. 1–16, 2021.
  11. Haldorai, A. Ramu, and S. Murugan, “Computing and Communication Systems in Urban Development,” Urban Computing, 2019, doi: 10.1007/978-3-030-26013-2.
  12. V. Markov, V. Rastunkov, A. Deshmukh, D. Fry, and C. Stefanski, “Implementation and learning of quantum hidden Markov models,” arXiv [quant-ph], 2022.
  13. J.-Y. Wang and J.-S. R. Jang, “Training a singing transcription model using connectionist temporal classification loss and cross-entropy loss,” IEEE ACM Trans. Audio Speech Lang. Process., vol. 31, pp. 383–396, 2023.
  14. D. Yolanda, M. H. Hersyah, and R. N. Pratama, “Implementation of Mel frequency cepstrum coefficients and dynamic time warping algorithms on door physical security system using voice recognition pattern,” in 2020 International Conference on Information Technology Systems and Innovation (ICITSI), 2020.
  15. D. S. Maylawati, Y. J. Kumar, and F. Binti Kasmin, “Feature-based approach and sequential pattern mining to enhance quality of Indonesian automatic text summarization,” Indones. J. Electr. Eng. Comput. Sci., vol. 30, no. 3, p. 1795, 2023.

Acknowledgements


We would like to thank Reviewers for taking the time and effort necessary to review the manuscript. We sincerely appreciate all valuable comments and suggestions, which helped us to improve the quality of the manuscript.


Funding


No funding was received to assist with the preparation of this manuscript.


Ethics declarations


Conflict of interest

The authors have no conflicts of interest to declare that are relevant to the content of this article.


Availability of data and materials


No data available for above study.


Author information


Contributions

All authors have equal contribution in the paper and all authors have read and agreed to the published version of the manuscript.


Corresponding author


Rights and permissions


This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article‟s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article‟s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit https://creativecommons.org/licenses/by-nc-nd/4.0/


Cite this article


Matt Bowden, “A Review of Textual and Voice Processing Algorithms in the Field of Natural Language Processing”, Journal of Computing and Natural Science, vol.3, no.4, pp. 194-203, October 2023. doi: 10.53759//181X/JCNS/202303018.


Copyright


© 2023 Matt Bowden. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.