Currently, there is a significant focus on natural language processing (NLP) within academic circles. As one of the initial domains of inquiry in the domain of machine learning, it has been utilized in a variety of significant sub-disciplines, such as text processing, speech recognition, and machine translation. Natural language processing has contributed to notable progress in computing and artificial intelligence. The recurrent neural network serves as a fundamental component for numerous techniques in domain of NLP. The present article conducts a comprehensive evaluation of various algorithms for processing textual and voice data, accompanied by illustrative instances of their functionality. Various algorithmic outcomes exhibit the advancements achieved in this field during the preceding decade. Our endeavor involved the classification of algorithms based on their respective types and expounding on the scope for future research in this domain. Furthermore, the study elucidates the potential applications of these heterogeneous algorithms and also evaluates the disparities among them through an analysis of the findings. Despite the fact that natural language processing has not yet achieved its ultimate objective of flawlessness, it is plausible that with sufficient exertion, the field will eventually attain it. Currently, a wide variety of artificial intelligence systems use natural language processing algorithms to comprehend human-spoken directions.
Keywords
Natural Language Processing, Machine Learning, Speech Recognition, Machine Translation, Text Processing.
K. Ashok, M. Ashraf, J. Thimmia Raja, M. Z. Hussain, D. K. Singh, and A. Haldorai, “Collaborative analysis of audio-visual speech synthesis with sensor measurements for regulating human–robot interaction,” International Journal of System Assurance Engineering and Management, Aug. 2022, doi: 10.1007/s13198-022-01709-y.
H and A. R, “Artificial Intelligence and Machine Learning for Enterprise Management,” 2019 International Conference on Smart Systems and Inventive Technology (ICSSIT), Nov. 2019, doi: 10.1109/icssit46314.2019.8987964.
L. Jin, Z. Li, and J. Tang, “Deep semantic multimodal hashing network for scalable image-text and video-text retrievals,” IEEE Trans. Neural Netw. Learn. Syst., vol. 34, no. 4, pp. 1838–1851, 2023.
W. Wang, R. Ma, T. Luo, C. Long, Q. Ye, and D. Chen, “X-12 seasonal adjustment combined with long and short-term memory neural networks for monthly electricity sales forecasting,” in 2022 International Symposium on Advances in Informatics, Electronics and Education (ISAIEE), 2022.
N. Rajput and S. K. Verma, “Speech recognition using the epochwise back propagation through time algorithm,” Int. J. Comput. Appl., vol. 95, no. 21, pp. 17–21, 2014.
J.-J. Xiong, G.-B. Zhang, J.-X. Wang, and T.-H. Yan, “Improved sliding mode control for finite-time synchronization of nonidentical delayed recurrent neural networks,” IEEE Trans. Neural Netw. Learn. Syst., vol. 31, no. 6, pp. 2209–2216, 2020.
W. Luan, R. Zhang, B. Liu, B. Zhao, and Y. Yu, “Leveraging sequence-to-sequence learning for online non-intrusive load monitoring in edge device,” Int. J. Electr. Power Energy Syst., vol. 148, no. 108910, p. 108910, 2023.
Haldorai and U. Kandaswamy, “Energy Efficient Network Selection for Cognitive Spectrum Handovers,” EAI/Springer Innovations in Communication and Computing, pp. 41–64, 2019, doi: 10.1007/978-3-030-15416-5_3.
H. Cao, T. Zhao, W. Wang, and W. Peng, “Bilingual word embedding fusion for robust unsupervised bilingual lexicon induction,” Inf. Fusion, vol. 97, no. 101818, p. 101818, 2023.
G. Qiu, “Parallel algorithm of hierarchical phrase machine translation based on distributed network memory,” Int. J. Inf. Syst. Supply Chain Manag., vol. 15, no. 1, pp. 1–16, 2021.
Haldorai, A. Ramu, and S. Murugan, “Computing and Communication Systems in Urban Development,” Urban Computing, 2019, doi: 10.1007/978-3-030-26013-2.
V. Markov, V. Rastunkov, A. Deshmukh, D. Fry, and C. Stefanski, “Implementation and learning of quantum hidden Markov models,” arXiv [quant-ph], 2022.
J.-Y. Wang and J.-S. R. Jang, “Training a singing transcription model using connectionist temporal classification loss and cross-entropy loss,” IEEE ACM Trans. Audio Speech Lang. Process., vol. 31, pp. 383–396, 2023.
D. Yolanda, M. H. Hersyah, and R. N. Pratama, “Implementation of Mel frequency cepstrum coefficients and dynamic time warping algorithms on door physical security system using voice recognition pattern,” in 2020 International Conference on Information Technology Systems and Innovation (ICITSI), 2020.
D. S. Maylawati, Y. J. Kumar, and F. Binti Kasmin, “Feature-based approach and sequential pattern mining to enhance quality of Indonesian automatic text summarization,” Indones. J. Electr. Eng. Comput. Sci., vol. 30, no. 3, p. 1795, 2023.
Acknowledgements
We would like to thank Reviewers for taking the time and effort necessary to review the manuscript. We sincerely appreciate all valuable comments and suggestions, which helped us to improve the quality of the manuscript.
Funding
No funding was received to assist with the preparation of this manuscript.
Ethics declarations
Conflict of interest
The authors have no conflicts of interest to declare that are relevant to the content of this article.
Availability of data and materials
No data available for above study.
Author information
Contributions
All authors have equal contribution in the paper and all authors have read and agreed to the published version of the manuscript.
Corresponding author
Matt Bowden
Matt Bowden
Computer Science and Engineering, Australian National University, Australia.
This article is licensed under a Creative Commons Attribution 4.0 International
License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you
give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and
indicate if changes were made. The images or other third party material in this article are included in the article‟s
Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the
article‟s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the
permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit https://creativecommons.org/licenses/by-nc-nd/4.0/
Cite this article
Matt Bowden, “A Review of Textual and Voice Processing Algorithms in the Field of Natural Language Processing”, Journal of Computing and Natural Science, vol.3, no.4, pp. 194-203, October 2023. doi: 10.53759//181X/JCNS/202303018.