Journal of Computational Intelligence in Materials Science


An in Depth Exploration of Machine Learning and Statistical Learning Techniques in Data Analytics



Journal of Computational Intelligence in Materials Science

Received On : 10 September 2024

Revised On : 26 October 2024

Accepted On : 12 November 2024

Published On : 23 November 2024

Volume 02, 2024

Pages : 130-141


Abstract


In this article, we describe a data analytics methodology for gleaning insights from the production lines of a power transfer unit, such as the critical measurements needed to construct a shim used to align shafts. This study also outlines the most effective methods and analytical methodologies for domains used in determining which measurements are afflicted by faults; determining which measurements are afflicted by shim dimensions; determining which relationships exist between station codes; forecasting shim dimensions; determining which duplicate samples are present in faulty data; and determining which error distributions are afflicted by measurement. Both statistical analysis and analysis based on machine learning (ML) are used to these domains. These findings demonstrate the reproduction rate of defective units, the relative significance of measurement in relation to the shim dimensions, error distribution and faulty units of measurements. The 'PTU housing measurement' was shown to be the most critical measurement out of all the shim dimensions by both statistical and ML-based analyses.


Keywords


Power Transfer Units, Data Analytics, Big Data, Machine Learning, Artificial Intelligence, Housing Measurement.


  1. Y. Han, L. Zhou, Y. Liang, Z. Li, and Y. Zhu, “Fabrication and properties of silica/mullite porous ceramic by foam-gelcasting process using silicon kerf waste as raw material,” Mater. Chem. Phys., vol. 240, no. 122248, p. 122248, 2020.
  2. X. Wang, J. Xu, S. Lu, S. Ren, M. Leng, and H. Ma, “Single-receiver multioutput inductive power transfer system with independent regulation and unity power factor,” IEEE Trans. Power Electron., vol. 37, no. 1, pp. 1159–1171, 2022.
  3. Y. Yang, P. Li, H. Pei, and Y. Zou, “Design of all-wheel-drive power-split hybrid configuration schemes based on hierarchical topology graph theory,” Energy (Oxf.), vol. 242, no. 122944, p. 122944, 2022.
  4. A. R. Kulkarni, N. Kumar, and K. R. Rao, “Efficacy of Bluetooth-based data collection for road traffic analysis and visualization using big data analytics,” Big Data Min. Anal., vol. 6, no. 2, pp. 139–153, 2023.
  5. J. Han, T. Zhang, Y. Li, and Z. Liu, “RD-NMSVM: neural mapping support vector machine based on parameter regularization and knowledge distillation,” Int. J. Mach. Learn. Cybern., vol. 13, no. 9, pp. 2785–2798, 2022.
  6. J. Lee and M. Park, “Estimation of p-values with reduced set of genomic association data,” Korean Data Anal. Soc., vol. 20, no. 4, pp. 1633–1643, 2018.
  7. R. K. Mishra, J. A. A. Jothi, S. Urolagin, and K. Irani, “Knowledge based topic retrieval for recommendations and tourism promotions,” International Journal of Information Management Data Insights, vol. 3, no. 1, p. 100145, 2023.
  8. R. Gao and K. Ye, “SVMs-SKSM: Protein Function Multi-label Classification based on SVM-SVM classifiers Fusion and sequences kernel similarity matrix,” Research Square, 2022.
  9. X. Fan, Y. X. R. Wang, P. Sarkar, and Y. Yue, “A unified framework for tuning hyperparameters in clustering problems,” Stat. Sin., 2024.
  10. D.-M. Ge, L.-C. Zhao, and M. Esmaeili-Falak, “Estimation of rapid chloride permeability of SCC using hyperparameters optimized random forest models,” J. Sustain. Cem.-based Mater., pp. 1–19, 2022.

Acknowledgements


We would like to thank Reviewers for taking the time and effort necessary to review the manuscript. We sincerely appreciate all valuable comments and suggestions, which helped us to improve the quality of the manuscript.


Funding


No funding was received to assist with the preparation of this manuscript.


Ethics declarations


Conflict of interest

The authors have no conflicts of interest to declare that are relevant to the content of this article.


Availability of data and materials


No data available for above study.


Author information


Contributions

All authors have equal contribution in the paper and all authors have read and agreed to the published version of the manuscript.


Corresponding author


Rights and permissions


Open Access This article is licensed under a Creative Commons Attribution NoDerivs is a more restrictive license. It allows you to redistribute the material commercially or non-commercially but the user cannot make any changes whatsoever to the original, i.e. no derivatives of the original work. To view a copy of this license, visit https://creativecommons.org/licenses/by-nc-nd/4.0/


Cite this article


Shangguan Jiang, “An in Depth Exploration of Machine Learning and Statistical Learning Techniques in Data Analytics”, Journal of Computational Intelligence in Materials Science, vol.2, pp. 130-141, 2024. doi: 10.53759/832X/JCIMS202402013.


Copyright


© 2024 Shangguan Jiang. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.