Journal of Machine and Computing


Enhancing Cloud Data Deduplication with Dynamic Chunking and Public Blockchain



Journal of Machine and Computing

Received On : 11 July 2023

Revised On : 15 March 2024

Accepted On : 22 May 2024

Published On : 05 July 2024

Volume 04, Issue 03

Pages : 521-530


Abstract


The majority of cloud service providers (CSPs) store and remove customer data according to certain principles. The majority of them have designed their cloud platform to have very high levels of consistency, speed, availability, and durability. Their systems are built with these performance characteristics in mind, and the requirement to ensure precise and rapid data deletion must be carefully balanced. In the public blockchain, this paper suggests employing the rapid content-defined Chunking algorithm for data duplication. Acute data is frequently outsourced by individuals and organizations to distant cloud servers since doing so greatly reduces the headache of maintaining infrastructure and software. However, because user data is transmitted to cloud storage providers and stored on a remote cloud, ownership and control rights are nonetheless separated. Users thus have significant challenges when attempting to confirm the integrity of private information. According to the experiment results, the suggested dynamic chunking has a fast processing time that is on par with fixed-length chunking and significantly improves deduplication processing capability.


Keywords


Cloud Service Providers (CSPs), Chunking Algorithm, Data Duplication, Blockchain.


  1. S. Guo, X. Mao, M. Sun, and S. Wang, “Double Sliding Window Chunking Algorithm for Data Deduplication in Ocean Observation,” IEEE Access, vol. 11, pp. 70470–70481, 2023, doi: 10.1109/access.2023.3276785.
  2. M. El Ghazouani, M. A. E. kiram, E.-R. Latifa, and Y. El Khanboubi, “Efficient Method Based on Blockchain Ensuring Data Integrity Auditing with Deduplication in Cloud,” International Journal of Interactive Multimedia and Artificial Intelligence, vol. 6, no. 3, p. 32, 2020, doi: 10.9781/ijimai.2020.08.001.
  3. Sumathi, M, “Secure blockchain based data storage and integrity auditing in cloud”, Turkish Journal of Computer and Mathematics Education, Vol.12 No.9 (2021), 159-165.
  4. H. Yuan, X. Chen, J. Wang, J. Yuan, H. Yan, and W. Susilo, “Blockchain-based public auditing and secure deduplication with fair arbitration,” Information Sciences, vol. 541, pp. 409–425, Dec. 2020, doi: 10.1016/j.ins.2020.07.005.
  5. L. Liu, X. Liu, and J. Wan, “Design of Updating Encryption Algorithm for Privacy Big Data Based on Consortium Blockchain Technology,” Journal of Mathematics, vol. 2022, pp. 1–11, Oct. 2022, doi: 10.1155/2022/7138173.
  6. J. Gnana Jeslin and P. Mohan Kumar, “Decentralized and Privacy Sensitive Data De-Duplication Framework for Convenient Big Data Management in Cloud Backup Systems,” Symmetry, vol. 14, no. 7, p. 1392, Jul. 2022, doi: 10.3390/sym14071392.
  7. B. Zhou, S. Zhang, Y. Zhang, and J. Tan, “A Bit String Content Aware Chunking Strategy for Reduced CPU Energy on Cloud Storage,” Journal of Electrical and Computer Engineering, vol. 2015, pp. 1–8, 2015, doi: 10.1155/2015/242086.
  8. C. Zhang, D. Qi, W. Li, and J. Guo, “Function of Content Defined Chunking Algorithms in Incremental Synchronization,” IEEE Access, vol. 8, pp. 5316–5330, 2020, doi: 10.1109/access.2019.2963625.
  9. P. Prajapati and P. Shah, “A Review on Secure Data Deduplication: Cloud Storage Security Issue,” Journal of King Saud University - Computer and Information Sciences, vol. 34, no. 7, pp. 3996–4007, Jul. 2022, doi: 10.1016/j.jksuci.2020.10.021.
  10. Y.-W. KO, H.-M. JUNG, W.-Y. LEE, M.-J. KIM, and C. YOO, “Stride Static Chunking Algorithm for Deduplication System,” IEICE Transactions on Information and Systems, vol. E96.D, no. 7, pp. 1544–1547, 2013, doi: 10.1587/transinf. e96.d.1544.
  11. Rao, K. P., Efficient and Reliable Secure Cloud Storage Schema of Block chain for Data De-duplication in Cloud, Turkish Journal of Computer and Mathematics Education. Vol.12 No.9 (2021),1547-1556.
  12. T. R. Nisha, S. Abirami, and E. Manohar, “Experimental Study on Chunking Algorithms of Data Deduplication System on Large Scale Data,” Advances in Intelligent Systems and Computing, pp. 91–98, Dec. 2015, doi: 10.1007/978-81-322-2674-1_9.
  13. C. Bo, Z. F. Li, and W. Can, “Research on Chunking Algorithms of Data De-duplication,” Advances in Intelligent Systems and Computing, pp. 1019–1025, 2013, doi: 10.1007/978-3-642-31698-2_144.
  14. Y. El Khanboubi, M. Hanoune, and M. El Ghazouani, “A New Data Deletion Scheme for a Blockchain-based De-duplication System in the Cloud,” International Journal of Communication Networks and Information Security (IJCNIS), vol. 13, no. 2, Apr. 2022, doi: 10.17762/ijcnis. v13i2.4975.
  15. B.-H. Kim, A. Haldorai, and S. S, “A Battery Lifetime Monitoring and Estimation using Split Learning Algorithm in Smart Mobile Consumer Electronics,” IEEE Transactions on Consumer Electronics, pp. 1–1, 2024, doi: 10.1109/tce.2024.3397714.
  16. S. P. Paul and D. Vetrithangam, “A Scientometric Study of Research Development on Cloud Computing-Based Data Management Technique,” Lecture Notes in Networks and Systems, pp. 617–625, 2023, doi: 10.1007/978-981-99-3716-5_50.

Acknowledgements


Author(s) thanks to Dr.Vetrithangam D for this research completion and support.


Funding


No funding was received to assist with the preparation of this manuscript.


Ethics declarations


Conflict of interest

The authors have no conflicts of interest to declare that are relevant to the content of this article.


Availability of data and materials


Data sharing is not applicable to this article as no new data were created or analysed in this study.


Author information


Contributions

All authors have equal contribution in the paper and all authors have read and agreed to the published version of the manuscript.


Corresponding author


Rights and permissions


Open Access This article is licensed under a Creative Commons Attribution NoDerivs is a more restrictive license. It allows you to redistribute the material commercially or non-commercially but the user cannot make any changes whatsoever to the original, i.e. no derivatives of the original work. To view a copy of this license, visit https://creativecommons.org/licenses/by-nc-nd/4.0/


Cite this article


Richa Arora and Vetrithangam D, “Enhancing Cloud Data Deduplication with Dynamic Chunking and Public Blockchain”, Journal of Machine and Computing, pp. 521-530, July 2024. doi: 10.53759/7669/jmc202404050.


Copyright


© 2024 Richa Arora and Vetrithangam D. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.