Paper Title : Data Duplication Removal using File Checksum
ISSN : 2394-2231
Year of Publication : 2022
10.5281/zenodo.6484045
MLA Style: Data Duplication Removal using File Checksum "Debashree Sadagar, Dr. Mir Aadil" Volume 9 - Issue 2 International Journal of Computer Techniques (IJCT) ,ISSN:2394-2231 , www.ijctjournal.org
APA Style: Data Duplication Removal using File Checksum "Debashree Sadagar, Dr. Mir Aadil" Volume 9 - Issue 2 International Journal of Computer Techniques (IJCT) ,ISSN:2394-2231 , www.ijctjournal.org
Abstract
The project enables the user to check for any duplicates in the database by checking the hash value of the file uploaded. If the file already exists in the database, it won’t be stored otherwise the file will be saved in the database. The goal of the project is to develop software that uses file checksums to prevent data duplication. The project's main goal is to reduce the number of duplicates in the database, particularly the key-value store, to improve process performance so that the backup window is not impacted, and to design for horizontal scaling so that it can compete on a Cloud Platform.
Reference
[1]S.Usharani,K.Dhanalakshmi,N.Dh analakshm, “De-Duplication Techniques: A Study” International Journal of Recent Technology and Engineering (IJRTE) ISSN: 2277- 3878, Volume-7, Issue-6S5, April 2019. [2] V.Sathiya Suntharam, Sheo Kumar, Chandu Ravi Kumar, “Research Method of Data Deduplication Backup System” International Journal of Innovative Technology and Exploring Engineering (IJITEE) ISSN: 2278- 3075, Volume-8, Issue-11S2, September 2019. [3] Shynu P.G, Nadesh R.K, Varun G. Menon, Venu P., Mahdi Abbasi & Mohammad R. Khosravi, “A secure data deduplication system for integrated cloudedge networks” Journal of Cloud Computing 9, article number 61(2020). [4] Osuolalea.Festus, “Data Finding, Sharing and Duplication Removal in the Cloud Using File Checksum Algorithm” International Journal of Research Studies in Computer Science and Engineering (IJRSCSE), Volume 6, Issue 1, 2019, PP 26-47, ISSN 2349-4840 (Print) & ISSN 2349-4859 (Online). [5] Nourah Almrezeq , Mamoona Humayun , A. A. Abd El-Aziz,and NZ Jhanjhi “An Enhanced Approach to Improve the Security and Performance for Deduplication”, Turkish Journal of Computer and Mathematics Education , 2866 Research Article Vol.12 No.6 (2021), 2866-2882.
Keywords
— Database, Duplication, Entity, Data, Checksum, Redundant, User id.