Hash Table Assisted Efficient File Level De-Duplication Scheme in SD-IoV Assisted Sensing Devices


Said G., Ullah A., Ghani A., Azeem M., YAHYA H., Bilal M., ...More

Intelligent Automation and Soft Computing, vol.38, no.1, pp.83-99, 2024 (SCI-Expanded) identifier

  • Publication Type: Article / Article
  • Volume: 38 Issue: 1
  • Publication Date: 2024
  • Doi Number: 10.32604/iasc.2023.036079
  • Journal Name: Intelligent Automation and Soft Computing
  • Journal Indexes: Science Citation Index Expanded (SCI-EXPANDED), Scopus, Applied Science & Technology Source, Computer & Applied Sciences
  • Page Numbers: pp.83-99
  • Keywords: de-duplication, Hash table, IoT, linked list, sensing devices
  • Istanbul Gelisim University Affiliated: Yes

Abstract

The Internet of Things (IoT) and cloud technologies have encouraged massive data storage at central repositories. Software-defined networks (SDN) support the processing of data and restrict the transmission of duplicate values. It is necessary to use a data de-duplication mechanism to reduce communication costs and storage overhead. Existing State of the art schemes suffer from computational overhead due to deterministic or random tree-based tags generation which further increases as the file size grows. This paper presents an efficient file-level de-duplication scheme (EFDS) where the cost of creating tags is reduced by employing a hash table with key-value pair for each block of the file. Further, an algorithm for hash table-based duplicate block identification and storage (HDBIS) is presented based on fingerprints that maintain a linked list of similar duplicate blocks on the same index. Hash tables normally have a consistent time complexity for lookup, generating, and deleting stored data regardless of the input size. The experiential results show that the proposed EFDS scheme performs better compared to its counterparts.