Performance enhancement of hash based parallel deduplication model

dc.contributor.guideSundarakantham K
dc.coverage.spatialPerformance enhancement of hash based parallel deduplication model
dc.creator.researcherJane rubel angelina J
dc.date.accessioned2020-09-14T10:52:54Z
dc.date.available2020-09-14T10:52:54Z
dc.date.awarded30/09/2019
dc.date.completed2019
dc.date.registeredn.d.
dc.description.abstractIn recent years, a man-made digital universe has been created by millions of devices such as mobile phones, digital cameras, surveillance cameras, and embedded systems, and by organizations providing solutions for handling this enormous amount of data. This digital universe is doubling every two years and is expected to reach 44 trillion gigabytes by the year 2020. Backup solutions are provided to protect and preserve this voluminous data. However, as much as 75% of this data consists of duplicates. This creates a need for data reduction techniques that can optimize storage requirements. Deduplication is an effective data reduction technique that not only removes inter-file and intra-file redundancy but also helps remove duplicates among files and file constituents across users and even across organizations. Hash based deduplication splits the incoming data stream into fragments called chunks. An identity signature, also called a fingerprint, is created for each chunk using a cryptographic hash algorithm. A hash indexing structure is used to store the metadata, namely the fingerprints. The fingerprint insertion and lookup operations are CPU intensive. Moreover, as the size of the incoming data stream increases, the indexing structure also grows, leading to frequent disk lookups to access the metadata. Hence, maintaining the indexing structure, improving fingerprint insertion and lookup operations on it, and addressing the disk lookup bottleneck remain open issues in hash based deduplication.
dc.description.note
dc.format.accompanyingmaterialNone
dc.format.dimensions21cm
dc.format.extentxvii, 112p.
dc.identifier.urihttp://hdl.handle.net/10603/299276
dc.languageEnglish
dc.publisher.institutionFaculty of Information and Communication Engineering
dc.publisher.placeChennai
dc.publisher.universityAnna University
dc.relationp.105-111
dc.rightsuniversity
dc.source.universityUniversity
dc.subject.keywordEngineering and Technology
dc.subject.keywordComputer Science
dc.subject.keywordComputer Science Information Systems
dc.subject.keywordhash
dc.subject.keywordparallel
dc.titlePerformance enhancement of hash based parallel deduplication model
dc.title.alternative
dc.type.degreePh.D.
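
The abstract describes the basic hash based deduplication pipeline: split the stream into chunks, fingerprint each chunk with a cryptographic hash, and consult an index on every insertion and lookup. The sketch below illustrates that pipeline only, and is not the thesis's method. The fixed chunk size, the choice of SHA-256, the in-memory dictionary standing in for the hash index, and the names deduplicate and restore are all assumptions made for illustration; the record does not specify any of them.

```python
import hashlib


def deduplicate(stream: bytes, chunk_size: int = 4096):
    """Split a byte stream into fixed-size chunks and keep each unique chunk once.

    Assumptions (not from the record): fixed-size chunking, SHA-256
    fingerprints, and a plain dict as the fingerprint index.
    """
    index = {}   # fingerprint -> chunk bytes (stand-in for the hash index)
    recipe = []  # ordered fingerprints needed to rebuild the stream
    for offset in range(0, len(stream), chunk_size):
        chunk = stream[offset:offset + chunk_size]
        fingerprint = hashlib.sha256(chunk).hexdigest()
        if fingerprint not in index:      # fingerprint lookup
            index[fingerprint] = chunk    # fingerprint insertion
        recipe.append(fingerprint)
    return index, recipe


def restore(index, recipe):
    """Rebuild the original stream from the index and the chunk recipe."""
    return b"".join(index[fp] for fp in recipe)


# Four 4096-byte chunks, three of which are identical.
data = b"A" * 8192 + b"B" * 4096 + b"A" * 4096
index, recipe = deduplicate(data)
```

Here the stream has four chunks but only two distinct fingerprints, so the index stores two chunks while the recipe keeps four entries. In a real system the index outgrows memory as the stream grows, which is the disk lookup bottleneck the abstract refers to.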

Files

Original bundle

Now showing 1 - 5 of 15
Name:
01_title.pdf
Size:
24.64 KB
Format:
Adobe Portable Document Format
Description:
Attached File
Name:
02_certificates.pdf
Size:
530 KB
Format:
Adobe Portable Document Format
Name:
03_abstracts.pdf
Size:
11.17 KB
Format:
Adobe Portable Document Format
Name:
04_acknowledgements.pdf
Size:
5.13 KB
Format:
Adobe Portable Document Format
Name:
05_contents.pdf
Size:
69.92 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Name:
license.txt
Size:
1.79 KB
Format:
Plain Text