Controlling the chunk-size in deduplication systems

Michael Hirsch, Shmuel T. Klein, Dana Shapira, Yair Toaff

نتاج البحث: فصل من :كتاب / تقرير / مؤتمرمنشور من مؤتمرمراجعة النظراء

2 اقتباسات (Scopus)

ملخص

A special case of data compression in which repeated chunks of data are stored only once is known as deduplication. The input data is cut into chunks and a cryptographically strong hash value of each (different) chunk is stored. To restrict the influence of small inserts and deletes to local perturbations, the chunk boundaries are usually defined in a data dependent way, which implies that the chunks are of variable length. Usually, the chunk sizes may spread over a large range, which may have a negative impact on the storage performance. This may be dealt with by imposing artificial lower and upper bounds. This paper suggests an alternative by which the chunk size distribution is controlled in a natural way. Some analytical and experimental results are given.

اللغة الأصليةالإنجليزيّة
عنوان منشور المضيفProceedings of the Prague Stringology Conference 2015, PSC 2015
المحررونJan Zd'arek, Jan Holub
الصفحات78-89
عدد الصفحات12
رقم المعيار الدولي للكتب (الإلكتروني)9788001057872
حالة النشرنُشِر - 2015
الحدث19th Prague Stringology Conference, PSC 2015 - Prague, التشيك
المدة: ٢٤ أغسطس ٢٠١٥٢٦ أغسطس ٢٠١٥

سلسلة المنشورات

الاسمProceedings of the Prague Stringology Conference 2015, PSC 2015

!!Conference

!!Conference19th Prague Stringology Conference, PSC 2015
الدولة/الإقليمالتشيك
المدينةPrague
المدة٢٤/٠٨/١٥٢٦/٠٨/١٥

بصمة

أدرس بدقة موضوعات البحث “Controlling the chunk-size in deduplication systems'. فهما يشكلان معًا بصمة فريدة.

قم بذكر هذا