 




<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Storage Channel Pipeline &#187; Primary storage data reduction</title>
	<atom:link href="http://itknowledgeexchange.techtarget.com/storage-channel-pipeline/tag/primary-storage-data-reduction/feed/" rel="self" type="application/rss+xml" />
	<link>http://itknowledgeexchange.techtarget.com/storage-channel-pipeline</link>
	<description>A SearchStorageChannel.com blog</description>
	<lastBuildDate>Thu, 11 Apr 2013 23:17:00 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	
		<item>
		<title>Primary storage data reduction: Combine compression and deduplication</title>
		<link>http://itknowledgeexchange.techtarget.com/storage-channel-pipeline/primary-storage-data-reduction-combine-compression-and-deduplication/</link>
		<comments>http://itknowledgeexchange.techtarget.com/storage-channel-pipeline/primary-storage-data-reduction-combine-compression-and-deduplication/#comments</comments>
		<pubDate>Tue, 21 Dec 2010 02:09:19 +0000</pubDate>
		<dc:creator>Eric Slack</dc:creator>
				<category><![CDATA[compression]]></category>
		<category><![CDATA[Data deduplication]]></category>
		<category><![CDATA[Eric Slack]]></category>
		<category><![CDATA[Primary storage data reduction]]></category>
		<category><![CDATA[Storage Channel]]></category>

		<guid isPermaLink="false">http://itknowledgeexchange.techtarget.com/storage-channel-pipeline/?p=346</guid>
		<description><![CDATA[Data reduction is one of those “compass point” activities, meaning that it’s fundamental to so many other storage processes that it has become an objective in and of itself. Data compression was an early data reduction technology, one that evolved into a standard process for most storage components. Deduplication is now in a similar position [...]]]></description>
				<content:encoded><![CDATA[<p class="Body" style="margin: 0cm 0cm 0pt"><a href="http://searchdatabackup.techtarget.com/sDefinition/0,,sid187_gci1417617,00.html"><span style="font-family: Helvetica;color: #800080;font-size: small">Data reduction</span></a><span style="font-family: Helvetica;font-size: small"> is one of those “compass point” activities, meaning that it’s fundamental to so many other storage processes that it has become an objective in and of itself. Data compression was an early data reduction technology, one that evolved into a standard process for most storage components. Deduplication is now in a similar position as it’s evolved from a technology enabling cost-effective disk backup into another fundamental data reduction process, one that’s getting integrated into more and more storage components. With deduplication moving further up the stack to </span><a href="http://searchstorage.techtarget.com/generic/0,295582,sid5_gci1523132,00.html"><span style="font-family: Helvetica;font-size: small">primary storage</span></a><span style="font-size: small"><span style="font-family: Helvetica">, it’s being included in storage systems that also have compression. This has prompted the questions of which technology is better for primary storage and whether they can be used in conjunction with each other.<span id="more-346"></span> </span></span></p>
<p class="Body" style="margin: 0cm 0cm 0pt"><span style="font-family: Helvetica;font-size: small"> </span></p>
<p class="Body" style="margin: 0cm 0cm 0pt"><span style="font-size: small"><span style="font-family: Helvetica">Basically, dedupe requires (surprise) duplicate data blocks in the same data set in order to generate the opportunity to reference a block multiple times. The applications that suit this process are (surprise, again) backup, virtual machine images, traditional office documents and email, to name a few. These applications involve multiple instances of the same data associated with redundant operations or multiple people doing similar work. </span></span></p>
<p class="Body" style="margin: 0cm 0cm 0pt"><span style="font-family: Helvetica;font-size: small"> </span></p>
<p class="Body" style="margin: 0cm 0cm 0pt"><span style="font-size: small"><span style="font-family: Helvetica">Compression, thankfully, works well on data types that don’t do well with dedupe. It works on all data, although with varying effectiveness, and is a good solution for data sets that contain essentially unique data blocks. Media content, like audio and video files, are a natural fit, as are scientific and sensor-based data sets. These files can get very large, so even a small percentage of effective compression can make a difference. But what about running compression and dedupe together?</span></span></p>
<p class="Body" style="margin: 0cm 0cm 0pt"><span style="font-family: Helvetica;font-size: small"> </span></p>
<p class="Body" style="margin: 0cm 0cm 0pt"><span style="font-family: Helvetica;font-size: small">When dedupe first came out it was assumed that it wouldn’t work on compressed data. But this wasn’t accurate; combining compression and dedupe is actually a good practice, especially in primary storage systems, where data sets include a range of file types and software applications. In fact, </span><a href="http://www.storage-switzerland.com/Articles/Entries/2010/12/7_Optimizing_Primary_Storage.html"><span style="font-family: Helvetica;color: #800080;font-size: small">compression and dedupe together</span></a><span style="font-size: small"><span style="font-family: Helvetica"> can produce a higher rate of data reduction than the sum of both when run separately. </span></span></p>
<p class="Body" style="margin: 0cm 0cm 0pt"><span style="font-family: Helvetica;font-size: small"> </span></p>
<p class="Body" style="margin: 0cm 0cm 0pt"><span style="font-size: small"><span style="font-family: Helvetica">Primary storage systems are coming to market with deduplication embedded in their controllers, and dedupe technologies are evolving to meet this new use case. Like compression before it, primary storage dedupe carries lower overall reduction rates than backup. But the combination of dedupe and compression for primary storage can produce better results.</span></span></p>
<p class="Body" style="margin: 0cm 0cm 0pt"><span style="font-family: Helvetica;font-size: small"> </span></p>
<p class="Body" style="margin: 0cm 0cm 0pt"><span class="EmphasisA"><em><span style="font-size: 10pt"><span style="font-family: Times New Roman">Follow me on Twitter: </span></span></em></span><a href="http://twitter.com/EricSSwiss"><span class="EmphasisA"><em><span style="color: #152133;font-size: 10pt;text-decoration: none"><span style="font-family: Times New Roman">EricSSwiss</span></span></em></span></a></p>
<!-- wpms-network-global-inserts -->]]></content:encoded>
			<wfw:commentRss>http://itknowledgeexchange.techtarget.com/storage-channel-pipeline/primary-storage-data-reduction-combine-compression-and-deduplication/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
