 




<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Yottabytes: Storage and Disaster Recovery &#187; big data</title>
	<atom:link href="http://itknowledgeexchange.techtarget.com/storage-disaster-recovery/tag/big-data/feed/" rel="self" type="application/rss+xml" />
	<link>http://itknowledgeexchange.techtarget.com/storage-disaster-recovery</link>
	<description>Sharon Fisher on issues, trends, and analysis in storage and disaster recovery.</description>
	<lastBuildDate>Mon, 20 May 2013 03:05:54 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	
		<item>
		<title>EMC&#8217;s &#8216;The Lazy Song&#8217; Makes Way for &#8216;When I Was Your Man&#8217;</title>
		<link>http://itknowledgeexchange.techtarget.com/storage-disaster-recovery/emcs-the-lazy-song-makes-way-for-when-i-was-your-man/</link>
		<comments>http://itknowledgeexchange.techtarget.com/storage-disaster-recovery/emcs-the-lazy-song-makes-way-for-when-i-was-your-man/#comments</comments>
		<pubDate>Thu, 09 May 2013 03:42:54 +0000</pubDate>
		<dc:creator>Sharon Fisher</dc:creator>
				<category><![CDATA[big data]]></category>
		<category><![CDATA[emc]]></category>
		<category><![CDATA[ibm]]></category>
		<category><![CDATA[tucci]]></category>
		<category><![CDATA[vmware]]></category>
		<category><![CDATA[emc world]]></category>
		<category><![CDATA[pivotal]]></category>

		<guid isPermaLink="false">http://itknowledgeexchange.techtarget.com/storage-disaster-recovery/?p=636</guid>
		<description><![CDATA[It&#8217;s ironic that EMC&#8217;s EMC World party is featuring Bruno Mars, whose most recent hit, &#8220;When I Was Your Man,&#8221; features the singer lamenting the loss of his girlfriend because he took her for granted. Billed as the &#8220;Customer Appreciation Event,&#8221; the party is the culmination of almost a week in Las Vegas where EMC [...]]]></description>
				<content:encoded><![CDATA[<p>It&#8217;s ironic that EMC&#8217;s EMC World party is featuring Bruno Mars, whose most recent hit, &#8220;<a href="https://www.google.com/url?sa=t&amp;rct=j&amp;q=&amp;esrc=s&amp;source=web&amp;cd=1&amp;cad=rja&amp;ved=0CDAQtwIwAA&amp;url=http%3A%2F%2Fwww.youtube.com%2Fwatch%3Fv%3DekzHIouo8Q4&amp;ei=3BiLUanSJ6LqiwKuxIBY&amp;usg=AFQjCNHWig-cUmQyMqh6XCTCkmxi-3epzQ&amp;sig2=64EvlejTC-jGQ8GEiSo4ag&amp;bvm=bv.46340616,d.cGE">When I Was Your Man</a>,&#8221; features the singer lamenting the loss of his girlfriend because he took her for granted. Billed as the &#8220;Customer Appreciation Event,&#8221; the party is the culmination of almost a week in Las Vegas where EMC is attempting to demonstrate to its customers that it does, too, notice that other things are going on in the storage industry besides bigger and bigger, more expensive boxes to hold data.</p>
<p>EMC is sort of the IBM of the storage industry &#8212; big, not necessarily terribly exciting or innovative, but continuing to be a major player because, remember big? Just like IBM can suddenly decide to make a particular technology front-page news by throwing a billion dollars at it, <a href="http://searchstorage.techtarget.com/news/2240181402/Big-Blues-billion-dollar-bet-on-SSD-includes-all-flash-storage-array">like it did with flash a few weeks ago</a>, EMC can make a big deal about storage virtualization, software-defined storage, mobile, cloud, and so on simply by virtue of being EMC, even though other storage vendors have been doing it for years.</p>
<p>There&#8217;s other places to read about the <a href="http://searchstorage.techtarget.com/news/2240183499/EMC-World-2013-EMC-ViPR-has-vendor-on-software-defined-storage-route">specific announcements</a> so I won&#8217;t go into them, other than to observe that EMC is saying you will be able to use them to have your own Facebook-like data center. Except the whole point of the <a href="http://itknowledgeexchange.techtarget.com/storage-disaster-recovery/facebook-starts-designing-its-own-storage/">Facebook&#8217;s data center storage</a> is that it uses commodity hardware, and if you&#8217;re using commodity hardware, then what do you need EMC for anyway? I know, I know, it&#8217;s a metaphor, never mind.</p>
<p>Befitting the conference&#8217;s theme of &#8220;transformation,&#8221; EMC seemed to be <a href="http://gigaom.com/2013/05/07/how-emcs-cto-is-trying-to-keep-emc-vmware-and-pivotal-orbiting-the-same-sun/?utm_source=feedly">spending an awful lot of time explaining the various reorganizations</a> it&#8217;s had over the past few years, starting when CEO <a href="http://itknowledgeexchange.techtarget.com/storage-disaster-recovery/ho-hum-emc-ceo-announces-plans-to-step-down/">Joe Tucci decided he was going to retire</a>, then changed his mind, followed by <a href="http://itknowledgeexchange.techtarget.com/storage-disaster-recovery/emc-vmware-move-executives-around/">a lot of musical chairs between EMC and VMware</a>, and culminating in the <a href="http://itknowledgeexchange.techtarget.com/storage-disaster-recovery/emc-vmware-spin-off-big-data-apps-into-pivotal-unit/">recent announcement of Pivotal</a>, which rearranges yet more pieces of EMC and VMware.</p>
<p>At the same time, the company also spent a lot of time talking about the &#8220;<a href="http://www.itpro.co.uk/apps/19763/emc-world-2013-emc-hails-rise-third-platform-apps">third platform</a>&#8221; &#8212; a conglomeration of mobile, big data, cloud, and so on, after the first platform of mainframes and the second platform of client/server. After all, if EMC can make mobile and the cloud sound like just another generational version of mainframes, it sounds more like they&#8217;ll continue to be the logical alternative, right?</p>
<p>And of course EMC is going to do all it can to promote big data. Like <a href="http://www.tv.com/shows/pee-wees-playhouse/the-cowboy-and-the-cowntess-158505/">Cowboy Curtis</a>, who knows that &#8220;big feet&#8221; means &#8220;big boots,&#8221; EMC knows that big data means big hardware to put it on, and nobody does it bigger than EMC.</p>
<p>Ironically, this was all happening against a backdrop of EMC announcing it was <a href="http://www.bostonglobe.com/business/2013/05/07/emc-restructuring-plan-cuts-jobs/cWWavZ125CLHoQFjLgfdsO/story.html">laying off more than 1,000 people</a>, with VMware laying off another 800. The company said it was always doing this and that by the end of the year it would actually have more people than it started with. Okay. But seriously? After all the investment in hiring and training those people, the company sees no other way but to do a forklift upgrade of its employees?</p>
<p>On second thought, for EMC, maybe that isn&#8217;t so surprising after all.</p>
<p>In any event, EMC has to at least go through the motions of being up on what users are interested in, lest it sound too much like another Bruno Mars number, &#8220;<a href="https://www.google.com/url?sa=t&amp;rct=j&amp;q=&amp;esrc=s&amp;source=web&amp;cd=1&amp;cad=rja&amp;ved=0CDUQtwIwAA&amp;url=http%3A%2F%2Fwww.youtube.com%2Fwatch%3Fv%3DfLexgOxsZu0&amp;ei=_RiLUaSiAYmkigKby4G4Cg&amp;usg=AFQjCNFiBmCbKOFZvjOVE1TEofRoYvGk9A&amp;sig2=3J3DpQ3oZIhm3s3S3kHyyQ&amp;bvm=bv.46340616,d.cGE">The Lazy Song</a>&#8221; [mildly NSFW]:</p>
<p><em>Today I don&#8217;t feel like doing anything</em></p>
<p><em>I just wanna lay in my bed</em></p>
<p><em>Don&#8217;t feel like picking up my phone, so leave a message at the tone</em></p>
<p><em>&#8216;Cause today I swear I&#8217;m not doing anything</em></p>
<p><em>I&#8217;ll be lounging on the couch just chilling in my Snuggie</em></p>
<p><em>Click to MTV so they can teach me <a href="http://www.youtube.com/watch?v=Uf2mBVcYS10">how to dougie</a></em></p>
<p><em>&#8216;Cause in my castle I&#8217;m the freaking man</em></p>
<!-- wpms-network-global-inserts -->]]></content:encoded>
			<wfw:commentRss>http://itknowledgeexchange.techtarget.com/storage-disaster-recovery/emcs-the-lazy-song-makes-way-for-when-i-was-your-man/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>IBM Acquires StoredIQ, But What is It?</title>
		<link>http://itknowledgeexchange.techtarget.com/storage-disaster-recovery/ibm-acquires-storediq-but-what-is-it/</link>
		<comments>http://itknowledgeexchange.techtarget.com/storage-disaster-recovery/ibm-acquires-storediq-but-what-is-it/#comments</comments>
		<pubDate>Fri, 28 Dec 2012 00:04:41 +0000</pubDate>
		<dc:creator>Sharon Fisher</dc:creator>
				<category><![CDATA[big data]]></category>
		<category><![CDATA[e-discovery]]></category>
		<category><![CDATA[ediscovery]]></category>
		<category><![CDATA[ibm]]></category>
		<category><![CDATA[storediq]]></category>

		<guid isPermaLink="false">http://itknowledgeexchange.techtarget.com/storage-disaster-recovery/?p=549</guid>
		<description><![CDATA[IBM announced earlier this month that it was acquiring StoredIQ, but exactly what the company does isn&#8217;t quite obvious. Part big data, part e-discovery, it&#8217;s sort of neither fish nor fowl. As an example, analyst firm IDC included the Austin, Texas-based StoredIQ in its IDC MarketScape: Worldwide Standalone Early Case Assessment Applications 2011 Vendor Analysis, [...]]]></description>
				<content:encoded><![CDATA[<p>IBM announced earlier this month that it was acquiring StoredIQ, but exactly what the company does isn&#8217;t quite obvious. Part big data, part e-discovery, it&#8217;s sort of neither fish nor fowl.</p>
<p>As an example, analyst firm IDC included the Austin, Texas-based StoredIQ in its <a href="http://www.guidancesoftware.com/DocumentRegistration.aspx?did=1000017580&amp;cmpid=newsrelease">IDC MarketScape: Worldwide Standalone Early Case Assessment Applications 2011 Vendor Analysis</a>, but Gartner hasn&#8217;t included it in either of its <a href="http://itknowledgeexchange.techtarget.com/storage-disaster-recovery/gartner-releases-second-e-discovery-magic-quadrant-shows-big-changes/">e-discovery Magic Quadrants</a> &#8212; from which a number of larger vendors have <a href="http://itknowledgeexchange.techtarget.com/storage-disaster-recovery/expect-a-wave-of-e-discovery-acquisitions/">plucked other acquisitions</a>. (However, Gartner did name <a href="http://www.storediq.com/company/press-releases/storediq-named-cool-vendor-leading-analyst-firm">StoredIQ as a &#8220;Cool Vendor</a>&#8221; in April of this year.&#8221;)</p>
<p>Instead, IBM is working on creating a family of &#8220;<a href="http://searchstorage.techtarget.com/definition/information-life-cycle-management">information lifecycle management</a>&#8221; applications, which are kinda both &#8212; big data, because it covers all an organization&#8217;s data, but also e-discovery, because part of the reason for having such applications is for litigation support, both for identifying data needed in legal situations but also to help reduce, in a legally justifiable way, the amount of such data in the first place.</p>
<p>StoredIQ&#8217;s advantage is that it manages the data <em>in situ </em>rather than by moving it to a secondary location, which saves the cost of the secondary storage, noted <a href="http://www.zacks.com/stock/news/89040/ibm-set-to-buy-storediq">Zacks Equity Research</a>, adding that the company had received $11.4 million in funding in August and had 120 clients &#8212; though it warns that IBM faces competition from vendors such as EMC, Oracle, and SAP.</p>
<p>The company has also been working to make its product, which includes software and an appliance, <a href="http://www.networkcomputing.com/e-discovery/storediq-makes-e-discovery-user-friendly/229501347">easy enough for even legal professionals</a> to use, rather than requiring IT people to operate. In addition, it has partnered with a wide variety of other vendors over the years, including NetApp, EMC, and NewsGator, and supported a number of formats, including SharePoint and Office 365.</p>
<p>As <a href="http://h30565.www3.hp.com/t5/Feature-Articles/Big-Data-What-It-Is-Why-You-Care-What-Users-Are-Doing-With-It/ba-p/4848">big data</a> has become more prevalent, companies are interested in saving their data in hopes of being able to analyze it at some point and improve their businesses. But what it calls <a href="http://www.law.com/jsp/lawtechnologynews/PubArticleLTN.jsp?id=1202581938140&amp;Hoarders_The_Corporate_Data_Edition=&amp;et=editorial&amp;bu=LTN&amp;cn=20121220&amp;src=EMC-Email&amp;pt=Law%20Technology%20News&amp;kw=%27Hoarders%27%3A%20The%20Corporate%20Data%20Edition">data hoarding</a> is a problem for two reasons, notes Law Technology News.</p>
<p>First, there&#8217;s the cost. Though the price of storage itself has been dropping, it still costs something, plus there&#8217;s the cost of managing it, backing it up, and so on &#8212; which could amount to $5,000 per terabyte, Law Technology News said.</p>
<p>Second, there&#8217;s the legal cost. Should an organization be sued, it not only needs to provide all the pertinent information that the other side asks for, but it has to find it in the first place &#8212; and the more data a company has, the more expensive that search is. Also, companies have to balance the value of the data for analysis with what it might cost them should it reveal something in a lawsuit. This cost is on the order of $15,000 per gigabyte, Law Technology News said.</p>
<p>In fact, legal organizations have been advising companies to look for opportunities to delete data, pointing out <a href="http://itknowledgeexchange.techtarget.com/storage-disaster-recovery/information-retention-saves-legal-costs-but-many-dont-do-it/">how much money they can save.</a> However, they have to do this in a regular fashion, because once a lawsuit is filed, a &#8220;legal hold&#8221; is put on the data and it can&#8217;t be deleted, or a company is subject to large fines.</p>
<p>The acquisition becomes part of IBM&#8217;s Information Lifecycle Governance suite, headed by Deidre Paknad, vice president of Information Lifecycle Governance. Paknad had been CEO of <a href="http://www-01.ibm.com/software/ecm/pss/">PSS Systems Inc.</a> in Mountain View, Calif., a pioneer in the e-discovery space, which itself was acquired by IBM in 2010. The group also includes <a href="http://vivisimo.com/">Vivisimo</a>, which IBM acquired <a href="http://www-03.ibm.com/press/us/en/pressrelease/37833.wss">earlier this year</a>.</p>
<p>The acquisition was not a surprise; IBM had partnered with StoredIQ for two years. As is typical for IBM, it did not reveal the cost of the acquisition. It is expected to be finalized in the first calendar quarter of 2013.</p>
<!-- wpms-network-global-inserts -->]]></content:encoded>
			<wfw:commentRss>http://itknowledgeexchange.techtarget.com/storage-disaster-recovery/ibm-acquires-storediq-but-what-is-it/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Exabyte, Zettabyte, Yottabyte&#8230;Then What? Opinions Vary</title>
		<link>http://itknowledgeexchange.techtarget.com/storage-disaster-recovery/exabyte-zettabyte-yottabyte-then-what-opinions-vary/</link>
		<comments>http://itknowledgeexchange.techtarget.com/storage-disaster-recovery/exabyte-zettabyte-yottabyte-then-what-opinions-vary/#comments</comments>
		<pubDate>Fri, 16 Nov 2012 01:10:10 +0000</pubDate>
		<dc:creator>Sharon Fisher</dc:creator>
				<category><![CDATA[big data]]></category>
		<category><![CDATA[data storage]]></category>
		<category><![CDATA[intel]]></category>
		<category><![CDATA[seagate]]></category>
		<category><![CDATA[brontobyte]]></category>
		<category><![CDATA[yottabyte]]></category>

		<guid isPermaLink="false">http://itknowledgeexchange.techtarget.com/storage-disaster-recovery/?p=481</guid>
		<description><![CDATA[When I first started this blog almost two years ago, I called it &#8220;yottabytes&#8221; because that was the term commonly accepted for the biggest size of storage (1000^8, or a 10 followed by 24 zeroes, compared with, say, 1000^4 for a terabyte). But as people are actually starting to refer to petabytes (1000^5) and exabytes [...]]]></description>
				<content:encoded><![CDATA[<p>When I first started this blog almost two years ago, I called it &#8220;yottabytes&#8221; because that was the term commonly accepted for the biggest size of storage (1000^8, or a 10 followed by 24 zeroes, compared with, say, 1000^4 for a terabyte). But as people are actually starting to refer to petabytes (1000^5) and exabytes (1000^6) of storage, there&#8217;s starting to be more discussion of what comes next.</p>
<p>No, I&#8217;m not changing the name of the blog.</p>
<p>The proximate cause for the discussion now is a presentation by Shantanu Gupta, director of Connected Intelligent Solutions for Intel&#8217;s Data Center and Connected Systems Group, which <a href="http://gigaom.com/data/as-data-gets-bigger-what-comes-after-a-yottabyte/">showed up in GigaOm</a> the other day. According to this presentation, what comes after yottabyte is brontobyte, or 10 followed by 27 zeroes.</p>
<p>This is not definite; as GigaOm&#8217;s Stacey Higginbotham points out, it&#8217;s not an <a href="http://en.wikipedia.org/wiki/Metric_prefix">official prefix</a>, though it has been <a href="http://en.wikipedia.org/wiki/Unit_prefix">discussed since at least 1991</a> &#8212; though, that far back, it was 10 followed by 15 zeroes. It does, however, appear to be more accepted for the number than does <a href="http://www.time.com/time/health/article/0,8599,1970849,00.html">hella</a>-, which had a brief flurry a couple of years ago as people tried to promote it.</p>
<p>Past bronto, to 10 followed by 30 zeroes, it gets more complicated, partly due to what honestly looks like typographical errors.</p>
<p>Gupta refers to &#8220;geobyte&#8221; in his presentation &#8212; but also refers to &#8220;bronobyte&#8221; as opposed to &#8220;brontobyte&#8221; for 10 followed by 27 zeroes . <a href="http://wiki.answers.com/Q/What_comes_after_a_geobyte">Wikianswers</a> also refers to &#8220;geobyte.&#8221;</p>
<p>Higginbotham refers to &#8220;gegobyte&#8221; for the figure, as does <a href="http://storageeffect.media.seagate.com/2012/11/storage-effect/a-gegobyte-hard-drive-would-cover-the-earth-23000000-times/">Seagate in a blog posting</a> riffing on the GigaOm post.</p>
<p>On the other hands, answers.yahoo.com uses &#8220;<a href="http://answers.yahoo.com/question/index?qid=20100125170012AA9sL0h">geopbyte</a>&#8221; for the figure, as does the <a href="http://www.urbandictionary.com/define.php?term=Geopbyte">Urban Dictionary</a> and <a href="http://www.whatsabyte.com/">whatisabyte.com</a>.</p>
<p>Geo-, gego-, or geop-? It kind of doesn&#8217;t matter, because it&#8217;s all unofficial anyway, but somebody might want to figure it out at some point.</p>
<p>Beyond what-do-we-call-it, we also have the obligatory how-to-put-it-in-terms-we-puny-humans-can-understand discussion, aka the Flurry of Analogies that came up when <a href="http://itknowledgeexchange.techtarget.com/storage-disaster-recovery/ibms-120-petabyte-hard-drive-causes-flurry-of-analogies/">IBM announced a 120-petabyte hard drive</a> a year ago. Depending on where you read about it, that drive was:</p>
<ul>
<li>2.4 million Blu-ray disks</li>
<li>24 million HD movies</li>
<li>24 billion MP3s</li>
<li>1 trillion files</li>
<li>Eight times as largest as the biggest disk array available previously</li>
<li>More than twice the entire written works of mankind from the beginning of recorded history in all languages</li>
<li>6,000 Libraries of Congress (a standard unit of data measure)</li>
<li>Almost as much data as Google processes every week</li>
<li>Or, <a href="http://itknowledgeexchange.techtarget.com/storage-disaster-recovery/how-would-you-move-30-petabytes-of-data/">four Facebooks</a></li>
</ul>
<p>So, how big are bronto- and geo/gego/geop-?</p>
<p>Well, GigaOm wrote, &#8221;Cisco estimates we’ll see a 1.3 zettabytes of traffic annually over the internet in 2016.&#8221; On the other hand,  GigaOm cited a piece with the Cisco estimate being <a href="http://gigaom.com/2012/02/14/despite-critics-cisco-stands-by-its-data-deluge/">130 exabytes</a>, which would only be .13 zettabytes if I have my math right. Seagate estimates that total storage capacity demand will reach 7 zettabytes in 2020.</p>
<p>Yottabytes is in the realm of <a href="http://www.examiner.com/article/nsa-2-billion-utah-based-facility-can-process-yottabytes-of-information">CIA and NSA spy data</a>, noted a piece in the Examiner.com, which went on to point out, &#8220;As of 2011, no storage system has achieved one zettabyte of information. The combined space of all computer hard drives in the world does not amount to even one yottabyte, but was estimated at approximately 160 exabytes in 2006. As of 2009, the entire Internet was estimated to contain close to 500 exabytes.&#8221; A yottabyte would also be 250 trillion DVDs, GigaOm wrote.</p>
<p>For brontobyte, which Gupta said would be used primarily for the &#8220;<a href="http://h30565.www3.hp.com/t5/Feature-Articles/Implementing-The-Internet-of-Things-in-Your-Business/ba-p/1772">Internet of Things</a>&#8221; ubiquitous sensors, there are also somewhat fanciful definitions such as &#8220;More than the number of all the cells of the human body in each person living in Indiana and then some,&#8221; and &#8221;You would need a brontobyte computer to download everything on the Internet&#8221; (though, apparently not, according to Examiner.com).</p>
<p>Of course, once we start talking in terms of trillions of DVDs, obviously we&#8217;ve got to find another unit of measure. Interestingly, Seagate used geographic area.</p>
<p>&#8220;If today’s 4 terabyte 3.5-inch drive is roughly .16 square feet, you can get approximately 24 terabytes per square foot. That’s .0046 square miles of land mass per 4 terabytes. Assuming 1 terabyte per disk was the maximum areal density, and hard drives will not get any thicker than 1 inch:</p>
<ul>
<li>An exabyte hard drive would be about the size of Connecticut [or, I would add, <a href="http://en.wikipedia.org/wiki/Owyhee_County,_Idaho">Owyhee County in Idaho</a>]</li>
<li>A zettabyte hard drive would be about the size of Antarctica</li>
<li>A yottabyte hard drive would cover the earth 23 times</li>
<li>A brontobyte hard drive would cover the earth 23,000 times</li>
<li>A gegobyte hard drive would cover the earth 23,000,000 times&#8221;</li>
</ul>
<div>Of course, that would be using today&#8217;s technology.</div>
<p>Beyond that? Wikianswers postulated Saganbyte, Jotabyte, and Gatobyte, while Wikipedia referred to a system working backward through the Greek alphabet &#8212; though that one wouldn&#8217;t include brontobyte or geo/gego/geopbyte.</p>
<div></div>
<p>&nbsp;</p>
<!-- wpms-network-global-inserts -->]]></content:encoded>
			<wfw:commentRss>http://itknowledgeexchange.techtarget.com/storage-disaster-recovery/exabyte-zettabyte-yottabyte-then-what-opinions-vary/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>New Looks at Backing Up Oracle</title>
		<link>http://itknowledgeexchange.techtarget.com/storage-disaster-recovery/new-looks-at-backing-up-oracle/</link>
		<comments>http://itknowledgeexchange.techtarget.com/storage-disaster-recovery/new-looks-at-backing-up-oracle/#comments</comments>
		<pubDate>Thu, 18 Oct 2012 15:00:52 +0000</pubDate>
		<dc:creator>Sharon Fisher</dc:creator>
				<category><![CDATA[backup]]></category>
		<category><![CDATA[backups]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[cloud storage]]></category>
		<category><![CDATA[oracle]]></category>
		<category><![CDATA[virtualization]]></category>

		<guid isPermaLink="false">http://itknowledgeexchange.techtarget.com/storage-disaster-recovery/?p=457</guid>
		<description><![CDATA[All of a sudden, backing up Oracle databases is big news, with Wikibon and Amazon Web Services each releasing new insights about how to do it. What makes this a big deal? As Wikibon mentions, nearly 30% of Oracle shops are managing more than 100 TB of data that needs to be backed up. And [...]]]></description>
				<content:encoded><![CDATA[<p>All of a sudden, backing up Oracle databases is big news, with Wikibon and Amazon Web Services each releasing new insights about how to do it.</p>
<p>What makes this a big deal? As <a href="http://wikibon.org/wiki/v/The_Changing_Face_Of_Oracle_Backup_And_Recovery">Wikibon mentions</a>, nearly 30% of Oracle shops are managing more than 100 TB of data that needs to be backed up. And with &#8216;big data&#8217; becoming a buzzword, not only is the data getting bigger, but people are paying more attention to it.</p>
<p>Wikibon points out several trends, including increasing virtualization, more space devoted to backups, and that tape is still around. 45% of customers report that more than half of backup data resides on tape, Wikibon says.</p>
<p>But one of the newer backup choices that Wikibon mentions is <a href="http://www.oracle.com/technetwork/database/features/availability/rman-overview-096633.html">RMAN</a>. And the advantage to that is brought up in one of the other big recent developments in Oracle backup, which is RMAN&#8217;s newer ability to back up to the cloud.</p>
<p>That&#8217;s where the <a href="http://media.amazonwebservices.com/AWS_Amazon_Oracle_Backups.pdf">Amazon Web Services white paper</a> comes in. It describes how Amazon itself started backing up all its Oracle databases to the cloud using RMAN. While such white papers are often pretty self-serving &#8212; and now we&#8217;re talking about one where a vendor is using its own product, or what EMC&#8217;s Paul Maritz refers to as <a href="http://itknowledgeexchange.techtarget.com/storage-disaster-recovery/emc-vmware-move-executives-around/">&#8220;eating your own dog food&#8221;</a> &#8212; this one has some hard numbers behind it.</p>
<p>&#8220;The transition to S3-based backup started last year and by summer, 30 percent of backups were on S3; three months later it was 50 percent. The company expects the transition to be done by year’s end — except for databases in regions where Amazon s3 is not available,&#8221; <a href="http://gigaom.com/cloud/how-oracle-yes-oracle-helped-amazon-suck-the-cost-out-of-database-backup/">writes Barb Darrow for GigaOm</a>. Moreover, the company is saving $1 million per year for backups that take only half as long, she writes.</p>
<p>Whether you want to go the AWS route for Oracle backups or not, the Wikibon report has some interesting information on the backup subject. Granted, some of them are pretty Mom-and-apple pie &#8212; implement redundancy, test your backups, use dedupe &#8212; but others are more nuanced.</p>
<p>For example, the company notes, organizations are increasingly virtualizing their Oracle servers &#8212; which could have an impact on the speed of backing them up.  &#8221;The big initial attraction of server virtualization is that it increased average utilization from 15% to about 85%,&#8221; Wikibon writes. &#8220;This means that virtualized environments will see a drastic reduction in overall server capacity, some of which was used to run backups.&#8221;</p>
<!-- wpms-network-global-inserts -->]]></content:encoded>
			<wfw:commentRss>http://itknowledgeexchange.techtarget.com/storage-disaster-recovery/new-looks-at-backing-up-oracle/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Where&#8217;s the White House Going to Put All That Data?</title>
		<link>http://itknowledgeexchange.techtarget.com/storage-disaster-recovery/wheres-the-white-house-going-to-put-all-that-data/</link>
		<comments>http://itknowledgeexchange.techtarget.com/storage-disaster-recovery/wheres-the-white-house-going-to-put-all-that-data/#comments</comments>
		<pubDate>Tue, 29 May 2012 04:21:23 +0000</pubDate>
		<dc:creator>Sharon Fisher</dc:creator>
				<category><![CDATA[big data]]></category>
		<category><![CDATA[cloud computing]]></category>
		<category><![CDATA[digital government]]></category>
		<category><![CDATA[federal government]]></category>
		<category><![CDATA[meritalk]]></category>
		<category><![CDATA[obama]]></category>

		<guid isPermaLink="false">http://itknowledgeexchange.techtarget.com/storage-disaster-recovery/?p=300</guid>
		<description><![CDATA[While the White House released its digital government plan last week, it appears to have left out one major factor: just where the heck all that data is going to be stored, especially when storage already appears to be an issue for federal agencies, according to a recent survey. The Digital Government plan doesn&#8217;t even [...]]]></description>
				<content:encoded><![CDATA[<p>While the White House released its digital government plan last week, it appears to have left out one major factor: just where the heck all that data is going to be stored, especially when storage already appears to be an issue for federal agencies, according to a recent survey.</p>
<p>The <a href="http://www.whitehouse.gov/sites/default/files/omb/egov/digital-government/digital-government.html">Digital Government plan</a> doesn&#8217;t even mention the word &#8220;storage,&#8221; even though open data accessible to everyone is one of the linchpins of the plan.</p>
<p>But a <a href="http://www.meritalk.com/big-data-report-register.php">recent survey by MeriTalk</a> of 151 federal government IT professionals about big data found that storage was already an issue.</p>
<p>Factors found in the survey indicate the following:</p>
<ul>
<li>87% of IT professionals say their stored data has grown in the last two years (by an average of 61%)</li>
<li>96% expect their data to grow in the next two years (by an average of 64%)</li>
<li>31% of data is unstructured, and that amount is increasing</li>
<li>Agencies estimate they have just 49% of the data storage/access they need to leverage big data and drive mission results</li>
<li>40% of respondents pointed to storage capacity as one of the most significant challenges their agency faced when it came to managing large amounts of data</li>
<li>Agencies currently store an average of 1.61 petabytes of data, but expect to get to 2.63 petabytes in just the next two years</li>
<li>57% of agencies say they have at least one dataset that’s grown too big to work with using their current data management tools and/or infrastructure</li>
<li>While 64% of IT professionals say their agency’s data management system can be easily expanded/upgraded on demand, they estimate10 months as the average time they could double their short-to medium-term capacity</li>
<li>The #1 step that agencies say they are taking to improve their ability to manage and make decisions with big data is to invest in IT infrastructure to optimize data storage (39%)</li>
</ul>
<div>Exactly how the federal government is going to manage this (other than <a href="http://dilbert.com/strips/comic/2011-01-07/">&#8220;blah blah cloud&#8221;</a>) is not clear, but the government is going to need to find <a href="http://www.google.com/url?sa=t&amp;rct=j&amp;q=&amp;esrc=s&amp;source=web&amp;cd=3&amp;ved=0CFcQtwIwAg&amp;url=http%3A%2F%2Fwww.youtube.com%2Fwatch%3Fv%3DJLoge6QzcGY&amp;ei=G07ET5noMsnPiAL-9v2TCA&amp;usg=AFQjCNGvFPppl-1kV3181TSeOZ3nYkPiMQ">a place for its stuff</a> before it can figure out how to save it or leverage it.</div>
<!-- wpms-network-global-inserts -->]]></content:encoded>
			<wfw:commentRss>http://itknowledgeexchange.techtarget.com/storage-disaster-recovery/wheres-the-white-house-going-to-put-all-that-data/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>IBM&#8217;s 120-Petabyte Hard Drive Causes Flurry of Analogies</title>
		<link>http://itknowledgeexchange.techtarget.com/storage-disaster-recovery/ibms-120-petabyte-hard-drive-causes-flurry-of-analogies/</link>
		<comments>http://itknowledgeexchange.techtarget.com/storage-disaster-recovery/ibms-120-petabyte-hard-drive-causes-flurry-of-analogies/#comments</comments>
		<pubDate>Wed, 31 Aug 2011 19:46:55 +0000</pubDate>
		<dc:creator>Sharon Fisher</dc:creator>
				<category><![CDATA[big data]]></category>
		<category><![CDATA[ibm]]></category>

		<guid isPermaLink="false">http://itknowledgeexchange.techtarget.com/storage-disaster-recovery/?p=156</guid>
		<description><![CDATA[The best part about IBM&#8217;s experimental 120-petabyte hard drive is reading all the ways that writers try to explain how big it is. 2.4 million Blu-ray disks 24 million HD movies 24 billion MP3s 1 trillion files Eight times as largest as the biggest disk array available previously More than twice the entire written works of [...]]]></description>
				<content:encoded><![CDATA[<p>The best part about IBM&#8217;s experimental 120-petabyte hard drive is reading all the ways that writers try to explain how big it is.</p>
<ul>
<li>2.4 million Blu-ray disks</li>
<li>24 million HD movies</li>
<li>24 billion MP3s</li>
<li>1 trillion files</li>
<li>Eight times as largest as the biggest disk array available previously</li>
<li>More than twice the <span>entire written works of mankind from the beginning of recorded history in all languages</span></li>
<li><span>6,000 Libraries of Congress (a standard unit of data measure)</span></li>
<li>Almost as much data as Google processes every week</li>
<li>Or, <a href="http://itknowledgeexchange.techtarget.com/storage-disaster-recovery/how-would-you-move-30-petabytes-of-data/">four Facebooks</a></li>
</ul>
<p>It is not one humungo drive; it is, in fact, an array of 200,000 conventional hard drives (not even solid-state disk) hooked together (which would make them an average of 600 GB each).</p>
<p>Unfortunately, you&#8217;re not going to be able to trundle down to Fry&#8217;s and get one anytime soon. No, this is something being put together by the <a href="http://www.technologyreview.com/computing/38440/">IBM Almaden research lab</a> in San Jose, Calif., according to MIT <em>Technology Review</em>.</p>
<p>What exactly it&#8217;s going to be used for IBM wouldn&#8217;t say, only that it was &#8220;an unnamed client that needs a new supercomputer for detailed simulations of real-world phenomena.&#8221; Most writers speculated that that meant weather, though <em>Popular Science </em>thought it could be used for seismic monitoring &#8212; or by the NSA for spying on people.</p>
<p>Like the Cray supercomputer back in the day, and some high-powered PCs even now, the system is reportedly <a href="http://en.wikipedia.org/wiki/Water_cooling#Computer_usage">water-cooled</a> rather than by using fans.</p>
<p style="text-align: left">Needless to say, it also uses a different file system than a typical PC: IBM&#8217;s <a href="http://www-03.ibm.com/systems/software/gpfs/">General Parallel File System</a> (GPFS), which <a href="http://en.wikipedia.org/wiki/IBM_General_Parallel_File_System">according to Wikipedia</a> has been available on GPFS has been available on IBM&#8217;s AIX since 1998, on Linux since 2001 and on Microsoft Windows Server since 2008 and which some tests have shown can work up to <a href="http://tech2.in.com/news/general/ibm-violin-develop-storage-system-capable-of-5gbs-transfer-rate/232292">37 times faster</a> than a typical system. (The Wikipedia entry also has an interesting comparison with the file system used by big data provider Hadoop.)</p>
<blockquote>
<p style="text-align: left"><span>GPFS provides higher input/output performance by &#8220;striping&#8221; blocks of data from individual files over multiple disks, and reading and writing these blocks in parallel.&#8221;</span></p>
</blockquote>
<p style="text-align: left">The system also has a kind of super-mondo RAID that lets dying disks store copies of themselves and then get replaced, which reportedly gives the system a mean time between failure of a million years.</p>
<p style="text-align: left"><em>Technology Review</em> didn&#8217;t say how much space it took up, but if a typical drive is, say, 4 in. x 5.75 in. x 1 in, we&#8217;re talking 4.6 million cubic inches just for the drives themselves, not counting the cooling system and cables and so on. That&#8217;s a 20-ft. x 20-ft. square almost 7.5 feet high, just of drives.  (This is all back-of-the-envelope calculations.)</p>
<p style="text-align: left">In fact, the system needs two petabytes of its storage just to keep track of all the index files and metadata, <em>Technology Review</em> reported.</p>
<!-- wpms-network-global-inserts -->]]></content:encoded>
			<wfw:commentRss>http://itknowledgeexchange.techtarget.com/storage-disaster-recovery/ibms-120-petabyte-hard-drive-causes-flurry-of-analogies/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
