A funny thing happened on the way to Backblaze’s automated backup product — it sort of turned into a storage design company.
The company has been known for some time for its storage designs, which, instead of making real real big storage, uses a whole whole lot of commodity storage devices hooked together into “pods,” with as much of the extraneous stuff stripped off as possible. This reduces costs and is more scalable than large storage systems that require forklift upgrades to be expandable.
Backblaze has been getting so well known for its storage system that other companies, such as Netflix, have taken to using it as well, and several vendors have started selling storage systems based on the Backblaze designs.
The system has its flaws — such as, if the company has trouble finding commodity disk drives — but in general it works pretty well. (Facebook has also taken to designing its own disk drives, as well as servers, for a similar reason: economies of scale make it more efficient to design its own hardware.)
The system works so well that the fact that Backblaze has designed a new generation of the storage pods it uses has itself made the news, because so many organizations — Vanderbilt University, Crispin Porter + Bogusky, Rensselaer Polytechnic Institute, NASA’s Jet Propulsion Laboratory, and Shutterfly, along with Netflix — have been using the Backblaze designs.
“In the world of high-volume storage, we’ve come to a place similar to the PC market decades ago when it was cheaper to just buy the parts and build your own than it was to buy a pre-assembled computer,” writes GigaOm’s Derrick Harris.
Version 3.0 of the storage pods now have a capacity of up to 180 TB — up from 135 TB, because they’re based on 4 TB, rather than 3 TB, commodity drives. In addition, a number of the other components have also been replaced. The result is a drive that is more reliable, easier to manage — and cheaper than the 135 TB second-generation systems it replaced.
Backblaze also releases the specs of the system — including a parts list, prices and all, right down to the screws, as well as very detailed instructions — to enable other companies to use its designs as well. That is, if they can. “To obtain these prices we do purchase them in quantity,” Backblaze warns.
Disclosure: I am a BackBlaze customer.
House on fire image via Creative Commons
We’ve all heard how “three moves are equal to one house fire” (a sentiment so old it’s attributed to Ben Franklin’s Poor Richard’s Almanack in 1757). Apparently the modern equivalent is that “three migrations equal one disk crash.”
That’s the finding of a December 2012 survey by Gridstore, which found that, for medium-sized businesses during a major storage upgrade:
- 55% experienced business and end user disruption
- 32% experienced failed upgrade process where migrations didn’t complete
- 9% lost data or access to data during the migration
While there’s admittedly problems with the survey methodology (Gridstore surveyed only its own customers, we don’t know how many were asked, we don’t know how many responded, we don’t see a full copy of the survey), one thing seems clear: Migrations are fraught.
And it’s not just the migration itself, but preparing for one. The survey found that businesses preparing for purchasing new storage faced either high business disruption or stopping business for forecasting capacity (almost 40%), slow or complex purchasing cycles (almost 25%), requiring multiple approvals (almost 35%), and complexity or time required to evaluate (almost 30%).
On the other hand, not doing anything wasn’t any better, with respondents reporting that, before an upgrade, they suffered consistent disruptions from limited performance (35%), limited bandwidth (about 15%), limited capacity (about 15%, with another 10% reporting it as a “putting out fires” situation), or reliability (about 25%).
What particularly makes this a problem is that midsized organizations are reportedly doing an awful lot of migrating — 71% of the survey respondents reported that they were adding storage and capacity every 6 to 12 months.
Next time you plan an upgrade, maybe you’d better add a fire extinguisher.
What is storage?
“The privacy group is filing an amicus brief asking the high court to accept an email privacy case from South Carolina that’s exacerbated confusion over what courts consider electronic storage,” writes the political journalism site Politico. “In the filing, submitted on behalf of nearly 20 privacy advocates, EPIC tells the Supreme Court that email privacy rules and definitions have become increasingly unclear, thanks to the rise of cloud computing, and Congress has yet to step in to fill the gap.”
The whole issue of what “storage” is became an issue last fall, when a South Carolina Supreme Court ruled that, under the Stored Communications Act (SCA), email in a Yahoo! account should not be considered protected from unauthorized access because email sitting in the cloud was not “stored” the same way as it would be sitting on one’s own computer — which was protected.
This means that was also true for anyone who uses a cloud-based email system — not just Yahoo, but also Gmail and a plethora of other systems. Not to mention some components of the federal government itself that have moved to cloud-based email, EPIC notes in its brief.
The original case was a domestic dispute — a husband was cheating on his wife, and the wife’s daughter-in-law figured out the husband’s e-mail password and logged in to his personal account to read the e-mails between the husband and his paramour, wrote Orin Kerr in The Volokh Conspiracy legal blog. “The daughter-in-law found the e-mails and shared them. The husband filed suit under several laws including the Stored Communications Act, 18 U.S.C. 2701, which only allows a civil suit if the e-mails accessed were in “electronic storage.””
The Supreme Court may get involved because this decision conflicts with a similar case by the Ninth Circuit Court in 2004, wrote Andrew Hoffman at the Information Law Group blog.
“The Jennings opinion establishes a split with the Ninth Circuit’s opinion in Theofel v. Farey-Jones, 359 F.3d 1066 (9th Cir. 2004), which found that emails that had been received, read, and left on the server were stored “for purposes of backup protection” and therefore within the ambit of the SCA,” Hoffman wrote.
This is a problem because it’s not good for different courts to have different ideas of what does and doesn’t constitute a legal issue, Hoffman wrote. “Thus, until the split of authority is resolved, the same conduct will disparately subject some individuals to civil liability, depending on the interpretation of the SCA applied by the court. Such disparate interpretations could create an incentive for forum shopping and pose conflict of law questions, when multiple states (and even nations) could be involved in an email hacking case. Such disparate interpretations may also pose problems for employers investigating suspected employee misconduct involving webmail.”
Just to show how confused the South Carolina court was, its judges couldn’t even agree on why the email wasn’t stored, but instead had three different opinions, Kerr wrote.
Aside from the issue of protection, the issue of defining what storage is is important because it is the primary difference between the Stored Communications Act — the law under which the original suit was filed — and the Electronic Communications Privacy Act, according to EPIC.
A related question is “What is a backup?” because some of the legal arguments also hinged on whether the email retrieved from the account was the “only copy” or a backup — a question that is kind of irrelevant in cloud storage, which may feature multiple replicated copies of data, EPIC writes.
“A wealth of personal and private messages are now stored remotely in the cloud, and their protection depends on the interpretation of ‘electronic storage’ under ECPA,” EPIC writes.
Optical discs are either dead, or better than ever. It depends on whom you ask.
They’re dead according to CNN, which points to the fact that some recent Apple computer no longer have optical disc drives, that of course tablets and smartphones don’t have them, that vendors such as Sony no longer even make them, and that increasingly people are downloading music, movies, and software fro the cloud.
“Over time, an optical disc will be as much of an historical curiosity as a floppy disk,” CNN quoted Michael Gartenberg, a tech-industry analyst with research firm Gartner Inc., as saying.
CNN also quoted a piece by Chris Pirillo, founder of blogging network Lockergnome, as saying, “While it may be too early to say for certain that the optical drive is absolutely dead, it is certainly showing all the early warning signs of a technology that is past its prime.”
On the other hand, not so fast, maybe.
Pirillo’s piece was written two years ago. “As this trend continues, it’s unlikely that optical drives will be included on game consoles at all,” he predicted. Yet as recently as this month, analyst firm IDC said it didn’t see optical drives leaving gaming consoles anytime soon.
“Discs will remain the console game revenue mainstay for years to come,” said Lewis Ward, research manager of IDC’s Gaming service, in Games Industry International. “With the advent of eighth-generation consoles, starting with the Wii U, historical norms strongly imply that game disc revenue will stop bleeding in 2013 and rise substantively in 2014.”
Okay, you say, that’s fine for games, but what about business users? While some new machines may no longer include optical drives, they continue to be a medium of choice for archival storage.
Moreover, vendors are still announcing new innovations in optical storage, such as Millenniata, which earlier this month announced it will offer ulltra-durable Blu-ray M-Discs in the second quarter of 2013 that will store five times as much data as a standard disc and yet be able to be read and written in a standard Blu-ray player. The drives are even being included in some Acer computers.
Similarly, vendors such as JVC and IoSafe are releasing archive-quality optical discs for use in specialized verticals such as law enforcement, noted Tim Dees of PoliceOne.Com. The large amount of storage provided by optical drives is necessary to store video files, he said.
While it may be true that optical drives are on the way out, they’re obviously far from dead.
Now that everyone’s used the holidays to digest the news about HP and Autonomy, HP is reportedly receiving offers to purchase the troubled UK e-discovery company — but people in the know say HP won’t go for it.
HP sort of brought all this interest upon itself by saying in a December 27 10-K regulatory filing that it would consider divesting some businesses “that may no longer help us meet our objectives.” Consequently, HP has received expressions of interest from several high-tech companies, but company executives aren’t interested, according to the Wall Street Journal. And reporters such as Arik Hesseldahl from AllThingsD say that the phrasing was due to the lawyers and had nothing to do with the company’s real intentions.
And, you know, honestly, that’s the news. More than 80 news stories, and that’s all there is. Everything else is speculation.
Analysts can’t even decide whether selling Autonomy would be hard, with some of them saying it would be easy because the acquisition is recent and the company was fairly, uh, autonomous under HP (one of the criticisms against it, actually). Others are saying it would be hard because HP is involved in litigation against Autonomy.
So far, most of the evidence does seem to indicate that HP is not going to sell Autonomy. After all, HP held a conference call with analysts a few days ago talking about its plans for the company, and would it do that if it were planning to sell? the proponents of that side said. The new head of Autonomy also reportedly recently told staffers that HP intended to hire at least 50 new engineers for Autonomy in the next year.
In fact, some believe that HP might be planning to sell its printing unit instead. There have been rumors that HP is receiving offers for EDS as well.
On the other hand, HP’s stock went up after the rumors started, and boards of directors tend to pay attention to such things. But analysts quoted in the Financial Times said HP might get only “pennies on the dollar” if it did sell Autonomy.
You know what they say about guys with big thumbs.*
The Consumer Electronics Show was this week in Las Vegas, and while I didn’t go (do you know how much walking around that involves?) it wasn’t hard to figure out that, with all the music and movies and pictures and so on that consumers are, well, consuming these days that — cloud or no cloud — storage is becoming increasingly important. Plus, to make the CES even more enticing to storage vendors, there’s a whole little show within a show, Storage Visions, held just before CES. “Petabytes are the new terabytes!” it proclaimed.
Besides, CES is typically a hardware show. It’s always been all about the gadgets, and the more extreme the better: bigger, smaller, harder, softer, faster.
Which brings us to the Kingston DataTraveler HyperX Predator 3.0, a thumb drive that holds a terabyte of data.
A. Terabyte. On. A. Thumb. Drive.
You know, just a year and a half ago I was whining because I wanted a terabyte on a laptop. Now I can put a terabyte in my pocket? (Despite its name, it doesn’t seem to have any particularly aggressive features.)
It’s expected to ship sometime this quarter for an undisclosed price, but as the New York Times points out, a 512-GB version available now costs $1,750. (In comparison, a 1-TB internal or external drive is less than a hundred bucks these days, and I still remember when a 10-MB hard drive was as big as the PC and cost as much and by the way, what are you kids doing on my lawn?)
Of course, as people have pointed out, such a gigundo thumb drive has its own problems, not the least of which is what happens and you run it through the laundry (my big fear), lose it out a hole in your pocket, or get a virus on it.
As much as the size, people were also impressed by the speed — which reportedly reads at 240MB/s and writes at 160MB/s. This is because it uses USB 3.0, which itself is expected to have its speed doubled, to 10 Gbps, in a year or so.
That wasn’t all the storage news at the show, but of course most other announcements pale in comparison:
- Seagate announced the Seagate Central network home storage system — which provides 4TB of automatic backup for the home, access to movies, music and photos from networked devices, as well as remote access — and Seagate Wireless Plus mobile storage, which can stream up to 10 hours of 500 HD movies to up to 10 devices over wireless.
- HP announced a similar unit, the Pocket Playlist — 32 GB for up to 5 devices — for $129, due in the middle of next month.
- LaCie announced BladeRunner, a pretty pretty 4TB external drive for $300, where “the warm interior electronics are encased in a mystifying shell, and the blades are the radiator that cools it down,” according to the designer. Um, okay. But’s Limited Edition!
- Beam me up, Scotty. With the Transporter, you back up your files to it, and then it backs them up to other Transporter units that your friends own. This model seems to be predicated on wide penetration. Good luck with that.
- Buffalo Technology announced some new network attached storage devices.
*What do they say about guys with big thumbs? Why, big thumbs, big gloves. What did you think I was going to say?
You know how after Christmas, you go out and buy yourself all the stuff nobody gave you? Apparently Imation feels the same way; it just dropped $120 million on Nexsan.
Imation makes storage products, particularly media, for the enterprise, while Nexsan makes storage hardware and software for small to medium businesses. So really, this is not so much an acquisition about synergistic technology but simply one of one company buying another for its products and customers — 11,000, in Nexsan’s case.
The other advantage is that Nexsan is doing much better financially than Imation, which according to VentureBeat has failed to make a profit for the last five of its six years and restructured in October. Nexsan, on the other hand, had revenue of $82 million in 2011, has gross margins in the 40% range, and was close to breaking even on net earnings, according to the Star-Tribune. In comparison, Imation had $1.3 billion in sales and a 21.3% gross margin.
It’s not all one-sided; Imation, which has an international presence, is expected to provide a broader market for Nexsan, which is primarily in the U.S., Canada, and the U.K. The move also provides an exit strategy for Nexsan investors, which according to Forbes includes MFP Partners, a fund controlled by the legendary mutual fund manager Michael Price; RRE Ventures; VantagePoint Venture Partners; and the Fonds de Solidaritie des Travailleurs du Quebec, a fund managed by the Quebec Federation of Labor, the largest union in Quebec. The company had planned an IPO — after two failed attempts in 2008 and 2010 — but withdrew it in April.
Imation is based in Oakdale, Minn., while Nexsan is based in Thousand Oaks, Calif., where it will continue to reside after the acquisition. Imation has 1,100 employees, 400 of which are in the Minneapolis area, while Nexsan has 200, according to the Minneapolis Star-Tribune.
The other interesting aspect of the acquisition is how it’s being financed. It was funded by about $105 million in cash and 3.3 million Imation common shares, equivalent to about $15 million, to make the total of $120 million. But the $105 million was about 56% of the company’s cash on hand, and the company had to issue new stock for the shares, which meant it diluted the value of existing shares.
The two companies will determine integration and global expansion activities between Imation’s Tiered Storage and Security Solutions business and Nexsan in the first quarter of 2013, NASDAQ said.
IBM announced earlier this month that it was acquiring StoredIQ, but exactly what the company does isn’t quite obvious. Part big data, part e-discovery, it’s sort of neither fish nor fowl.
As an example, analyst firm IDC included the Austin, Texas-based StoredIQ in its IDC MarketScape: Worldwide Standalone Early Case Assessment Applications 2011 Vendor Analysis, but Gartner hasn’t included it in either of its e-discovery Magic Quadrants — from which a number of larger vendors have plucked other acquisitions. (However, Gartner did name StoredIQ as a “Cool Vendor” in April of this year.”)
Instead, IBM is working on creating a family of “information lifecycle management” applications, which are kinda both — big data, because it covers all an organization’s data, but also e-discovery, because part of the reason for having such applications is for litigation support, both for identifying data needed in legal situations but also to help reduce, in a legally justifiable way, the amount of such data in the first place.
StoredIQ’s advantage is that it manages the data in situ rather than by moving it to a secondary location, which saves the cost of the secondary storage, noted Zacks Equity Research, adding that the company had received $11.4 million in funding in August and had 120 clients — though it warns that IBM faces competition from vendors such as EMC, Oracle, and SAP.
The company has also been working to make its product, which includes software and an appliance, easy enough for even legal professionals to use, rather than requiring IT people to operate. In addition, it has partnered with a wide variety of other vendors over the years, including NetApp, EMC, and NewsGator, and supported a number of formats, including SharePoint and Office 365.
As big data has become more prevalent, companies are interested in saving their data in hopes of being able to analyze it at some point and improve their businesses. But what it calls data hoarding is a problem for two reasons, notes Law Technology News.
First, there’s the cost. Though the price of storage itself has been dropping, it still costs something, plus there’s the cost of managing it, backing it up, and so on — which could amount to $5,000 per terabyte, Law Technology News said.
Second, there’s the legal cost. Should an organization be sued, it not only needs to provide all the pertinent information that the other side asks for, but it has to find it in the first place — and the more data a company has, the more expensive that search is. Also, companies have to balance the value of the data for analysis with what it might cost them should it reveal something in a lawsuit. This cost is on the order of $15,000 per gigabyte, Law Technology News said.
In fact, legal organizations have been advising companies to look for opportunities to delete data, pointing out how much money they can save. However, they have to do this in a regular fashion, because once a lawsuit is filed, a “legal hold” is put on the data and it can’t be deleted, or a company is subject to large fines.
The acquisition becomes part of IBM’s Information Lifecycle Governance suite, headed by Deidre Paknad, vice president of Information Lifecycle Governance. Paknad had been CEO of PSS Systems Inc. in Mountain View, Calif., a pioneer in the e-discovery space, which itself was acquired by IBM in 2010. The group also includes Vivisimo, which IBM acquired earlier this year.
The acquisition was not a surprise; IBM had partnered with StoredIQ for two years. As is typical for IBM, it did not reveal the cost of the acquisition. It is expected to be finalized in the first calendar quarter of 2013.
It shouldn’t be any surprise in this incident, about which nothing makes any sense, but it isn’t clear what the status is of Adam Lanza’s computer hard drive, which was smashed/damaged/destroyed by a hammer/screwdriver/sharp object that left data on it irretrievable/able to be recovered, according to which publication you read and which data forensics expert they consulted.
Here’s a breakdown of the issues involved.
Was the disk drive solid-state or traditional spinning disk? There has been increasing use of solid-state drives in computers, either due to interest in improved performance or in reaction to last year’s Thai flooding, which damaged a number of hard disk manufacturing plants and made spinning disk storage more scarce and expensive.
What’s the difference? While both kinds of drive are susceptible to damage — as anyone who’s lost a drive by dropping it knows — solid-state drives are even more susceptible to damage.
“Many SSD hard drive failures are in fact unrecoverable,” writes The Inquisitor. “If the remapping tables that keep track of data in memory cells get trashed the data is effectively randomized and mixed up with data blocks which were marked as corrupted and unusable even before the SSD failed. Many SSD models also come with internal encryption that will make the lives of data forensics experts difficult.”
If it was a spinning disk, how was it damaged? For the sake of argument, though, let’s assume it’s a traditional spinning disk drive. Then the question becomes, how was it damaged? Neither reporters nor crime investigators are necessarily computer experts, and the descriptions of the damage have been vague — they don’t even specify whether Lanza had a desktop or a laptop.
Some reports indicate that Lanza removed the hard drive from the computer before damaging it, which would make it more likely that the drive itself would actually have sustained damage.
But because the platters in the hard drives that hold data are so sensitive, manufacturers tend to do what they can to protect them. Consequently, depending on how the hard drive was damaged, the platters inside could have been anything from undamaged to shattered.
How could the data be retrieved from the damaged hard drive? There are all sorts of third-party data recovery services, and chances are the FBI — which has plenty of forensics chops itself — is talking to all of them about the best way to retrieve data from whatever remains of the platters, as well as, more than likely, the manufacturer of the drive itself. Even if the platters were shattered, they could conceivably be reassembled and at least partially read.
“The level of detail they can rip out of systems these days seems incomprehensible to most people,” Rob Lee, a forensic specialist who has examined computers seized from terrorists for the U.S. intelligence community, told the Washington Post, which wrote in detail about the various ways data could be recovered. Even data from the crashed space shuttle Columbia was nearly 100% recoverable, the article noted.
Is the data available anywhere else? Even if all the data on the drive itself is irretrievable, it might be available else, ranging from a backup, to a synchronization service such as Dropbox, to obtaining copies of data and other information from sources such as Lanza’s Internet service provider, email services such as Google, or his online gaming records.
“Many e-mail providers, such as Yahoo and Google, store data on their servers for a period of time, meaning that police might be able to subpoena Lanza’s provider for access to whatever data they have,” writes the Christian Science Monitor. “Google also stores information about users’ searches and other online activity indefinitely, although it anonymizes IP addresses after 9 months, making it impossible to tell what a given user was doing online prior to that time.”
While there has been increasing concern from civil liberties organizations about the amount of information that services collect and to which law enforcement organizations have access, in this particular case, it may be our best hope in trying to make some sort of sense of this tragedy.
What it takes is enough motivation and the right equipment — and the F.B.I. has both, writes Popular Mechanics.
Somewhere along a long, nondescript brick wall, there’s a little spot that’s different from the rest. Poking out from the rough surface of the wall is the half-inch extension of a USB flash drive. You connect it to your computer, upload or download files, and you’re on your way, with no one the wiser.
We learned about “dead drops” (at least, those who didn’t know about them already) a few weeks ago with General Petraeus got caught exchanging messages with his mistress by leaving messages in draft form in a shared Gmail account. But there’s another kind that offers a lot more possibilities — and risks.
It all started in October, 2010, when Berlin-based media artist Aram Bartholl came up with the idea as an art project: Install a USB flash drive in a wall, and people could freely upload and download art from it. He started out with five USB dead drops in New York, and posted a website with instructions, including an instructional video.
“Dead Drops is an anonymous, offline, peer to peer file-sharing network in public space,” reads the Dead Drop Manifesto. “Anyone can access a Dead Drop and everyone may install a Dead Drop in their neighborhood/city. A Dead Drop must be public accessible. A Dead Drop inside closed buildings or private places with limited or temporary access is not a Dead Drop. A real Dead Drop mounts as read and writeable mass storage drive without any custom software. Dead Drops don’t need to be synced or connected to each other. Each Dead Drop is singular in its existence. A very beautiful Dead Drop shows only the metal sheath enclosed type-A USB plug and is cemented into walls.You would hardly notice it. Dead Drops don’t need any cables or wireless technology. Your knees on the ground or a dirty jacket on the wall is what it takes share files offline. A Dead Drop is a naked piece of passively powered Universal Serial Bus technology embedded into the city, the only true public space. In an era of growing clouds and fancy new devices without access to local files we need to rethink the freedom and distribution of data.”
The idea exploded, and soon there were USB flash drives poking out of walls (and dogs) all over the world. Srsly, there’s more than 1100 of the things out there, according to the most recent map, ranging from New York to Toronto (where it contains porn and recipes) to New Zealand. (And those are just the public ones.) There’s also apps to tell you where Dead Drops are, as well as a Flickr set and a Twitter feed. (In addition, there’s wireless ones and DVD ones being set up as well.)
Certainly the serendipity of these little data glory holes is high. It’s basically superduper high-tech geocaching. Just think of the data, good and bad, that could be exchanged: Pictures, movies, building plans for terrorists, porn, Anonymous plans, Wikileaks data… They’re even being used to generate fiction. Honestly, I’m surprised it hasn’t shown up in a Will Smith movie yet.
Needless to say, the whole process, like any USB stick, is fraught. What keeps people from downloading something like a virus (which was raised as a concern almost immediately) or child porn onto their laptops? (I cringe every time I see a picture of someone with their laptop plugged into one of these things, and hope that at least it’s a junk laptop devoted to the purpose.)
For that matter, what keeps someone from uploading a virus, and from there spreading it around the world? Recall that the Stuxnut virus was spread through USB flash drives enticingly scattered around. Set up something like this at Burning Man with a virus and you could shut down all of Silicon Valley by mid-September.
On the other hand, in a day and age where governments are shutting down the entire Internet in their countries, the notion of a way for rebels to exchange information in this clandestine way sounds pretty darn cool. What a great way for Mr. Phelps to get information — though of course you’d have to make sure that the government hadn’t set up its own USB dead drop to try to catch you. Or for people trapped in a country to get information outside the country — post a code message to Twitter and wait for someone with a tablet and a USB port to come along.
Or maybe I’ve just seen Red Dawn too many times.