CommVault CEO Bob Hammer says his customers can’t get enough of data deduplication, and the vendor will give them a lot more of it when its next version of Simpana launches later this year.
During CommVault’s earnings call Tuesday, Hammer said deduplication was the major driver in the company’s 31% revenue growth last quarter. With Simpana 9, he said, CommVault will increase the scale and functionality of its deduplication while integrating source and target dedupe capability. Hammer says CommVault’s dedupe will go beyond anything on the market, “and there will be no close second.”
I spoke with Hammer after the call, and he clarified a bit.
“This will be our third-generation of deduplication, and we will dramatically increase the scale with the addition of source-side dedupe and the ability to deduplicate secondary copies and dedue directly to the cloud,” Hammer said. “Those are the major areas we’ll expand.”
Hammer says the expanded dedupe in Simpana 9 will take it closer to primary data.
“With source-side deduplication, you’re getting close to that primary layer,” he said. “We’re combining that deduplication with the ability to more intelligently manage snap copies across hardware silos. It’s not primary dedupe, but it’s close to the primary layer. We’re working with a number of hardware vendors that will be part of our release in the fall as well.”
Hammer also says Simpana 9 will deduplicate “virtualization environments across the board – at the source and in some cases at the target, but we don’t want to dedupe just at the target. It becomes an integrated seamless part of all tiers of storage, including the cloud and tape.”
Hammer says deduplication was the most popular feature of its Advanced Data Information Management (ADIM) product group, which also includes replication, virtualization, and archiving. ADIM revenue increased 91% year-over-year and 21% from the previous quarter, and made up 43% of CommVault’s $73.4 million revenue last quarter.
“It’s becoming a requirement for data management because it saves a significant amount of money on storage and reduces network traffic,” he said of data dedupe.
When CommVault released Simpana 8 with dedupe in January 2009, the vendor maintained many customers would no longer need hardware deduplication products. Hammer says CommVault now competes with more hardware vendors than software vendors for dedupe, with the main competition coming from EMC’s Data Domain.
“Our win rate remains very high, although clearly EMC is doing well with that product in its installed base,” Hammer said. “It’s still the best deduplication appliance on the market.”
While Hammer was talking up CommVault’s plans for dedupe Tuesday, Data Domain launched its DD Boost software to speed the dedupe process, At EMC World. Data Domain execs also discussed plans to integrate Data Domain’s dedupe with EMC Avamar source-based dedupe to end the target versus source debate.
Hammer also said on the earnings call that CommVault is developing technology that will bring it into a third major market to go with its current place in the backup and ADIM markets.
He wouldn’t elaborate when I spoke to him. “We’ll talk about that later in the fall,” he said.
CommVault may also reconsider its current strategy of developing all of its own technology rather than picking up IP via acquisition. When asked if that was a possibility during the earnings call, Hammer said: “In the past, we’ve dismissed it. When we plan our strategy for the next 12 months we’ll take a broader look at what we’re going to do with the company, and it may come into play. Now it’s not in the plan, but that may change.”
Big day on tap at EMC World — we were just given press releases for several announcements going out today: Data Domain software called DD Boost, integrated today with NetBackup and Backup Exec and in the second half of this year with EMC Networker; Unisphere unified management between Clariion and Celerra; and a new Ionix storage configuration tool.
DD Boost is similar to Symantec’s OpenStorage API (OST), in that it integrates the target data deduplication device with backup software, but in this case it also offloads some of the processing to the backup server as well. Per the press release: “The DD Boost software library is distributed to the backup server and identifies data segments inline as they arrive. After asking the Data Domain storage system which segments are new, it compresses and forwards only the unique segments.” Surprisingly, no mention of Avamar in this release.
Unisphere puts Celerra and Clariion management into a single pane, and includes cross-array reporting. EMC is also rolling out FAST v2 (block level) for Clariion and Celerra today, as well as new integration with VMware through VMware’s vStorage APIs for Array Integration (VAAI) and a new vCenter plugin.
Finally, Ionix Storage Configuration Advisor 2.0 “automates the validation of storage configuration best-practices in physical and virtual environments, utilizes agentless discovery…and provides detailed reports and trend analysis that improve storage change and configuration management processes.’
Stay tuned for followup reports with further detail.
EMC CEO Joe Tucci officially kicked off EMC World 2010 with a keynote speech this morning that included an obvious reference to Oracle Corp.’s “stack” offerings. “Other companies are building the whole stack,” including servers, database, middleware and storage, Tucci said (it’s the reference to database and middleware that’s the Oracle giveaway).
Tucci said this approach will lead to a siloed data center, as with previous waves of IT that now includes silos of mainframe and distributed systems at many enterprises. “If three or four vendors do it and you’re not using everything from one vendor, you’ll have stacks that run into the same problems we have today,” Tucci said. “We also have a stack,” he said, referring to EMC’s vBlock products with Cisco and VMware, “but rather than a verticalization approach, we’re taking a virtualization approach.”
Tucci also referred to EMC’s most recent Digital Universe report on data growth, and outlined EMC’s general vision for managing that growth using cloud computing and data center federation. While much of the data growth will be introduced by consumers using mobile phones that send multimedia such as photos and video, or specialized machines like medical imaging devices, some 80% of the data will still be managed by enterprises, Tucci said. “You won’t have to buy [resources] for the peaks of the year or the day When you reach the peaks, you can federate,” Tucci said.
In support of this vision, EMC announced VPlex, a new wide-area caching device that will pool data over geographic distance this morning.
In a Q&A session following the keynote, Tucci was asked about acquisition strategies given EMC’s estimated $6.5 million in free cash per its most recent 10-Q. Despite this free cash flow and the $2 billion acquisition of Data Domain last year, Tucci said he still prefers the “string of pearls” acquisition approach. So far this year, EMC has bought one company, Archer Technologies, LLC, a provider of governance, risk, and compliance software.
After Tucci mentioned EMC needs to compete better in the low end of the midrange on EMC’s last earnings call, its Iomega division came out with the ix12-300r, which blurred the lines between the top end of the Iomega line and the low end of EMC’s Clariion and Celerra lines. Would the new low-midrange products Tucci alluded to on that earnings call come from the Iomega or Clariion/Celerra side? “We will have a line with functionality from Clariion and Celerra that will be driven significantly downmarket,” Tucci said. “Iomega will remain below that.”
This morning’s big announcement at EMC World is called VPlex, which EMC says will allow for federation of data over geographic distance.
VPlex was first publicly discussed at last year’s VMWorld conference. At the time, EMC officials referred to it as “active-active” storage to support distance VMotion. The key difference between this and metro clusters is cache coherency, enabled by EMC’s acquisition of technology from YottaYotta three years ago. While the stretched array cluster remains locally “array aware” — integrating with EMC FAST, for example — it can propagate data as a distributed pool quickly enough to support running applications being VMotioned over distance.
The VPlex device is an appliance which begins at 1U and can scale up to 4U, with 32 GB cache, two quadcore processors per appliance, and can front any of EMC’s arrays. The goal, according to Pat Gelsinger, President and Chief Operating Officer, EMC Information Infrastructure Products, is to be able to front third-party arrays as well, although Brian Gallagher, President, Symmetrix and Virtualization Product Group, said those third party arrays are not fully supported yet.”
Two separately licensed versions of VPlex are available today — VPlex Local, which covers local data center data migrations, which starts at $77,000 as an up-front fee or $26,000 for subscription-based pricing. VPlex Metro is also becoming available today and will support data over over distances up to 100 km (5 ms latency) using synchronous replication.
In early 2011, officials said, EMC will release VPlex Geo, which will support “thousands of virtual machines over thousands of miles” and asynchronous replication. Finally, VPlex Global, also due out next year, will support multi-site pooling using asynchronous or synchronous replication.
Stay tuned for more on this announcement and other news from the show.
With EMC World fast upon us, announcements have begun to take on an EMC theme, including one from Precise Software Inc. that its transaction performance management software is integrated with EMC’s Fully Automated Storage Tiering (FAST) to offer transaction-by-transaction monitoring and storage tier migration.
The software has been generally available since the end of 2009 as part of the EMC Select program. Customers interested in linking critical database and other application transactions with the performance boost available from SSDs can have Precise’s software create a list of “suggestions” of what volumes and transactions could best benefit from Flash storage. An integration between Precise and Symmetrix Management Console can then ‘hand off’ that list of suggestions to FAST, which will perform the migration to higher tiers of storage accordingly. In the ‘handoff’ scenario, the storage manager would manually approve the data movement suggested by Precise.
EMC offers some application performance management through its Ionix IT Operations Intelligence products, but that monitoring is focused on the network rather than transactions,” Precise’s EVP of Products and Marketing Zohar Gilad said.
Of course, EMC FAST is far from the only automated storage tiering software currently available. Gilad said integration with other vendors’ storage tiering software is on the roadmap, but declined to disclose who else Precise might be working with.
As certification announcements go, this one is more interesting, I think, than most others, if only because it harkens back to one of the most interesting product announcements/demonstrations I saw last year.
At last year’s VMWorld in San Francisco, Cisco and VMware demonstrated distance VMotion, a technology that will be key to VMware’s vision of data center federation and fluidity between public and private clouds. However, distance VMotion as of that conference had several limitations, the most significant of which from a storage perspective is the need to migrate potentially large volumes of data over distance very quickly in order to support VMotion between data centers.
VMware said last year it will support customers if they deploy distance VMotion using the Cisco network, but its support statement included extensive fine print, including a minimum network bandwidth of 622 Mbps, or an OC12 connection.
Partners were scrambling at that time to step in to solve the data migration problem (including EMC, which was developing “active-active” storage to support distance VMotion), and some of the exhibitors on the show floor, including NetEx and F5 Networks, claimed to be able to solve the problem today. At the time, however, no WAN optimization products were certified for distance VMotion with VMware.
Today, NetEx announced certification of its HyperIP software as VMware Ready, which according to a press release means “HyperIP integrates consistently with VMware technology and is ready for deployment in customer environments.” The press release doesn’t mention distance VMotion specifically, but a NetEx spokesperson said a large oil and gas company has deployed the software for distance VMotion. That customer is not open to taking questions from press, the spokesperson said.
According to IDC’s 2010 Digital Universe report, digital data grew 62% last year as 800,000 PB were added. IDC says 1.2 million PB (1.2 zettabytes) will be added this year, and that will increase to 35 ZB in 10 years.
While those numbers may look staggering on a page, they probably don’t shock anybody charged with managing data storage. The real shocking – and frightening – number is that IDC says the amount of IT staff to manage all this data will only grow by a factor of 1.4 by 2020. If IDC is correct, than the dreaded “do more with less” mantra will become a long-term way of life.
So how will this all change the way we manage data? Chuck Hollis, global marketing CTO of EMC – which sponsored the IDC study – says the data growth will push a lot more of it to the cloud this year. Hollis says the IT staffs at large enterprises that he talks to are ready to set up private clouds to manage data.
“For tech guys, this is the year of putting your cloud strategy together,” Hollis says. “We’re way beyond the ‘What is the cloud?’ discussion, and it’s a very mature discussion with the IT guys I talk to.
“The larger enterprises say, ‘We’re big, we can do this ourselves. We can build a private cloud behind the firewall and get comfortable with it.’ They’re saying, ‘We pay the same price for this stuff – the processors, server, storage – there’s no reason I can’t do what Amazon does.’”
Hollis says as long as organizations feel they can control their data in the cloud, they’re willing to move it there.
“The cloud works when enterprise guys can be in control,” he said. “Ask them to give up control, and it’s not that attractive a proposition for them. You can’t outsource responsibility and accountability. In financial services, a trillion dollars a day floats around the global economy over the cloud. Most days we’re OK with that. Clouds, schmouds, it doesn’t matter as long as enterprise guys feel they’re in control.”
Other emerging methods of managing growth aren’t quite as mature, Hollis says. That includes data deduplication for primary data. While EMC is now the leader in backup dedupe, Hollis says the success of primary deduplication “has a lot to do with processors being fast enough to do it without impacting performance. If you have a SAP application with 10,000 demanding users, maybe it [deduplication]’s a false savings. The concern is, at what cost? The technology gets better year over year, but some are of the opinion this is just a temporary fix, you’re just buying yourself some time. A lot of information is not compressible, like JPEGs. You can’t compress something that’s already compressed.”
Flash solid state storage is another area where EMC has been out front, but it’s another technology where the greatest benefits are still down the road. “If you take what processors have done in the last 10 years as far as density, price and performance, then start with flash in 2010 and forecast it out in 10 years, it could actually get cheaper than disk,” Hollis said. “That would be an interesting world.”
FalconStor Software officially reported revenue Thursday, confirming what if first said in a preliminary report April 19 – it had a lousy quarter.
FalconStor’s $17.1 million in revenue was down from $21 million a year ago, and it lost $5.5 million compared to a loss of $900,000 in the same quarter last year.
FalconStor’s problem is it sells mainly through OEM partners, and its largest partnerships were disrupted last year. Its biggest OEM partner, EMC, bought Data Domain and now sells more Data Domain data deduplication boxes and fewer of its VTLs that use FalconStor software. So FalconStor revenue from EMC declined $300,000.
Sun is another partner, but Sun was in the process of getting acquired by Oracle for most of 2009 and its been unclear which of its products would survive the acquisition. FalconStor revenue from Sun dropped $1.1 million last quarter. FalconStor also took a hit when Hewlett-Packard acquired FalconStor partner 3Com, although FalconStor executives say they expect a rebound now that 3Com is integrated into HP. Another FalconStor partner, Copan, effectively went out of business last year before SGI acquired its assets and resurrected its archiving product.
FalconStor says it will cut spending and has imposed a hiring freeze until it becomes profitable again. More importantly, it is finding new OEM partners. As VP of business development Bernie Wu put it, “We had an unusually high level of disruption with our OEM partners last year, and we’re forming a new foundation of partnerships.”
FalconStor executives say they expect to launch two new Tier 1 OEM deals late this year. One will be for a cloud services offering. They didn’t such much about the other, but one possibility is a deal with Hitachi Data Systems for FalconStor’s File-interface Deduplication System (FDS) software.
HDS so far has a piecemeal approach to backup data deduplication. It resells IBM Diligent ProtecTier, but doesn’t push a product owned by its rival IBM. HDS salespeople have financial incentive to sell the new Sepaton VTLs built on HDS disk, but there’s no formal reseller deal. HDS OEMs CommVault’s Simpana that includes deduplication and certifies FalconStor’s dedupe, but lacks one main dedupe product.
During the earnings call Thursday, Wu said FalconStor had a “significant pipeline” with HDS for the FalconStor software it resells and “we expect that partnership to deepen.”