Adventures in Data Center Automation:

June, 2008

Jun 30 2008   8:38PM GMT

Month in Review - June 2008



Posted by: Ryan Shopp
DataCenter

We started getting back on track this month with 4 opinion pieces.

Performance and Availability vs. Analytics - Part 5 of 5

So Let’s Talk a little about Traffic Flow Reporting & Analysis

Mapping HP Software to the Data Center Automation Blueprint - take 2

NetQoS goes even deeper

In the news we saw HP making a splash with new announcements during it HP Universe show, BMC & NetQoS announced some acquisitions, Apptio came out of stealth mode and  a variety of vendors had new updates/releases for current products.

For July, I have some more thoughts to talk about in the Traffic Flow Reporting & Analysis space, also I need to take another pass at the DCAB to incorporate the updates from the Performance and Availability vs. Analytics series.

Jun 23 2008   3:00PM GMT

So let’s talk a little about Traffic Flow Reporting and Analysis



Posted by: Ryan Shopp
DataCenter, Alcatel-Lucent, Compuware, Accellent, Application monitoring, HP Software, InfoVista, NetScout, Solarwinds, Network monitoring, Packet Design, Performance management, NetQoS, Opnet, Xangati

Next up, I plan to dig into this sector a little deeper (as always from a purely data center centric perspective - aka no End-User Monitoring that requires a desktop agent).

The priority for these products is to provide an end-to-end service/application perspective on traffic performance and capacity. The goals; help quickly troubleshoot from an application or end-point perspective OR better understand what/where traffic levels are going across the infrastructure. All this from a network-centric control point (no loading of agents on a server or client - since the network team doesn’t own the responsibility for those).

So on the surface I see two main categories (each has subcategories that I’ll dig into during follow-up posts)

Flow Reporting-centric (these vendors gather Cisco NetFlow, J-flow, sFlow from infrastructure agents and report in various ways)

  • Netscout, Solarwinds, CA eHealth, NetQoS, Mazu Networks, Xangati, InfoVista, Opnet, Lancope, Packet Design, Q1 Labs. Alcatel-Lucent VitaNet, HP Performance Insight - to name a few

Flow Self-Collection & Reporting (these vendors span/tap actual traffic flows and report in various ways)

  • NetQoS, Mazu Networks, InfoVista (through acquisition of Accellent), Lancope, CA Wily, Q1 Labs, Compuware - to name a few

I quickly notice now that many of the vendors actually support both - which I assume is about flexibility as some customers don’t have NetFlow type capabilities enabled or don’t wish to enabling them for a variety of reasons.

So my first set of questions/experiences I’m now reading/researching about are:

1) What are the key benefits to going the self-collection route over the Reporting only route? Unique metrics? Scalability? Limitations around NetFlow (e.g., Performance)

2) When it comes to reporting only using Netflow, etc - what metrics are being used these days.

I remember first integrating and being able to report on RMON2 probes and early Cisco NetFlow data back in 2001 within the Lucent VitalNet product…so where are things 6 years later now that NetFlow is much more pervasive and I’m sure improved.

My assumption on some of these are as follows (vendors & users please leave comments to help educate me for my follow-up posts),

When it comes to reporting, there are historical/capacity centric reports & their are real-time/troubleshooting centric views. My assumption (again, currently an assumption..I haven’t read to much on this topic yet) is most the reporting centric vendors (that don’t also offer their own passive flow monitoring capability) are focused more on those historical/capacity reports (e.g., eHealth, Solarwinds, InfoVista). These reports are how much data is going where and what type of data is it over a day/week/month etc. Once this data is archived, they slide & dice in a variety of ways. But, basically it’s about looking at it for trends over time.

Now, when it comes to real-time, since so much data is coming in so quickly their needs to be extra intelligence/automation helping out - building a “what looks normal” model and then focusing on identifying and then alerting someone when something “odd” is noted. Of course, they need to store/report on much of the same data as the historic/capacity centric products as they build credibility and trust in their users.

So when it comes down to it..much of the same data is being used for 2 unique users…one focused on planning improvements and the other focused on quickly resolving issues. So now that I’ve finished writing this post a better way to probably organize the field of play is not by technology (NetFlow vs. Self-Collect) but by usage. I’ll read some more and do that next time.

Another angle to ponder on this topic will be around the WAN acceleration/optimization vendors…but again, for another day.


Jun 20 2008   1:31AM GMT

links for 2008-06-20



Posted by: Ryan Shopp
DataCenter


Jun 19 2008   1:31AM GMT

links for 2008-06-19



Posted by: Ryan Shopp
DataCenter


Jun 18 2008   1:35AM GMT

links for 2008-06-18



Posted by: Ryan Shopp
DataCenter


Jun 17 2008   10:41PM GMT

Performance and Availability vs. Analytics - Part 5 of 5



Posted by: Ryan Shopp
DataCenter, Analytics, CMDB, DCAB, IT Process Automation, BSM

Finally, the last installment of this 5 part series (which was originally a ? part series). This last segment took a little more time then expected to get to. These days you can find BSM definitions and products all over the place. So the question I’ve been asking myself is BSM different then Analytics (as defined in our Data Center Automation Blueprint).

First up, here are some of the best definitions of Business Service Management:

  • Business Service Management is about enabling IT operations and support staff with empowering information that helps them to understand the impact on the business in business terms
  • Business Service Management is the integration and consolidation of systems management with business management
  • Business Service Management is about understanding the business perspective also known as the “top down perspective”. What value, revenue, cost, churn, ROI, etc. can be associated with the IT services, applications, processes, transactions, etc. being delivered and supported by your IT organization?
  • Business Service Management is an IT operations management software product that links the availability and performance status of IT infrastructure components to business-oriented IT services that enable business processes
  • Business Service Management metric might look at the dollar impact of server downtime as opposed to an ITSM (IT-focused) metric that identifies the percent uptime for the same server
  • First-generation BSM solutions offer:
    • a way of defining and describing business processes;
    • discovery (partly manual, partly automatic) of IT service components;
    • mapped (partly manual, partly automatic) business processes to IT components;
    • adapters to other infrastructure management products;
    • Measure end-to-end performance for business processes;
    • Measure the business impact of downtime;
    • analyz the root causes of incidents resulting in downtime;
    • provided dashboard views so that selected target audiences can combine relevant information.

In part 4, I defined the Data Center Automation Blueprint’s analytics category as “a roll-up aggregation view of metrics that are mapped to the business metrics and goals.”

So from what I’ve heard/read etc, Analytics is a major component/subset of BSM…but BSM isn’t specifically or just analytics.   BUT, if you combine (from the DCAB) Resource Reconciliation, Process Orchestration and Analytics into one bundle things are looking very, very close to BSM as I read it.  With that said, going forward we will continue to watch/discuss BSM and it’s specific applicability to the data center.

To jump back to any of the previous topics in this series follow the below links:

Part one covered data collection
Part two covered applying analytics and business/service mapping to those collection points
Part three covered evolving the Data Center Automation Blueprint from Performance & Availability to Service Assurance.
Part four coverd the term analytics and how it’s applied as a standalone category and within categories of the DCAB.

Next action item would be a couple key updates to the Data Center Automation Blueprint.


Jun 11 2008   1:34AM GMT

links for 2008-06-11



Posted by: Ryan Shopp
DataCenter


Jun 10 2008   11:33PM GMT

links for 2008-06-10



Posted by: Ryan Shopp
DataCenter


Jun 7 2008   11:35PM GMT

links for 2008-06-07



Posted by: Ryan Shopp
DataCenter


Jun 5 2008   11:36PM GMT

links for 2008-06-05



Posted by: Ryan Shopp
DataCenter