What Happens when you pair Netezza with DB2 for z/OS?

The American Association of Retired Persons (AARP) recently paired Netezza with their transactional environment, which includes DB2 for z/OS, and achieved remarkable results. Often when you read customer success stories, you are bombarded with metrics. The AARP success story has those metrics:

  • 1400% improvement in data loading time
  • 1700% improvement in report response time
  • 347% ROI in three years

But metrics tell only part of the story. And sometimes the story gets a lot more interesting when you dig a little deeper.

AARP had been using Oracle Database for their data warehouse. But their system simply could not keep up with the demand. As Margherita Bruni from AARP says, “our analysts would run a report, then go for coffee or for lunch, and, maybe if they were lucky, by 5:00 p.m. they would get the response—it was unacceptable—the system was so busy writing the new daily data that it didn’t give any importance to the read operations performed by users.” The stresses on their system were so great that in 2009 alone, their Oracle Database environment had more than 30 system failures. To compound matters, these system performance issues meant that full backups were not possible. Instead AARP would back up only a few critical tables, which is a less than desirable disaster recovery scenario. Clearly, something had to be done.

AARP chose to move their 36 TB data warehouse to Netezza. You can see from the metrics above that they achieved remarkable performance improvements. But what do those performance improvements mean. Well, for the IT staff, they mean that they are relieved of a huge daily burden. Their old system required one full-time database administrator (DBA) and one half-time SAN network support person. These people are now, for the most part, free to work on other projects. And more importantly, they don’t have to deal with the stress of the old environment any more.

But the benefits are not being enjoyed only by the IT staff. They are also being enjoyed by the business analysts, who according to Bruni “could not believe how quickly results were provided—they were so shocked that their work could be accomplished in a matter of hours rather than weeks that, initially, they thought data was cached.” She goes on to say that “one analyst, who is now a director, told us that he used the extra time for other projects, which ultimately helped him become more successful and receive a promotion.” Now that is what I call a great impact statement. The metrics are great, but when someone is freed up to do work that gets them a promotion, that’s a very tangible illustration of the difference that Netezza can make.

Another illustration of the difference is the impact it had on the group that implemented the Netezza system. As Bruni says “after we moved to IBM Netezza, the word spread that we were doing things right and that leveraging us as an internal service was really smart; we’ve gained new mission-critical areas, such as the social-impact area which supports our Drive to End Hunger and Create the Good campaigns.” It certainly looks like you can add IT management to the list of constituents who have had a positive career impact as a result of moving from Oracle Database to IBM Netezza.

For more information about this story, see AARP: Achieving a 347 percent ROI in three years from BI modernization effort.

Demo: Using Hadoop to Extract and Analyze Unstructured Information

Here’s a nice demo. It shows InfoSphere BigInsights, which is IBM’s Hadoop product. BigInsights is essentially Apache Hadoop together with extensions for installation, management, security, integration, and so on. The demo also shows InfoShpere BigSheets. BigSheets is basically an easy-to-use interface for creating and running Map and Reduce jobs. As you can see from the demo, BigSheets makes it quick and easy to apply text analytics extractors and filters to unstructured or semi-structured data. The demo itself shows how you can quickly analyze several aspects of revenue information pulled from earnings press releases. It even includes a nice round-trip to the annotated source data to see “why” certain conditions occurred.

Don’t forget there is no charge for BigInsights Basic Edition. You can freely download it from InfoSphere BigInsights.

Benchmark Results for Informix TimeSeries in Meter Data Management

AMT-SYBEX are a leading provider of platforms for traditional and smart metering. They created a Meterflow Benchmark to help customers choose the best underpinning infrastructure for their platform, and they worked with IBM to run that benchmark with Informix TimeSeries. I previous blogged about Why Informix Rules for Time Series Data Management. Well, the results of this benchmark further illustrate the benefits of Informix TimeSeries. The following quote is from the resulting AMT-SYBEX case study:

We believe that this represents ground breaking levels of performance which is ten times faster than other published benchmarks in this area.

As you can see, Informix is 10x faster than the leading database software they previously worked with. If you read the Executive Summary, you will also see that IBM Informix enjoys almost linear scalability when going from 10 million meters up to 100 million meters, which is a great testament to the efficiency of operation for Informix TimeSeries.

Netezza Video by DJ Steve Porter

If you saw the NBA video mix ad campaign during last year’s NBA Playoffs in the US, you will be familiar with DJ Steve Porter’s work. It is a fusion of video remixing, together with his DJ skills. Even if you were not lucky enough to catch that ad campaign, you may have come across him from his many hugely successful viral videos on YouTube. Well, DJ Steve Porter has used his unique style to create a video for IBM Netezza. I think you’ll agree that it is not your typical corporate video.

DB2 and Oracle Database: An Autonomic Computing Comparison

Wikipedia defines autonomic computing as “the self-managing characteristics of distributed computing resources, adapting to unpredictable changes whilst hiding intrinsic complexity to operators and users”. Both IBM and Oracle have added autonomic computing features to their database software products. On 29 September 2011, IBM will host a Chat with the Labs webcast where the hosts will compare the autonomic computing features of IBM DB2 and Oracle Database in the following areas:

  • Memory Management
  • Storage Management
  • Utility throttling
  • Automatic Configuration
  • Automatic Maintenance

You can sign up for the webcast at: DB2 and Oracle Database: An Autonomic Computing Comparison.

Initial Thoughts on New Oracle Database Appliance

I just watched the Oracle Webcast announcing its new database appliance. Here are my initial reactions.

I expected Oracle to announce a mini-Exadata, as had been widely rumored. However, as far as I can see, this is not a mini-Exadata. I don’t have details yet, but it does not appear to contain the Exadata storage-layer software. This is simply an Oracle Database appliance, or an Oracle RAC appliance. Nothing more. In other words, it is the fusion of Oracle software, Oracle hardware, and some support/services.

Because this announcement is really about making Oracle Database easier to deploy, I’m not sure it has much applicability for organizations with an existing Oracle Database set-up, unless they are planning a hardware migration. But judging from how this was presented and positioned, Oracle are probably focusing this product on channel sales, and making this as partner-friendly as possible.

I like that Oracle have followed IBM’s lead and added pay-as-you-grow licensing/pricing. In this appliance, you activate and license CPU cores as needed. Of course, IBM pureScale Application System already offers this. However, IBM does still have an advantage in this regard with its ability to seamlessly add or remove database processing capacity, with a combination of its transparent scaling and daily-based software licensing. In other words, you can purchase the DB2 pureScale database software licenses only for the days where you need the extra capacity. This is a great strategy for eliminating the over-provisioning of database software licenses just to deal with situations where there are significant short-term spikes in demand (like retailers around the holidays, for instance). For more information, check out the “flexible licensing” section of the following blog post: IBM Previews New Integrated System for Transactional Workloads.

Something that’s not clear yet are the growth options for these Oracle Database Appliances. At least from my initial look, there does not seem to be a seamless upgrade path to Exadata. It appears that if someone wants to grow from this appliance, it will not be an insignificant undertaking. Perhaps someone from Oracle can comment on that.

So, in summary, this announcement appears to simply be a packaging exercise by Oracle, where they have created a relatively straightforward database appliance. Your thoughts/reactions?

Why Informix Rules for Time Series Data Management

Informix has had TimeSeries data management capabilities for more than a decade. However, those capabilities are garnering more attention today than ever before. As our world is becoming more instrumented, there is an increasing need to manage data from sensors. And this data from sensors is often being generated at intervals, creating the need for time series analysis.

As I’ve just said, Informix has long had TimeSeries capabilities. However, it wasn’t until recent customer evaluations became public knowledge that the incredible performance of Informix for time series applications became apparent. And now, as a result, Informix is being touted for Smarter Planet-type solutions, including Smart Grid systems.

I mentioned recent customer evaluations. One of those was at ONCOR, a provider of electricity to millions of people in Texas. ONCOR compared Oracle Database and Informix for its temporal data management and analysis needs. They discovered that, for their usage, Informix is 20x faster than Oracle Database when it comes to loading data from Smart Meters. They discovered that Informix is up to 30x faster than Oracle Database for their time series queries. And ONCOR discovered that, before applying data compression, Informix has storage savings of approximately 70% when compared with Oracle Database.

If you are using Oracle Database for a time series application, you should certainly consider Informix. You may significantly improve performance, while at the same time lowering your server, software, and storage costs. The secret sauce is in the way that Informix stores and accesses this time series data. The Informix approach is unique among relational database systems. It stores information that indicates the data source only one time, and then stores the time-stamped values for that source in an infinitely wide column beside it. This approach results in both the storage savings and the huge performance gains.

If you want to read more about Informix and the management of time series data, check out these recent blog posts: This Smart Meter Stuff is for Real and Using Informix to Capture TimeSeries Data that Overwhelms Commodity Databases.

Gartner: IBM DB2's Maturing Oracle Compatibility Presents Opportunities, with some Limitations

When IBM DB2 first added syntax that is compatible with Oracle Database data types, SQL, PL/SQL, scripting, and more, Gartner wrote their “first take” on the technology in IBM DB2 9.7 Shakes Up the DBMS Market With Oracle Compatibility. More than two years later, Gartner are now following up with a research report on the features. You can read the research report at: IBM DB2′s Maturing Oracle Compatibility Presents Opportunities, with some Limitations. This is not a commissioned report. It is independent research from Gartner.

Gartner: IBM DB2's Maturing Oracle Compatibility Presents Opportunities, with some Limitations

More Organizations Move up to the Mainframe and DB2 for z/OS

Any of you who are familiar with DB2 on the mainframe (officially known as DB2 for z/OS) know how efficient it is. The mainframe is not for every organization. However, for those organizations for whom the mainframe is a good fit, the tremendous levels of efficiency, reliability, availability, and security directly translate into significant cost savings.

Database software on the mainframe may be relatively boring when compared with the data management flavor of the day (whether it is Hadoop or any of the other technologies associated with Big Data). But when it comes to storing mission-critical transactions, nothing beats the ruthless efficiency of the mainframe. And that boring, ruthless efficiency has been winning over organizations.

Earlier this year, eWeek reported how the IBM Mainframe Replaces HP, Oracle Systems for Payment Solutions. In this article, eWeek describe how Payment Solution Providers (PSP) from Canada chose DB2 for z/OS over Oracle Database on HP servers. A couple of items in this article really catch the eye. One is that the operational efficiencies of the mainframe are expected to lower IT costs up to 35 percent for PSP. The other is that PSP’s system can now process up to 5,000 transactions per second.

Another organization who moved in the same direction is BC Card—Korea’s largest credit card company. The Register ran a story about how a Korean bank dumps Unix boxes for mainframes. BC Card is a coalition of 11 South Korean banks that handles credit card transactions for 2.62 million merchants and 40 million credit card holders in the country. They dumped their HP and Oracle Sun servers in favor of an IBM mainframe. In an accompanying IBM press release, it was revealed that IBM scored highest in every benchmark test category from performance to security to flexibility. Another significant factor in moving to the mainframe is the combination of the utility pricing that lets customers activate and deactivate mainframe engines on-demand, together with software pricing that scales up and down with capacity.

Despite continual predictions to its demise, it has been reported that the mainframe has experienced one of its best years ever, with an increase in usage (well, technically MIPS) of 86% from the same time in 2010. Much of this growth is coming from new customers to the mainframe. In fact, since the System z196 mainframe started shipping in the third quarter of 2010, IBM has added 68 new mainframe customers, with more than two-thirds of them consolidating from distributed systems.

It may not be as exciting as the newest technology on the block, but it is difficult to beat the reliability and efficiency of the mainframe. Especially when you are faced with the realities of managing a relatively large environment, and all of the costs associated with doing so. And don’t forget, the mainframe can provide you with a hierarchical store, a relational store, or a native XML store. And when you combine the security advantages and the 24×7 availability, together with cost efficiency, it makes for an interesting proposition.