IBM InfoSphere BigInsights Review

It gives us the option of extending our analytics system. Now, a customer can move part of their data from Netezza into a Hadoop cluster.


What is most valuable?

BigSQL, an implementation of the DB2 Database Partitioning Feature on an HDFS cluster, used in conjunction with IBM Fluid Query for Netezza.

How has it helped my organization?

It gives us the option of extending our analytics system. Now, a customer can move part of their data from Netezza into a Hadoop cluster. What's more, they can run algorithms directly on HDFS using R.

What needs improvement?

The installation process for the IBM ValueAdd components should be improved, especially the scripts for R/BigR installation. IBM could also add Hue and Ranger to the set of services.

For how long have I used the solution?

I've used it for three months.

What was my experience with deployment of the solution?

Deployment of BigR is problematic because the services have to be installed across the whole cluster. Some elements are not clearly described in the documentation, and the relevant material is split across several topics in InfoCenter.

How are customer service and technical support?

Great. We have direct support from IBM Poland pre-sales and lab teams.

Which solution did I use previously and why did I switch?

I have implemented integration processes on Hortonworks and Cloudera. Our customer chose this product because of its Fluid Query implementation.

How was the initial setup?

Initial setup is rather complex in comparison with Cloudera.

What about the implementation team?

I have implemented the cluster for our customer with IBM support.

What's my experience with pricing, setup cost, and licensing?

IBM provides a PVU (Processor Value Unit) model for licensing. In addition, a five-node cluster is included with various products such as IBM Information Server.

Disclosure: My company has a business relationship with this vendor other than being a customer: We're a business partner.
3 Comments

Business Unit Technical Lead at a tech services company with 1,001-5,000 employees
Real User

Marcin,

I just started working with two people from that Poland group, and I agree they are a giant help! I wish I could have had access earlier. On BigR, we had to install ours as well. It was fairly complex, and the install process was poorly documented. The BigInsights SQL implementation is fairly good. So far the only issue I have found is that CTAS doesn't work exactly the same as in other databases. We have been using Fluid Query as well. Have you used both the data mover and the query feature? Which do you like better, Sqoop or data mover? Great post.

Architect at Cloudware Polska Sp z o.o.
Consultant

So far we have used only the query feature. In the future we would also like to configure the data movement feature; then I could compare it to Sqoop.

Business Unit Technical Lead at a tech services company with 1,001-5,000 employees
Real User

Data mover has an advantage over Sqoop in that it can be invoked from either the Netezza or the Hadoop side. On the Hadoop side I prefer Sqoop over data mover because I can control everything from the command line. Data mover requires a change to an XML file every time you want a new table.
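
To illustrate the command-line control described above, here is a minimal Python sketch that builds one `sqoop import` invocation per table, so adding a table means adding a name to a list rather than editing a configuration file. The host, database, user, password file, HDFS paths, and table names are hypothetical placeholders, not values from the deployments discussed here, and the helper function `sqoop_import_command` is invented for illustration. The flags shown are the standard Sqoop 1 import options; the sketch only prints the commands and does not run them.

```python
import shlex


def sqoop_import_command(jdbc_url, username, password_file, table,
                         target_root, mappers=4):
    """Build the argument list for importing a single table with Sqoop."""
    return [
        "sqoop", "import",
        "--connect", jdbc_url,              # JDBC URL of the source database
        "--username", username,
        "--password-file", password_file,   # file holding the password
        "--table", table,
        "--target-dir", f"{target_root}/{table.lower()}",
        "--num-mappers", str(mappers),      # number of parallel map tasks
    ]


# Hypothetical connection details and table list (assumptions for the sketch).
JDBC_URL = "jdbc:netezza://nz-host:5480/ANALYTICS"
TABLES = ["SALES", "CUSTOMERS"]

commands = [
    sqoop_import_command(JDBC_URL, "etl_user", "/user/etl/.nz_password",
                         table, "/data/netezza")
    for table in TABLES
]

for cmd in commands:
    # shlex.join quotes each argument so the line can be pasted into a shell.
    print(shlex.join(cmd))
```

Each printed line could be pasted into a shell or handed to a scheduler; whether this is more convenient than data mover's XML configuration depends on how often the table list changes.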