IBM Information Analyzer Review
Data rules and column analysis are valuable, but the interface is not intuitive.

Valuable Features

Data rules, column analysis

Improvements to My Organization

Sometimes a project knows nothing about a particular set of data. IA is good at data profiling / data discovery. It can give insight into data about format, uniqueness, completeness, etc. The other powerful feature of IA is its ability to check data against business rules. It can give statistics on how many records violate a rule.
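To make the kind of statistics described above concrete, here is a minimal Python sketch. It is not how IA works internally. The sample records, the helper names `profile_column` and `rule_violations`, and the "balance must not be negative" rule are all hypothetical and chosen only for illustration.

```python
from collections import Counter
import re


def profile_column(values):
    """Return completeness, uniqueness and format frequencies for one column."""
    total = len(values)
    non_null = [v for v in values if v not in (None, "")]

    def mask(value):
        # Generalize a value into a format mask: letters -> A, digits -> 9.
        lettered = re.sub(r"[A-Za-z]", "A", str(value))
        return re.sub(r"[0-9]", "9", lettered)

    formats = Counter(mask(v) for v in non_null)
    return {
        "completeness": len(non_null) / total if total else 0.0,
        "uniqueness": len(set(non_null)) / len(non_null) if non_null else 0.0,
        "formats": dict(formats),
    }


def rule_violations(records, rule):
    """Count the records that fail a business rule given as a predicate."""
    failed = [r for r in records if not rule(r)]
    return len(failed), len(records)


# Hypothetical sample data, not taken from any real system.
records = [
    {"account_num": "AC1001", "balance": 250},
    {"account_num": "AC1002", "balance": -40},
    {"account_num": "", "balance": 10},
    {"account_num": "AC1001", "balance": 75},
]

stats = profile_column([r["account_num"] for r in records])
violations, total = rule_violations(records, lambda r: r["balance"] >= 0)
print(stats)
print(f"{violations} of {total} records violate the rule")
```

The profile reports how complete and how unique the column is and which value formats occur, while the rule check reports how many records fail a given condition.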

Room for Improvement

The interface is not the most friendly, and performance needs improvement.

The following features are documented in the user guide but do not work:

1. Global Logical Variables (GLVs)

2. Migrating projects. Neither the internal method (Export/Import) nor the command line interface (CLI) method works 100%. They always error out.

3. When you open a data rule and make no modifications, IA still asks if you want to save the changes when you close it. This is a bit disturbing: you know you did not change anything, yet you start to doubt what you think you know.

My wish list for new features:

1. Ability to use functions on data sources. I do not understand how IBM could miss this. Data sources are not visible when coding custom expressions. For example if you have a field called CUSTOMER.ACCOUNT_NUM, you cannot code TRIM(ACCOUNT_NUM). My workaround is to create a variable in the rule definition then bind it in the data rule. Functions can only be applied to variables, not directly to fields. I have a rule where I do things to about 12 fields - concatenate, substring, length, coalesce, etc - and I had to make up 12 lines in the definition that do nothing but refer to these variables. I had to invent a rule so I coded seemingly useless rule conditions like address1 = address1 just so I have a variable for the field I want to code functions for. Huge oversight on the part of IBM.

2. Copy a data rule and modify the copy. Right now only rule definitions can be copied, not data rules. Sometimes I need to create two or more versions of the same rule. IA forces me to generate each of them from scratch. This is annoying when version 2 is only slightly different from version 1. If it took me an hour to code the original, it would take me close to that amount of time to code the new version. If I could copy and modify, the effort would only take maybe 5 minutes.

3. The date of last modification. IA only shows the date of creation, which is generally useless. The last modification date is far more important and needs to be available and visible.

4. A file manager, a la Windows Explorer. I may want to see the list of rules and sort them by date of modification.

5. Enhanced dedup on output. Currently, IA can only exclude duplicates based on the entire record. It should allow deduping on a select set of columns.

6. Feature to select one record from multiple matches in a join. For instance, in SQL, one can use FETCH FIRST ROW ONLY or ROWNUM in Oracle, or TOP 1 in SQL Server.

7. Ability to sort the output.
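For wish list items 5 to 7, the behavior I have in mind can be sketched in a few lines of Python. This is only an illustration of the requested output handling, not anything IA provides. The function names `dedup_on` and `join_first_match` and the sample tables are hypothetical.

```python
def dedup_on(rows, key_columns):
    """Keep the first row seen for each combination of key column values."""
    seen = set()
    kept = []
    for row in rows:
        key = tuple(row[c] for c in key_columns)
        if key not in seen:
            seen.add(key)
            kept.append(row)
    return kept


def join_first_match(left, right, key):
    """Left-join two row lists, keeping only the first right-side match."""
    first = {}
    for r in right:
        first.setdefault(r[key], r)  # later duplicates for a key are ignored
    return [{**l, **first.get(l[key], {})} for l in left]


# Hypothetical sample data.
customers = [{"cust_id": 1, "name": "Ann"}, {"cust_id": 2, "name": "Bob"}]
addresses = [
    {"cust_id": 1, "city": "Austin"},
    {"cust_id": 1, "city": "Boston"},
    {"cust_id": 2, "city": "Denver"},
]

# Item 5: dedup on a select set of columns instead of the entire record.
one_address_per_customer = dedup_on(addresses, ["cust_id"])

# Item 6: pick one record when a join finds multiple matches.
joined = join_first_match(customers, addresses, "cust_id")

# Item 7: sort the output, here by name in descending order.
sorted_output = sorted(joined, key=lambda row: row["name"], reverse=True)
print(sorted_output)
```

Which duplicate counts as "first" depends on the input order here. A real feature would need to let the user choose the ordering that decides which record wins.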

Use of Solution

9 years

Stability Issues

Yes, the product sometimes crashes or freezes.

Scalability Issues


Customer Service and Technical Support

Customer Service:

Scale of 1 to 10: 8. While IBM is excellent at responding to inquiries, it is slow to implement much-needed software fixes. While that is common in the industry, I would still like to see IBM fix software bugs sooner.

Technical Support:

Same as customer service.



Other Advice

Get the latest version. Compare with competing products. Know that there are not many experts in the product and that you may pay a premium to hire them.

Disclosure: I am a real user, and this review is based on my own experience and opinions.
reviewer233916, Real User, TOP 20

I agree the interface is not the best, so there is still room for improvement. Performance, however, has a lot to do with the data volumes being processed. Moving the IADB out of xmeta, with some tuning in DB2, and running the IA jobs with adequate configuration files would certainly improve performance. It also helps to process only the required columns rather than the full table and/or file, and perhaps to use data samples rather than full data profiling.

06 December 16
