We just raised a $30M Series A: Read our story

IBM SPSS Statistics Competitors and Alternatives

Get our free report covering IBM, TIBCO, Weka, and other competitors of IBM SPSS Statistics. Updated: September 2021.
540,984 professionals have used our research since 2012.

Read reviews of IBM SPSS Statistics competitors and alternatives

RameshCh
Sr. BigData Architect at ITC Infotech
MSP
Top 5
Very elastic, easy to scale, and a straightforward setup

Pros and Cons

  • "It's easy to increase performance as required."
  • "Instead of relying on a massive instance, the solution should offer micro partition levels. They're working on it, however, they need to implement it to help the solution run more effectively."

What is our primary use case?

We work with clients in the insurance space mostly. Insurance companies need to process claims. Their claim systems run under Databricks, where we do multiple transformations of the data. 

What is most valuable?

The elasticity of the solution is excellent.

The storage, etc., can be scaled up quite easily when we need it to.

It's easy to increase performance as required.

The solution runs on Spark very well.

What needs improvement?

Instead of relying on a massive instance, the solution should offer micro partition levels. They're working on it, however, they need to implement it to help the solution run more effectively.

They're currently coming out with a new feature, which is Date Lake. It will come with a new layer of data compliance.

For how long have I used the solution?

We've been using the solution for two years.

What do I think about the stability of the solution?

I don't see any issues with stability going down to the cluster. It would certainly be fine if it's maintained. It's highly available even if things are dropped. It will still be up and running. I would describe it as very reliable. We don't have issues with crashing. There aren't bugs and glitches that affect the way it works.

What do I think about the scalability of the solution?

The system is extremely scalable. It's one of its greatest features and a big selling point. If a company needs to scale or expand, they can do so very easily.

We require daily usage from the solution even though we don't directly work with Databricks on a day to day basis. Due to the fact that we schedule everything we need and it will trigger work that needs to be done, it's used often. Do you need to log into the database console every day? No. You just need to configure it one time and that's it. Then it will deliver everything needed in the time required.

How are customer service and technical support?

We use Microsoft support, so we are enterprise customers for them. We raise a service request for Databricks, however, we use Microsoft. Overall, we've been satisfied with the support we've been given. They're responsive to our needs.

Which solution did I use previously and why did I switch?

We work with multiple clients and this solution is just one of the examples of products we work with. We use several others as well, depending on the client.

It's all wrappers between the same underlying systems. For example, Spark. It's all open-source. We've worked with them as well as the wrappers around it, whether the company was labeled Databrary, IBM insights, Cloudera, etc. These wrappers are all on the same open-source system.

If we with Azure data, we take over Databricks. Otherwise, we have to create a VM separately. Those things are not needed because Azure is already providing those things for us.

How was the initial setup?

The situation may have been a bit different for me than for many users or organizations. I've been in this industry for more than 15 or 17 years. I have a lot of experience. I also took the time to do some research and preparation for the setup. It was straightforward for me.

The deployment with Microsoft usually can be done in 20 minutes. However, it can take 40 to 45 minutes to complete. An organization only requires one person to upload the data and have complete access to the account.

What about the implementation team?

I deployed the solution myself. I didn't require any assistance, so I didn't enlist any resellers or consultants to help with the process.

What's my experience with pricing, setup cost, and licensing?

The solution is expensive. It's not like a lot of competitors, which are open-source.

What other advice do I have?

There isn't really a version, per se. 

It's a popular service. I'd recommend the solution. The solution is cloud-agnostic right now, so it really can go into any cloud. It's the users who will be leveraging installed environments that can have these services, no matter if they are using Azure or Ubiquiti, or other systems.

I don't think you can find any other tool or any other service that is faster them Databricks. I don't see that right now. It's your best option.

Overall, I'd rate the solution eight out of ten. The reason I'm not giving it full marks is that it's expensive compared to open source alternatives. Also, the configuration is difficult, so sometimes you need to spend a couple of hours to get it right.

Which deployment model are you using for this solution?

Public Cloud
Disclosure: I am a real user, and this review is based on my own experience and opinions.
Shilpa Prakash
Analytics Lead at Pegasus
Real User
Top 5
Good stability, very good data analysis tool pack and excellent documentation

Pros and Cons

  • "Most of the features, especially on the data analysis tool pack, are really good. The way they do clustering and output is great. You can do fairly elaborate outputs. The results, the ensembles, all of these, are fantastic."
  • "Virtualization could be much better."

What is our primary use case?

The solution can be used in any domain, including banking, insurance, health care, partitions, classifications, etc. It's basically a complete solution, that helps with data client consolidation.

How has it helped my organization?

It's a VSL service provider, so improvement to an organization depends on how the client designs the implementation for the plan. Clients would be better reviewers of how the solution has improved their organizations.

What is most valuable?

Most of the features, especially on the data analysis tool pack, are really good. The way they do clustering and output is great. You can do fairly elaborate outputs. The results, the ensembles, all of these, are fantastic.

What needs improvement?

The solution should be faster.

Virtualization could be much better.

The solution could add some freeware. The text analytics, for example, aren't there in the VSL version. It requires a separate license. It should be included in the main solution's license as it's a basic requirement. It's shouldn't cost extra.

For how long have I used the solution?

I've been working with the solution for four years.

What do I think about the stability of the solution?

The stability of the solution is good. We've never found any issues. It's compatible through any open-source solution. It's also very flexible for us to use in any of the parameters, unlike a few other professional products that we have used. With RedHat, for example, there is not the flexibility of changing the parameters and we are forced to use one or two version features.

What do I think about the scalability of the solution?

I haven't tried to scale because we don't have an extensive user base, so I can't speak to scalability.

How are customer service and technical support?

We haven't been in touch with technical support, but the documentation is quite elaborate, so I don't believe we'll need to reach out to them that much.

Which solution did I use previously and why did I switch?

We previously used IBM products. This solution is more flexible and robust.

How was the initial setup?

The initial setup was complex, but the support for the solution is good, so it was an okay implementation.

Deployment times vary. If there's a device, the SAS administrator can set it up pretty fast. Otherwise, for a first time user, it takes a while. Having administrator services helps a lot.

What's my experience with pricing, setup cost, and licensing?

The solution is quite expensive. The pricing is too high.

What other advice do I have?

I'm a SAS Enterprise Miner certified professional. I'm both a consultant and a user of the solution.

Our company is using the on-premises deployment model, although we mostly use in-house products. I use the desktop version for our clients. We might use the Cloud, depending on what the client is comfortable with. Most of our clients are enterprises.

I'd advise others considering implementing the solution to look at the costs and licensing to see if it is within your organization's budget.

I'd rate the solution eight out of ten.

Disclosure: My company has a business relationship with this vendor other than being a customer: Partner
KH
Trainer at a government with 10,001+ employees
Real User
Free to use, stable, and easy to install

Pros and Cons

  • "It can handle an unlimited amount of data, which is the advantage of using Knime."
  • "It could input more data acquisitions from other sources and it is difficult to combine with Python."

What is our primary use case?

Knime is used for data analytics.

What is most valuable?

It can handle an unlimited amount of data, which is the advantage of using Knime.

It already has algorithms included.

What needs improvement?

I haven't had a lot of time to explore Knime in detail, but when you compare it with Orange, I would like it to be able to find data and collect it from another source. Also, to collect data for Knime from Twitter, Instagram, or Facebook for example, and to add widgets to Knime.

It could input more data acquisitions from other sources and it is difficult to combine with Python. It can be done with special requirements.

For how long have I used the solution?

I have been using Knime for three months.

What do I think about the stability of the solution?

In the three months that I have been using Knime, it has been very stable.

What do I think about the scalability of the solution?

From my understanding, it is scalable. It can handle a large amount of data. It indicates that it can handle unlimited amounts of data.

How was the initial setup?

The initial setup was straightforward. It was very easy.

What's my experience with pricing, setup cost, and licensing?

This is an open-source solution that is free to use.

What other advice do I have?

I would recommend Knime to others who are interested in using it.

Students can use Kmine for their research.

I would rate Knime an eight out of ten.

Disclosure: I am a real user, and this review is based on my own experience and opinions.
Flag as inappropriate
Get our free report covering IBM, TIBCO, Weka, and other competitors of IBM SPSS Statistics. Updated: September 2021.
540,984 professionals have used our research since 2012.