PeerSpot user
Lead Software Engineer - Theatrical Global at a marketing services firm with 1,001-5,000 employees
Vendor
The biggest performance improvements are for queries that have to analyze a large amount of historical data.

What is most valuable?

Fast query processing for historical data analytics. Write Optimized Store (WOS) continuous data loading without drastically impacting performance of OLAP queries. It's one of the few columnar databases that has the capability to provide near real time data delivery for analytics with minimal delay sourcing data from traditional databases or NoSQL data stores or any unstructured data sources.

How has it helped my organization?

With traditional RDBMS historical data analysis or any complex queries took minutes to complete. With the addition of Vertica to handle big data queries, these reports are now returned in under 15 seconds. The biggest performance improvements obviously are for queries that have to analyze a large amount of historical data.

What needs improvement?

Stability, scalability (3 node Community Edition) and backup/restore all need to be worked on. Without proper work load management and resource pool allocation, any batch/ETL or streaming jobs which refreshes data frequently will impair OLAP query performance.

For how long have I used the solution?

We've been using the three node cluster for about one and a half years.

Buyer's Guide
Vertica
March 2024
Learn what your peers think about Vertica. Get advice and tips from experienced pros sharing their opinions. Updated: March 2024.
768,924 professionals have used our research since 2012.

What do I think about the stability of the solution?

We had several incidents where SQL queries with UDF predicates would shutdown the cluster or sometimes a single node. We worked with HP support to get these things fixed with subsequent versions of Vertica.

What do I think about the scalability of the solution?

With the Community Edition we are restricted to three nodes. We have a lot of enterprise clients who stress our cluster to its limits. The only advice I would give to new adopters is that if you want superior performance and reliability you are better off going all-in with the enterprise edition and a large number of nodes; assuming you have a lot of clients who run queries concurrently.

How was the initial setup?

Setup and administration are very easy. Vertica was designed to be operational with minimal Database Administrator effort.

Which other solutions did I evaluate?

We evaluated various other solutions but we chose Vertica because its SQL implementation is very similar to PostgreSQL, and therefore it saved us lot of development time re-writing SQL queries. Vertica seems to be one of the few columnar database which can handle both ETL/Batch jobs and OLAP queries simultaneously. We stream data into Vertica from RDBMS frequently than what is typically recommended for Columnar databases.

Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
PeerSpot user
Co-Founder at a tech services company
Consultant
Its speed differentiates it from other columnars, and works on commodity hardware
Pros and Cons
  • "The feature of the product that is most important is the speed. I needed a columnar database, and its speed is what it's built to do, and so that's what really does differentiate Vertica from its competitors."
  • "I don't need any special hardware. I can use commodity hardware, which is nice to have in a commercial solution."

    What is our primary use case?

    When I have a business need for a few pieces of information, and I need to process it quickly, that's when I use Vertica.

    How has it helped my organization?

    We got something like a six-times improvement using Vertica.

    What is most valuable?

    The feature of the product that is most important is the speed. I needed a columnar database, and its speed is what it's built to do, and so that's what really does differentiate Vertica from its competitors.

     I think what also draws me to it is that I don't need any special hardware. So I can use commodity hardware, which is nice to have in a commercial solution.

    What needs improvement?

    I think it's starting to get a little expensive. Open source products are starting to get more robust, so I think that's something that they need to start looking at in terms of licensing.

    For how long have I used the solution?

    More than five years.

    What do I think about the stability of the solution?

    Absolutely stable. It's supported. The stability is one thing, the support is the other thing.

    What do I think about the scalability of the solution?

    No scalability issues. Like I said, in its competitive set it is just faster, better, depending on how you use it, because it is columnar.

    How are customer service and technical support?

    We don't need them that much, but when we do need them, we use the virtual tech support, and that's fine. It works, and it's responsive. Within 24 hours, we get resolution.

    We didn't pay for a higher tier of service, but we generally just have questions for support.

    Which solution did I use previously and why did I switch?

    We've used Greenplum, Teradata, and then Vertica. We used the big data open source solutions as well that are getting better. So those are the four that I can think of off the top of my head. Greenplum and Teradata are just getting too expensive. 

    Particularly compared against its open source set, I think that's really the one key piece where Vertica might have a little bit more ladder room. It was always the leader in terms of pricing against Greenplum and Teradata, so that's why Vertica turned up again for us, but now that the open source solutions are trying to compete a little bit better in terms of stability, that's where we sometimes consider change.

    Which other solutions did I evaluate?

    I evaluated Teradata, and another, but I didn't like either of them, not for what we needed.

    What other advice do I have?

    The pros are, if you have columnar processing, then this is in your top three solutions. I think the con is the software pricing, and licensing needs to start getting more competitive with the open source solutions, or they need to market their stability a lot more.

    Test out the solution. Most people who test it buy it. So that's the biggest draw that it has, you can test in a day.

    Disclosure: I am a real user, and this review is based on my own experience and opinions.
    PeerSpot user
    Buyer's Guide
    Vertica
    March 2024
    Learn what your peers think about Vertica. Get advice and tips from experienced pros sharing their opinions. Updated: March 2024.
    768,924 professionals have used our research since 2012.
    it_user418314 - PeerSpot reviewer
    Associate at a tech services company with 501-1,000 employees
    Consultant
    I like the clustering aspect with the share-nothing mentality. I also value the ease of maintenance.

    What is most valuable?

    The biggest, most valuable feature for us is the clustering aspect with a share-nothing mentality. Most clusters usually require their own shared storage, shared subnet, etc. and this becomes a pain and a nightmare to maintain.

    The second most valuable feature is that it's very easy to maintain. It's a breeze once you know how to handle it with your scenario in mind.

    How has it helped my organization?

    Loading raw data and leveraging column store to quickly aggregate the values as well as run a general analysis were the biggest improvements we found. Before, we had to scrub the data or reformat, load it, possibly scrub it some more, and then run the first set of analysis, and so on.

    With Vertica, we were able to combine some of these steps, such as loading gzip data directly into the table and leveraging R in Vertica to run all of the analysis.

    What needs improvement?

    Developer Tools - Vertica really needs some kind of IDE plugin for a system such as Eclipse or IntelliJ. Developing external functions in Vertica can kind of be like shooting in the dark sometimes. Also, an improved monitor or monitoring with alerting built-in that actually works would be a welcome addition.

    They truly need a Python or some script that can handle all of the low-level system changes for you and find out how the customer has heavily modified their nodes before the install. Some automation here would help a lot.

    The product overall is a great product, however management tools as well as monitoring tools are lacking. The product does, however, offer a lot of information in the form of system views and tables, but most of the data is hard to translate with out the help of their support team.

    For how long have I used the solution?

    I have used HP Vertica in multiple companies over the last four years. We currently have it running on a three-node Centos cluster and a six-node Centos cluster.

    What was my experience with deployment of the solution?

    There have been no issues with the deployment.

    What do I think about the stability of the solution?

    There have been no issues with the stability.

    What do I think about the scalability of the solution?

    We have had no issues scaling it for our needs.

    How are customer service and technical support?

    Like everything else HP has support for, the support is very poor. You normally have to threaten to leave, not buy support renewals, or call your sales rep to talk
    to anyone who knows anything about the product. The community normally knows more than support and most of my questions or issues were resolved by searching the old community boards while I wait for overseas support to ask me to send them the logs again for the 50th time.

    Which solution did I use previously and why did I switch?

    I have previously tried SQL PDW, Mongo, Cassandra for alternatives. Even though all of those products are in different landscapes, the Vertica column store ended up being the best thing that was needed.

    How was the initial setup?

    It is straightforward if you read the documents and have mid to senior-level knowledge of Linux. Non-Linux admins will find the setup complex and cumbersome since most are Windows admin and they want point-and-click.

    What about the implementation team?

    We implemented through our in-house team. You need to read the docs, then read them again, and then make yourself a cheat sheet. Once you have done the setup for a two-node cluster, do some Research and Development before taking the time to do a large production cluster or buy the license.

    What was our ROI?

    ROI is great compared to the previous solution, SQL Server.

    What's my experience with pricing, setup cost, and licensing?

    TCO is much lower given the Linux OS and the fact that Vertica is licensed by data size and not node count. The best advice for licensing is to make sure you have a proper data retention policy in place and well-documented as well as some growth expectations before buying. Following this, it will make sure you don't over or under buy.

    What other advice do I have?

    If you are not Linux savvy, find a person that is. Make a cheat sheet with the commands and/or steps for your environment. If you are in the cloud, make sure to understand the networking aspect is completely different in AWS from it will be in your local data center. Failure to plan is planning to fail with Vertica implementation, and try not to mess up the spread as it's a pain to fix. If you read the documents, you will see what I am talking about.

    Disclosure: I am a real user, and this review is based on my own experience and opinions.
    PeerSpot user
    it_user450444 - PeerSpot reviewer
    BI Architect / Software Engineer at a tech services company with 51-200 employees
    Consultant
    We thought the Management Console was a nice feature, but it turns out it gives us insight on what is happening behind the scenes.

    What is most valuable?

    Speed of query response time for complicated queries on tables with billions of rows including joins on varchar columns. There is no limitation on which columns can be queried or joined on and we see query times in the milliseconds for a lot of queries that just won't return at all from other products.

    Ease of administration. The Management Console we thought was a nice to have turns out to give us insight on what is happening behind the scenes so easily it has sped up query tuning, insight as to what jobs are running, and resource use on the boxes the product sits on.

    Style of deployment. We were able to build out a server farm exactly as we are accustomed to. We did not have to buy fancy hardware. Our first cluster was deployed on servers we had sitting around from other migrations and replaced products. As we grow also the growth is native to how we do business.

    How has it helped my organization?

    We can have insight into data we never had before. We can provide that insight to internal users so we do not have to generate reports for them all the time. With response times like these there is no concern of having them wait for results to return and so they do not think things are broken.

    What needs improvement?

    Getting the Management Console up and running as expected was a bit of a challenge.

    For how long have I used the solution?

    We've been using it for one and a half years.

    What do I think about the stability of the solution?

    We have amazing stability. We even had to migrate the databases to other boxes and found it moved the data without much intervention from us and no down time. It worked exactly as a cluster should. We joke here all the time that we would love to say we like Vertica support but since we never need them, we actually do not know!

    What do I think about the scalability of the solution?

    Scalability is one of the huge strengths of this product, and scalable in a way, as I said before, that is native to how we do business.

    How are customer service and technical support?

    We've never had to contact them.

    Which solution did I use previously and why did I switch?

    We switched off of Infobright because it was not performant at all at the scale we needed. The number of limitations on Infobright are too many to list in a small review like this.

    How was the initial setup?

    Initial setup of the database was straightforward.

    What about the implementation team?

    We did need support though for the initial installation. They were incredibly responsive and helpful and deployment was completed in a very reasonable amount of time despite issues initially getting the Management Console up and running.

    What's my experience with pricing, setup cost, and licensing?

    Pricing is more than fair. This is very reasonably priced and since it is a perpetual license you are not stuck paying it again and again.

    Which other solutions did I evaluate?

    We evaluated Netezza and Teradata alongside Vertica.

    What other advice do I have?

    Do you want to stand up a data warehouse in a reasonable amount of time using the in-house skills accustomed to dealing with an RDBMS? If that is the case, nothing beats Vertica, hands down.

    Disclosure: PeerSpot contacted the reviewer to collect the review and to validate authenticity. The reviewer was referred by the vendor, but the review is not subject to editing or approval by the vendor.
    PeerSpot user
    Jelson-Ghigonetto - PeerSpot reviewer
    Senior Solution Architect at Grupo Cimcorp
    Real User
    Top 5
    Powerful tool, excellent revision options, with easy depolyment
    Pros and Cons
    • "I enjoy the cybersecurity and backup features."
    • "The biggest problem is the cost of cloud deployment."

    What is our primary use case?

    The primary use case is machine learning and currently, I am working on IOT projects.

    How has it helped my organization?

    With Vertica, I am able to make changes using other Vertica features and I do not have to start the project over. The Vertica tool is very powerful but you cannot purchase the product based on individual features. 

    What is most valuable?

    I am highly trained on Vertica and I am resistant to using other products because I do not have experience with those products. Some of the most valuable features are cybersecurity and backup.

    What needs improvement?

    The biggest problem is the cost of cloud deployment.

    For how long have I used the solution?

    I have worked with Vertica for the past five years.

    How was the initial setup?

    The initial setup is straightforward and easy deployment.

    What other advice do I have?

    I would rate Vertica an eight out of ten.

    Which deployment model are you using for this solution?

    Hybrid Cloud

    If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

    Other
    Disclosure: My company has a business relationship with this vendor other than being a customer: Partner
    PeerSpot user
    Director - Big Data, IoT and Analytics at a tech services company with 11-50 employees
    Reseller
    Scalable and high-performing with near real-time analytics
    Pros and Cons
    • "Vertica has a few features that I like. From an architecture standpoint, they have separated compute and storage. So you have low-cost object storage for primary storage and the ability to have several sub-clusters working off the same ObjectStore. So it provides workload isolation."
    • "Vertica offers a platform-as-a-service version, but their software-as-a-service solution is only available on AWS. They need to get a SaaS version on Azure and GCP as fast as possible."

    What is our primary use case?

    I use Vertica for traditional data warehouse reporting and some advanced analytics as well as real-time data processing or real-time analytics. I sell Vertica, so my clients' use cases vary. There are several types of users, and each company is going to have different requirements. And the number of users is not necessarily a great measure of usage if you're doing clickstream-type data on a website with massive numbers of queries coming in very quickly. But in other cases, you have people hammering it for reporting and various kinds of business use cases.

    What is most valuable?

    Vertica has a few features that I like. From an architecture standpoint, they have separated compute and storage. So you have low-cost object storage for primary storage and the ability to have several sub-clusters working off the same ObjectStore. So it provides workload isolation. Vertica handles analytics and large queries extremely well — very high performance —and on top of all of these features, Vertica has a very low total cost ownership.

    What needs improvement?

    Vertica offers a platform-as-a-service version, but their software-as-a-service solution is only available on AWS. They need to get a SaaS version on Azure and GCP as fast as possible. I know that's in their roadmap, but I can't wait.

    For how long have I used the solution?

    I've been using Vertica for almost three years.

    What do I think about the stability of the solution?

    Vertica is rock-solid.

    What do I think about the scalability of the solution?

    Vertica is massively scalable. My customers' workloads vary. While the number of users is not necessarily the ideal performance measure, the user bases range from a few dozen in smaller setups to thousands in the largest setups. And some of my customers have several hundred users on the platform concurrently.

    How are customer service and support?

    I deal directly with Vertica's engineers, so I don't have much experience with their frontline tech support. However, the more senior tech support on the backline is just world-class. These guys are amazing. And they are exceedingly committed to their customers. I've seen them do crazy stuff to support a customer.

    How was the initial setup?

    Setting up Vertica is pretty straightforward. There's nothing super fancy about deploying Vertica. Depending on what you're doing, deployment can take anywhere from 10 to 15 minutes to several hours. It's quick if you want to use the automatic tools, which work very well. And if you want to do your own thing or do it on-prem, then it's going to take you a little bit longer. And you don't need large teams to administer it afterward.

    What other advice do I have?

    I rate Vertica 10 out of 10. Anyone thinking about using Vertica should try to understand more than its basic functionality. They should look at its more powerful features like advanced, predictive, and real-time analytics. And just to add, that in the world of data warehousing, it's more accurate to say "near real-time" rather than "real-time" analytics. 

    Which deployment model are you using for this solution?

    On-premises
    Disclosure: My company has a business relationship with this vendor other than being a customer: Partner
    PeerSpot user
    PeerSpot user
    Senior business Intelligence consultant at Asociación SevillaUP
    Consultant
    ​Data Warehouse response times have decreased​. It doesn't support stored procedures in the way we are used to thinking of them.

    What is most valuable?

    Speed in query in general and specifically in aggregate functions on multi-million rows tables.

    How has it helped my organization?

    Data Warehouse response times have decreased of one order of magnitude with respect to the previous solution (SQL Server + Oracle).

    What needs improvement?

    Sadly, it does not support stored procedures in the way we are used to thinking of them. There is the possibility to code plug-in in C++, but that's out of our reach. Correlated sub-queries are another point where we'd love to see enhancements, plus the overall choice of functions available. ETL with SSIS was not as easy as one we had expected (must remember to COMMIT and we had some issues with datetime + timezone, but that's was probably our fault).

    OleDB and .NET providers need some touches; and another great improvement would be support for Entity Framework, which so far I haven't seen.

    There is no serious graphical IDE for HPE Vertica, that's frustrating. One free option available is DbVisualizer for Vertica, but it's a bit basic.

    For how long have I used the solution?

    One year.

    What do I think about the stability of the solution?

    We have a one node cluster on Red Hat and last week the DB went down. The setting to restart the database is not very intuitive and by default the DB does not restart alone.

    After a reboot, which may be good in some environments, but leaves you with an insecurity feeling.

    What do I think about the scalability of the solution?

    Our DB isin in the tens of Gigs, we did not need to scale yet.

    How are customer service and technical support?

    N/A, not used.

    Which solution did I use previously and why did I switch?

    We had SQL Server, switched for money reasons and space. But we're not sure yet, SQL Server is way more stable and predictable.

    How was the initial setup?

    No, the documentation is scarce on non standard setups. We had to create a virtual machine locally, set it up and then upload it to AWS.

    What's my experience with pricing, setup cost, and licensing?

    We use the free community license, plenty of space for our environment. If I had unlimited budget I'd buy a preinstalled instance on EC2, much faster, but costly.

    Which other solutions did I evaluate?

    Netezza, but I didn't like it. For no particular reason, but the feeling was not right. Redshift - I was not impressed by the performance. Google Big Query - we tried it.

    What other advice do I have?

    Do COMMIT, and enable/enforce constraints because by default they ARE NOT!!!!

    Disclosure: I am a real user, and this review is based on my own experience and opinions.
    PeerSpot user
    PeerSpot user
    Architect at a tech services company with 51-200 employees
    Consultant
    Vertica allows for thousands of users to run an analysis at the same time. Great aggressive compression.

    At the tech company I work , we were looking for new ways to allow end users (a couple of thousand external users)  to crunch through their detailed data in real time as well as enabling internal users and data analysts to gain the information they needed to run and optimize their business processes. 

    Unfortunately our current system had  become slower and slower over time due to the tremendous increase in data to be managed so a new approach had to be taken to accomplish this goal.  Our existing data warehouse/data management infrastructure just could not handle big data.

    We evaluated a variety of different solutions such as Amazon Redshift, Infobright and Microsoft. Vertica won out above all these other solutions. Our dataset is several hundred million rows and our avg. response time goal was less than 5 secs. We are building our environment for the future so another requirement was to be able to scale horizontally. 

    Redshift came close in response time but failed in concurrency, meaning multiple users running an analysis at the same time. Infobright came close in response time and concurrency but didn’t provide sufficient scalability. Vertica checked all boxes at a very competitive price-point.

    We found that the extreme speed, performance and flexibility is superior to all the other solutions out there. The massive scalability on industry-standard hardware, standard SQL interface and database designer and administration tools are excellent features of Vertica. I also really value the simplicity, concurrency for hundreds or thousands of users, and aggressive compression.

    This new environment allowed us to implement applications such as clickstream and predictive analysis which have added tremendous value for us. Currently there is about 500 GB – 1 TB of data that I am managing and I have found that Vertica is able to be integrated very well with a variety of Business Intelligence (BI), visualization, and ETL tools in their environment. I use Hadoop, Tableau and Birst and using all these solutions with Vertica has been overall quite smooth.

    Our query performance has increased by 500 – 1,000% through improvements in response time and I am now able to compress our data by more than 50%. The simultaneous loading and querying and aggressive compression has helped us become more efficient and productive. Furthermore the high availability without hardware redundancy, optimizer and execution engine, and high availability for analytics systems has saved us both time and money. 

    Disclosure: PeerSpot has made contact with the reviewer to validate that the person is a real user. The information in the posting is based upon a vendor-supplied case study, but the reviewer has confirmed the content's accuracy.
    PeerSpot user

    It seems you were mainly focused on how Vertica is good and did not run a benchmark, otherwise it could be nice if you could publish loading and query performance between all above DB's. 500GB - 1TB is not a lot of data .

    See all 4 comments
    Buyer's Guide
    Download our free Vertica Report and get advice and tips from experienced pros sharing their opinions.
    Updated: March 2024
    Buyer's Guide
    Download our free Vertica Report and get advice and tips from experienced pros sharing their opinions.