Cloudera Distribution for Hadoop vs InfluxDB vs Neo4j Graph Database comparison

Cancel
You must select at least 2 products to compare!
Cloudera Logo
772 views|598 comparisons
91% willing to recommend
InfluxData Logo
3,918 views|2,846 comparisons
100% willing to recommend
Neo4j Logo
1,142 views|743 comparisons
100% willing to recommend
Comparison Buyer's Guide
Executive Summary

We performed a comparison between Cloudera Distribution for Hadoop, InfluxDB, and Neo4j Graph Database based on real PeerSpot user reviews.

Find out what your peers are saying about MongoDB, Couchbase, InfluxData and others in NoSQL Databases.
To learn more, read our detailed NoSQL Databases Report (Updated: April 2024).
767,667 professionals have used our research since 2012.
Featured Review
Quotes From Members
We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:
Pros
"The file system is a valuable feature.""The most valuable feature is that I can use CDH for almost all use cases across all industries, including the financial sector, public sector, private retailers, and so on.""The most valuable feature is Impala, the querying engine, which is very fast.""The main advantage is the storage is less expensive.""The product provides better data processing features than other tools.""The tool can be deployed using different container technologies, which makes it very scalable.""It has the best proxy, security, and support features compared to open-source products.""CDH has a wide variety of proprietary tools that we use, like Impala. So from that perspective, it's quite useful as opposed to something open-source. We get a lot of value from Cloudera's proprietary tools."

More Cloudera Distribution for Hadoop Pros →

"InfluxDB is a database where you can insert data. However, it would be best if you had different components for alerting, data sending, and visualization. You need to install tools to collect data from servers. It must be installed on Windows or Linux servers. During installation, ensure that the configuration file is correct to prevent issues. Once data is collected, it can be sent to InfluxDB. For visualization, you can use open-source tools like Grafana.""The most valuable features of InfluxDB are the documentation and performance, and the good plugins metrics in the ecosystem.""InfluxDB's best feature is that it's a cloud offering. Other good features include its time-series DB, fast time-bulk queries, and window operations.""The user interface is well-designed and easy to use. It provides a clear overview of the data, making it simple to understand the information at hand.""The solution is very powerful.""The most valuable feature of the solution is we can use InfluxDB to integrate with and plug into any other tools.""The most valuable features are aggregating the data and integration with Graphana for monitoring.""In our case, it started with a necessity to fill the gap that we had in monitoring. We had very reactive monitoring without trend analysis and without some advanced features. We were able to implement them by using a time series database. We are able to have all the data from applications, logs, and systems, and we can use a simple query language to correlate all the data and make things happen, especially with monitoring. We could more proactively monitor our systems and our players' trends."

More InfluxDB Pros →

"Creates the ability to visualize outputs.""As a graph database, I am surprised at their performance and response time.""Enables people to understand what the business problem is and how the technology helps."

More Neo4j Graph Database Pros →

Cons
"The security of this solution could be improved. There should also be a way to basically have a blockchain enabled storage with the HDFS.""While the deployed product is generally functional, there are instances where it presents difficulties.""There are multiple bugs when we update.""The initial setup of Cloudera is difficult.""Cloudera Distribution for Hadoop is not always completely stable in some cases, which can be a concern for big data solutions.""There are better solutions out there that have more features than this one.""It would be useful if Cloudera had more tools like SQL Engines that offer the traditional relational database. We have to do a lot of work preparing the data outside Cloudera before getting it into the platform.""The solution does not support multiple languages very well and this means users need to create work-arounds to implement some solutions."

More Cloudera Distribution for Hadoop Cons →

"I've tried both on-premises and cloud-based deployments, and each has its limitations.""InfluxDB cannot be used for high-cardinality data. It's also difficult and time-consuming to write queries, and there are some issues with bulk API.""InfluxDB is generally stable, but we've encountered issues with the configuration file in our ticket stack. For instance, a mistake in one of the metrics out of a hundred KPIs can disrupt data collection for all KPIs. This happens because the agent stops working if there's an issue with any configuration part. To address this, it is essential to ensure that all configurations are part of the agent's EXE file when provided. This makes it easier to package the agent for server installation and ensures all KPIs are available from the server. Additionally, the agent cannot encrypt and decrypt passwords for authentication, which can be problematic when monitoring URLs or requiring authentication tokens. This requires additional scripting and can prolong service restart times.""The solution's UI can be more user-friendly.""InfluxDB can improve by including new metrics on other technologies. They had some changes recently to pool data from endpoints but the functionality is not good enough in the industry.""The solution doesn't have much of a user interface.""The error logging capability can be improved because the logs are not very informative.""In terms of features that I would like to see or have, in the community version, some features are not available. I would like to have clustering and authentication in the community version."

More InfluxDB Cons →

"So far, we have not had any issues and are happy with the product in general.""There are concerns about performance and whether the tool can necessarily scale to provide the solution."

More Neo4j Graph Database Cons →

Pricing and Cost Advice
  • "When comparing with Oracle Sybase and SQL, it's cheaper. It's not expensive."
  • "The price could be better for the product."
  • "I haven't bought a license for this solution. I'm only using the Apache license version."
  • "Cloudera requires a license to use."
  • "Cloudera Distribution for Hadoop is expensive, with support costs involved."
  • "I wouldn't recommend CDH to others because of its high cost."
  • "The price is very high. The solution is expensive."
  • "The solution is expensive."
  • More Cloudera Distribution for Hadoop Pricing and Cost Advice →

  • "We are using the open-source version of InfluxDB."
  • "InfluxDB is open-source, but there are additional costs for scaling."
  • "InfluxDB recently increased its price. It is very expensive now."
  • "The tool is an open-source product."
  • More InfluxDB Pricing and Cost Advice →

    Information Not Available
    report
    Use our free recommendation engine to learn which NoSQL Databases solutions are best for your needs.
    767,667 professionals have used our research since 2012.
    Questions from the Community
    Top Answer:The product comes with an annual subscription, which is expensive. They are bundling technologies together. You have to… more »
    Top Answer:InfluxDB is a database where you can insert data. However, it would be best if you had different components for… more »
    Top Answer:InfluxDB is generally stable, but we've encountered issues with the configuration file in our ticket stack. For… more »
    Top Answer:InfluxDB is a database where you can insert data. However, it would be best if you had different components for… more »
    Top Answer:As a graph database, I am surprised at their performance and response time.
    Top Answer:I don't have information about the license fee amount. That said, I know that the Neo4j license fee is more expensive… more »
    Top Answer:So far, we have not had any issues and are happy with the product in general.
    Ranking
    5th
    out of 18 in NoSQL Databases
    Views
    772
    Comparisons
    598
    Reviews
    14
    Average Words per Review
    409
    Rating
    8.1
    3rd
    out of 18 in NoSQL Databases
    Views
    3,918
    Comparisons
    2,846
    Reviews
    6
    Average Words per Review
    551
    Rating
    7.3
    7th
    out of 18 in NoSQL Databases
    Views
    1,142
    Comparisons
    743
    Reviews
    1
    Average Words per Review
    872
    Rating
    9.0
    Comparisons
    Learn More
    Overview
    Cloudera Distribution for Hadoop is the world's most complete, tested, and popular distribution of Apache Hadoop and related projects. CDH is 100% Apache-licensed open source and is the only Hadoop solution to offer unified batch processing, interactive SQL, and interactive search, and role-based access controls. More enterprises have downloaded CDH than all other such distributions combined.

    InfluxDB is open-source software that helps developers and enterprises alike to collect, store, process, and visualize time series data and to build next-generation applications. InfluxDB provides monitoring and insight on IoT, application, system, container, and infrastructure quickly and easily without complexities or compromises in scale, speed, or productivity.

    InfluxDB has become a popular insight system for unified metrics and events enabling the most demanding SLAs. InfluxDB is used in just about every type of industry across a wide range of use cases, including network monitoring, IoT monitoring, industrial IoT, and infrastructure and application monitoring.

    InfluxDB offers its users:

    • Infrastructure and application monitoring: Collect, process, and analyze real-time data from edge devices to help optimize distributed infrastructure. 
    • IoT monitoring and analytics: InfluxDB is designed to store large volumes of time series data and quickly perform real-time analysis on that data. Gain insights from all the sensor data and use the collected data to create and perform automated tasks. 
    • Network monitoring: Manage responsive and high-performing networks with widely distributed resources.

    InfluxDB Benefits

    There are several benefits to using InfluxDB . Some of the biggest advantages the solution offers include:

    • APIs and ready toolset: InfluxDB can be accessed via a set of powerful tools enabling users to get started quickly, with less programming required. This includes a REST API, extensive client libraries, a wide variety of open-source integrations, and Flux - a functional data scripting language for querying, analysis, and events. The InfluxDB API can be used to write data from edge devices to the InfluxDB instance 
    • Time series engine: Get any data - events, logs, traces - from any edge device - systems, sensors, queues, databases, and networks. This data is stored in a powerful and high-performing engine capable of ingesting millions of data points per second.
    • Community: InfluxDB has a large community of cloud and open-source developers ready to assist users. 
    • Ready-made templates: Use InfluxDB Templates, a set of tools with a packager and other ready monitoring solutions. These tools allow users to share their monitoring expertise with coworkers and other community members around the world. The Templates gallery offers available templates for some of the most popular tools and applications.
    • Enhanced UI: InfluxDB’s UI includes an explorer, dashboarding tools, and a script editor. Use it to easily browse the collected metric and event data and apply common transformations. The dashboarding tool comes with a variety of visualization options that help users view insights from the data. The script editor assists users to quickly master Flux with easily accessible examples, auto-completion, and real-time syntax checking.

    Reviews from Real Users

    InfluxDB stands out among its competitors for a number of reasons. Two major ones are its flexible integration options and its data aggregation feature.

    Shalauddin Ahamad S., a software engineer at a tech services company, notes, “The most valuable features are aggregating the data and the integration with Grafana for monitoring.”

    Neo4j is the graph database solution allowing the analysis of complex relationships and patterns in data, leading to better decision-making and improved business processes. The graph database offers easy data integration from multiple sources, providing a more comprehensive view. 

    The most valuable aspect of a graph database is its performance and response time, as it does not use the join function and only has nodes and raw data. Overall, Neo4j, as a global first-ranking solution, has helped organizations become more efficient and effective in data analysis and decision-making processes.

    Sample Customers
    37signals, Adconion,adgooroo, Aggregate Knowledge, AMD, Apollo Group, Blackberry, Box, BT, CSC
    ebay, AXA, Mozilla, DiDi, LeTV, Siminars, Cognito, ProcessOut, Recommend, CATS, Smarsh, Row 44, Clustree, Bleemeo
    Walmart, Telenor, Wazoku, Adidas, Cerved, GameSys, eBay, Schleich, ICIJ, die Bayerisch, Megree, InfoJobs, LinkedIn
    Top Industries
    REVIEWERS
    Financial Services Firm25%
    Computer Software Company21%
    Insurance Company14%
    Comms Service Provider11%
    VISITORS READING REVIEWS
    Financial Services Firm22%
    Computer Software Company16%
    Educational Organization8%
    Manufacturing Company8%
    VISITORS READING REVIEWS
    Computer Software Company12%
    Financial Services Firm12%
    Educational Organization9%
    Manufacturing Company7%
    VISITORS READING REVIEWS
    Financial Services Firm15%
    Computer Software Company15%
    Comms Service Provider11%
    University8%
    Company Size
    REVIEWERS
    Small Business28%
    Midsize Enterprise17%
    Large Enterprise55%
    VISITORS READING REVIEWS
    Small Business17%
    Midsize Enterprise9%
    Large Enterprise74%
    REVIEWERS
    Small Business22%
    Midsize Enterprise22%
    Large Enterprise56%
    VISITORS READING REVIEWS
    Small Business21%
    Midsize Enterprise14%
    Large Enterprise65%
    VISITORS READING REVIEWS
    Small Business20%
    Midsize Enterprise15%
    Large Enterprise65%
    Buyer's Guide
    NoSQL Databases
    April 2024
    Find out what your peers are saying about MongoDB, Couchbase, InfluxData and others in NoSQL Databases. Updated: April 2024.
    767,667 professionals have used our research since 2012.