It differentiates itself from other Hadoop distributions because of the lack of namenode, the base components are implemented in C, and the NFS functionality.
What is most valuable?
MapR is one of three leading Hadoop distributions. The key differentiators I found the most valuable are:
- Lack of namenode (no SPOF)
- Performance – the base components are implemented in C
- NFS functionality – copying to MapR FS is as easy as copying files to a directory
How has it helped my organization?
I haven't yet used it in production environments, but I've used it extensively in two proofs-of-concept.
What needs improvement?
It would be nice to have new developments in the Apache space (Spark, Storm, etc.).
What was my experience with deployment of the solution?
I've had no issues with deployment so far.
What do I think about the stability of the solution?
What do I think about the scalability of the solution?
There have been no scalability issues so far.
How are customer service and technical support?
I haven't yet had to use customer service. Technical Support
I haven't yet had to use technical service.
Which solution did I use previously and why did I switch?
I tested both Cloudera and Hortonworks, and I rate it similarly to Hortonworks and slightly higher than Cloudera.
How was the initial setup?
Initial setup is rather straightforward thanks to detailed documentation covering all the bases. More advanced installations required manual deployment to an extent, but this too is well-documented.
What about the implementation team?
What was our ROI?
I haven't put it into production yet.
What other advice do I have?
I don't have any advice because the installation documentation is superb.
Disclosure: My company has a business relationship with this vendor other than being a customer: We have partnerships with all leading Hadoop vendors.
Jan 03 2016