Please share with the community what you think needs improvement with Dataiku Data Science Studio.
What are its weaknesses? What would you like to see changed in a future version?
When the flows get complex, there are too many data sets on them. Normal users will get confused by the influx of information. This was a problem up until recently, however, they have since resolved it. Now, there's a feature called Zones(introduced in version 8) that allows users to collapse multiple flows into a single flow. It allows everyone to easily carry on with their work. Other than that issue, that has now been resolved, there isn't anything lacking by way of features. The interface for the web app can be a bit difficult. It needs to have better capabilities, at least for developers who like to code. This is due to the fact that everything is enabled in a single window with different tabs. For them to actually develop and do the concurrent testing that needs to be done, it takes a bit of time. That is one improvement that I would like to see - from a web app developer perspective.
I think the interface is very nice, but for somebody who is not as familiar with IT as I am, it may be much more difficult for them. It is nice for me because I'm familiar with this type of software that falls in the realm of the data science platform. I can see how a client who really doesn't know anything about IT or computers might try to use it and find that it would be a little difficult to access some features. That type of user may really need training in order to work with Dataiku. So, in the next release of Dataiku DSS (Data Science Studio), they should make it more friendly for everybody to use, not just IT people. For me, I find that it is a little slow during use. When I use Dataiku to run my script to transfer data, it takes more time than I would expect for the operation to complete.
From an administrative point of view, I would like to be able to communicate with the users who are logged into the system. For example, I would like to be able to send a broadcast message that says "I am shutting down the system." I would like to see more organization and better cohesion within the tool. In the next release of this solution, I would like to see deep learning better integrated into the tool and not simply an extension or plugin. I would like to have a better way to manage images and sound. The error messages are not self explanatory and can sometimes be difficult to understand.
I would like to have better exclusion of data capability. The ability to have charts right from the explorer would be an improvement. I would like to see additions to the architecture for specific business use cases.
Hello community members,
There are many Data Science Platforms available. Which platform would you recommend that can handle large amounts of data? Why?
Let the community know what you think. Share your opinions now!