Gopi Krishnan's Professional Profile

User Activity

Almost 4 years ago

Answered a question: What needs improvement with Apache Spark?

Apache Spark still has room for improvement in integration and speed. The Apache Spark community could use Rust or C++ implementations to improve performance.

Almost 4 years ago

Answered a question: What advice do you have for others considering Apache Spark?

I would say that some use cases don't need Apache Spark at all and can be implemented as an ordinary Python, Go, or Java application. For use cases where Apache Spark gives better performance and shorter processing time, we can go for Apache Spark. I…
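To illustrate the point about ordinary applications, here is a minimal sketch of a small extract-transform-aggregate job in plain Python, with no cluster involved. The sample data, the column names, and the `total_by_region` helper are all hypothetical and chosen only for illustration; whether this approach is enough depends on data volume and latency needs.

```python
import csv
import io
from collections import defaultdict

# Hypothetical sample data; a real job would read from a file or a database.
RAW_CSV = """region,amount
north,10.5
south,4.0
north,2.5
south,
east,7.0
"""


def total_by_region(csv_text):
    """Parse CSV text, skip rows with a missing amount, and sum amounts per region."""
    totals = defaultdict(float)
    for row in csv.DictReader(io.StringIO(csv_text)):
        amount = row["amount"].strip()
        if not amount:  # transform step: drop incomplete rows
            continue
        totals[row["region"]] += float(amount)
    return dict(totals)


if __name__ == "__main__":
    print(total_by_region(RAW_CSV))
```

For data that fits comfortably in memory on one machine, a script like this is often simpler to deploy and debug than a Spark job. A distributed engine becomes worth considering once the data no longer fits on a single node or the processing time becomes a problem.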

Almost 4 years ago

Answered a question: What do you like most about Apache Spark?

I love every core functionality of Apache Spark. Initially, it provided only the basic RDD interface for processing data across a distributed cluster. It then evolved to the DataFrame and Dataset interfaces, with an optimised execution engine and more flexibility for developers to…

Almost 4 years ago

Answered a question: What is your experience regarding pricing and costs for Apache Spark?

Apache Spark is available through cloud services such as AWS and Azure. We have to choose the specific service for our use case. For example, we can use AWS Glue, which runs Spark for ETL processes, or AWS EMR / Azure Databricks for on-demand data processing in the cloud. Basically it…

Almost 4 years ago

Answered a question: What is your primary use case for Apache Spark?

Apache Spark can be used for many use cases in big data and data engineering. We use Apache Spark for ETL, integration with streaming data, real-time prediction (such as anomaly detection and price prediction), and data exploration on large volumes of data.

Almost 4 years ago

Answered a question: What is data lake storage?

A data lake can hold vast pools of raw data at an optimal price. One way to imagine a data lake is to compare it with a natural lake, which stores all the water from different sources in its raw form. For any enterprise product there will be more than one application, which…