What needs improvement with Amazon Redshift?

What is Amazon Redshift? Amazon Redshift is a fully administered, petabyte-scale cloud-based data warehouse service. Users are able to begin with a minimal amount of gigabytes of data and can easily scale up to a petabyte or more as needed. This will enable them to utilize their own data to develop new intuitions on how to improve business processes and client relations. Initially, users start to develop a data warehouse by initiating what is called an Amazon Redshift cluster or a set of...

Download Amazon Redshift Report Read more

Related Q&As

Oct 6, 2023

How can I convert SQL functions in Teradata to Redshift manually?

Aug 31, 2023

What are the challenges faced during migrating from Netezza to AWS Redshift?

Mikalai Surta Head of Big Data Department at IBA Group · Answer 1 · 2024-02-22T08:57:40Z

The product must become a bit more serverless. Users should have to pay only for the resources they consume.

Tamás Srancsik Data Analyst Lead at Vectornator · Answer 2 · 2023-09-07T14:30:00Z

It would be good to see Redshift as a serverless offering. The proposition may be unclear, but at the time, there were certain limitations with the pay-as-you-go offering. However, a serverless offering would be more flexible on-demand pricing, which would be good to see because Redshift is not expensive, but I always have to buy a new server if I need more computing than I have. Setting up a new server is an easy task, but it would be better if I could scale my Redshift cluster up or down as needed; still, there is a need for manual control. For example, my analyst team is working on a job that requires a lot of computing and is only needed for this month, week, or even today. The job should scale up and down automatically, but it is not yet fully developed.

score 0 · Answer 3 · 2023-07-27T20:51:51Z

As our scalability requirements and data growth exceeded expectations, Redshift didn't scale up to meet our business needs. So, at that point, we made a switch to Snowflake, which provided the scalability we needed. So, scalability is one area of improvement. One area where Amazon Redshift could improve is in adopting the compute-separate, data-separate architecture, which Delta, Snowflake are adopting, and a few others in the cloud data warehouse spectrum. Although Redshift introduces Aqua to achieve some level of scalability, I still feel that when it comes to scaling up, whether it's vertical scaling or horizontal scaling, there is a noticeable amount of downtime for end consumers. So, if I need to switch from DC1 to DC2, or from one compute/storage optimization to another, I have to bring down the entire cluster and then bring it back up. That's a pain point.

score 0 · Answer 4 · 2023-05-09T07:52:00Z

When compared to Snowflake, Amazon Redshift does not have the capability to dynamically increase the VM file. However, Amazon Redshift provides a virtual database called 'VW' that allows you to increase the size of the warehouse to run faster on a monthly basis without changing anything. This feature is not available in Redshift. So it's a limitation of Redshift. It's not possible to immediately increase the virtual warehouse size in Amazon Redshift. When compared to Snowflake, we cannot increase the virtual warehouse size in Redshift.

reviewer2176086 Data Scientist at vPhrase · Answer 5 · 2023-05-04T09:21:00Z

Specifically, with Redshift Spectrum, some SQL commands have to run on Redshift instead of Redshift Spectrum, which slows down a few things in the tool. In the solution, user-based access is quite hard. In general, certain permissions are difficult to manage. The solution is expensive compared to Snowflake. I have used Snowflake because they're cheaper than Amazon Redshift. Some of the commands are run on Redshift itself, and some of the commands are completed by Spectrum, which is problematic. However, it is faster when computed by Spectrum.

Nishkarsh Jain Sr BI and Data Engineer at Datacult · Answer 6 · 2023-04-12T11:56:03Z

Redshift's serverless technology needs to improve because not everyone is technically inclined. Organizations want to quickly access and import data into their data warehouse without hassle. Redshift's ETL tool, Glue, is not seamlessly integrated with Redshift. I've encountered many instances where it couldn't fetch the perfect data type from the source, which should be intuitive. Snowflake's ETL tool, on the other hand, is more intuitive and seamless.

Ansari Rehman Cloud Data Architect (AWS-Snowflake-Teradata-Oracle) at Capgemini · Answer 7 · 2023-03-13T06:12:00Z

During our last office project, Redshift couldn't perform well even for a data size of 6 TB. Thus, compared to Teradata and Snowflake, the solution needs to work faster. They should extend the plan by including better optimization and readability as we get while using Teradata. Also, they should provide zero-copy coding and sharing facilities.

score 0 · Answer 8 · 2023-03-09T22:01:00Z

They should provide a structured way to work with interim data than to store it in parquet files locally. Also, Redshift is unwieldy. There should be better integration between Python and Redshift. It could be more accessible without using so many sequels. They should make writing and reading the data frames into and from Redshift easier. The performance could be better. I have used Redshift for extensive queries. For the large tables, it's easier to unload to Redshift, but subquery tables that run complex grids are slower for configuration. I have to use the unloaded command to unload the whole table. Further, I have to read the table into a server with extensive memory in Python and process the data ahead. It's not optimal.

Liana Iuhas CEO at Quark Technologies SRL · Answer 9 · 2023-02-20T13:51:11Z

AWS Snowflake has a very good feature for cloning databases. It makes it easy to clone a data warehouse, which is useful. I would like to see this feature in Redshift.

Syed Zakaulla Project Manager at Softway · Answer 10 · 2023-02-13T20:14:00Z

I would like Amazon Redshift to improve its performance, analytics, scalability, and stability. Other than these points, I am not aware of any other areas to address since Amazon provides a variety of independent services for their customers to choose from, and if one were to express dissatisfaction with Amazon Redshift, Amazon would likely suggest AWS Glue as an alternative. Similarly, if another issue arose, Amazon might recommend Amazon RDS. There are a lot of things they try to upsell to you, each with its own pros and cons and in different packages offering different perks. So, it all depends on your business needs and what you choose for your business. I wouldn't criticize Amazon for this because they have created packages tailored to their customer's needs, which helps to prevent customers from looking elsewhere.

score 0 · Answer 11 · 2022-12-13T14:45:00Z

Pricing sometimes depends on the setup (key, etc.) which makes it hard for somebody new to AWS. Detailed research has to be conducted to end up with a competitive solution in terms of pricing and performance.

HansScholing Consultant at ANWB · Answer 12 · 2022-11-15T15:16:39Z

HS

HansScholing

Consultant at ANWB

Consultant

Top 20

Nov 15, 2022

The customer support could be more responsive.

Jayanta Datta Executive Director at Morgan Stanley · Answer 13 · 2022-07-03T14:37:00Z

Infinite storage is available in Snowflake and is not available in Redshift. Analytical tools for integration would be helpful in the future.

MandarGarge V.P. Digital Transformation at e-Zest Solutions · Answer 14 · 2022-06-07T18:13:19Z

MandarGarge

V.P. Digital Transformation at e-Zest Solutions

Real User

Top 5Leaderboard

Jun 7, 2022

I would like to see improvement in the pricing and the simplicity of using this solution.

Kundan Amin Senior Consultant at Dynamic Elements AS · Answer 15 · 2022-05-19T11:05:00Z

Redshift's GUI could be more user-friendly. It's easier to perform queries and all that stuff in Azure Synapse Analytics.

Atish Asdsd Manager at Protiviti · Answer 16 · 2022-04-25T09:35:15Z

The technical support should be better in terms of their knowledge, and they should be more customer-friendly.

reviewer1752741 Data Analyst at a tech vendor with 51-200 employees · Answer 17 · 2021-12-27T19:44:47Z

I cannot state which features of the solution are in need of improvement, since those which we make use of have not changed. The solution has four maintenance windows so, when it comes to stability, I think it would be better to decrease their number. I rate the solution as an eight out of ten because it is not 100 percent, as there is much service-related maintenance required. It would be nice to see support for the usual LTP features, such as those involving traditional processing.

AmitKulkarni Consultant at kulki data management & consultants · Answer 18 · 2021-10-06T12:35:00Z

Amazon should provide more cloud-native tools that can integrate with Redshift like Microsoft's development tools for Azure.

reviewer997101 Senior Solutions Architect at a retailer with 10,001+ employees · Answer 19 · 2021-08-07T10:41:15Z

Planting is the primary key enforcement that should be improved but there is probably a reason that they don't follow the reference architecture. It means they are creating clones of the data shading. Cost control measures could be improved along with added transparency.

score 0 · Answer 20 · 2021-07-02T07:10:42Z

We recently moved from the DC2 cluster to the RA3 cluster, which is a different node type and we are finding some issues with the RA3 cluster regarding connection and processing. There is room for improvement in this area. We are in talks with AWS regarding the connection issues. In an upcoming release, I would like to have a Snowflake-like feature where we can create another cluster in the same data warehouse, with the same data. You can create a different cluster and compute nodes for each of your use cases, for retail, and for your data analyst all while keeping your underlying data safe. Additionally, the cluster resize process takes down the cluster for too long, approximately 15 minutes. There are limitations to the size, you can resize only by a multiplier of two, for example, if you have four nodes then you can either go to eight nodes or you can come down to two nodes. There should be fewer limitations.

score 0 · Answer 21 · 2021-05-12T13:09:47Z

Improvement could be made in the area of streaming data. The capability can definitely be improved. There are other products like Kinesis which is a separate service we use for streaming data ingestion. Whatever features are missing in Redshift, they have separate sources but if there were the feasibility to ingest real-time streaming data directly into RedShift, that would be very useful.

Eli Misael Manjarrez DBA at Kimetrics · Answer 22 · 2021-02-24T07:09:00Z

Redshift is a multi-tier engine that works like a calculator. There is some missing functionality and sometimes it's so difficult to work in. We need to convert these functionalities using VACUUM inside Amazon Redshift and then it causes some complexity. Sometimes I'd like for them to support some special features or some special installations because we need automatic populations. I would like to see more programming outside of the cloud. I would like to see more functionalities under JSON files. the only functionality that they have now with JSON is reports. I would also like to see other data sources like MongoDB.

MadhavanSrinivasan CEO at Screenit Labs Pvt Ltd · Answer 23 · 2020-10-21T04:34:06Z

We have had some challenges with respect to considering some of the high-end availability architecture for production. We don't find many issues now, but initially, we had some challenges. This is an older product, so when it comes to usability, it requires a technical person to work with it. It requires a specialist and a good business case to work on it. It has to be a little more user-friendly than what it is today. In our experiments, the handling of unstructured data was not very smooth.

Thomas Dallemagne Cloud & Data - practice leader at Micropole Belgium · Answer 24 · 2020-07-19T08:15:38Z

I would like a better way to ingest data in realtime because there is a bit too much latency. There are too many limitations with respect to concurrency. It is now possible to auto-scale it, although that is still slow. It could offer smaller nodes with decoupling of storage and processing because for the moment, the only nodes available to work that way are huge, and for large companies.

Sarfraz Nawaz Chief Executive Officer at Ampcome · Answer 25 · 2020-06-21T08:08:07Z

The OLAP slide and dice features need to be improved. For example, if a business wants to bring in a general ledger from an ERP, they want to slice and dice the data. What we have found is that they have a lot of formulas that are used to calculate metrics, so what we do is use SQL Server Analysis Services. The question then becomes one of adopting a single vendor and transitioning to Azure. If Redshift had similar capabilities then it would be very good.

score 0 · Answer 26 · 2020-06-17T10:55:59Z

The managing updates, deletes, and role-level change performance is very low. For example, while you are doing inserts, updates, deletes, and amalgamates, the performance is very, very poor. If you want to query the database after you have a lot of terabytes of data, the load, performance-wise, is very low. Looking at the performance of the query, querying the database, and especially with the amalgamates when it is getting updated, it is really poor. We like this solution and have tried all of the native services; they were working quite well. The only concern about Redshift was managing the cluster, especially the EMR cluster. Our company policy was not to use EMR clusters, especially with the nodes failing. There were many instances of downtime happening. Essentially, there was too much data traffic. The other drawback was the CDC, as we do not have any tools that can support it. Creating the structure is easy on the DDL side, but after you create the table and you want to transform the data to store it in a database, the performance is poor. It takes a lot of time to ingest and update the data. After you ingest the data and someone wants to fetch it in the table, it takes a lot of time performance-wise to return the results.

score 0 · Answer 27 · 2020-01-22T12:44:00Z

It would be useful to have an option where all of the data can be queried at once and then have the result shown. As it is now, when we run a query and we are looking at the results, part of the data remains to be processed at the back end. That works very well, but in some cases, we require the whole data to be queried at once and then have the results shown. We have not faced many use cases where it would have been useful, but in one or two, we used other methods to achieve this goal. When our clients contact customer support, they don't want to speak with a machine. Instead, they want to chat with a real person who can provide a solution. Customer service bots can provide solutions but they cannot understand our problems.

it_user1256502 Senior System Engineer at Infosys Technologies Ltd · Answer 28 · 2020-01-12T07:22:00Z

Pricing is one of the concerns that I have because if you compare Snowflake with Redshift, it provides some of the same services, but at a much cheaper rate. So pricing is one of the things that it could improve. It should be more competitive. Otherwise, everything else looks good, especially the data storage and analytical processes.

MiodragMilojevic Senior Data Archirect at Telenor · Answer 29 · 2019-11-27T05:42:00Z

From my perspective, the product could be improved by making it more flexible. There are now more flexible products on the market that allow for expandability and dynamic expansion as the market changes with regard to data warehouses. Although the product is simple to use there can be problems. If you declare some unique key in a column and then store it, the database is going to believe this is what you have and results will be distorted. It's fine if the query is simple but if it's complex or you have too many queries per hour, it can create a bottleneck for Redshift and then you can't return and recover. It requires some fine-tuning. For additional features, I would like to see support for partitions, it doesn't exist yet as a feature. It's quite an important issue when you're dealing with large databases. Also, I believe the product needs improvement in parallel threading to support more database users without jeopardizing performance.

score 0 · Answer 30 · 2019-11-07T10:35:00Z

AD

reviewer1221354

Senior Software Engineer at a tech services company with 51-200 employees

Real User

Nov 7, 2019

Running parallel queries results in poor performance and this needs to be improved.

reviewer1207005 Head of Analytics at DXC · Answer 31 · 2019-10-13T07:07:00Z

SV

reviewer1207005

Head of Analytics at DXC

Real User

Oct 13, 2019

The speed of the solution and its portability needs improvement.

Mediha Šiljić Chief Information Officer at Sensilab · Answer 32 · 2019-07-31T05:52:00Z

Compatibility with other products, for example, Microsoft and Google, is a bit difficult because each one of them wants to be isolated with their solutions. That's a big problem now.

Nir Wasserman BI Manager at jfrog · Answer 33 · 2018-09-04T14:04:00Z

In the next release, a pivot function would be a big help. It could save a lot of time creating a query or process to handle operations.