Oracle index to AWS Redshift Sortkey

Question

I am new to Redhsift and migrting oracle to Redshift.

One of the oracle tables have around 60 indexes. AWS recommends its a good practice to have around 6 compound sort keys.

How would these 60 oracle indexes translate to Redhsift sort keys ? I understand there is no automated conversion or can't have all 60 of them as compound sort keys. I am new to redshift and May I know , how usually this conversion is approached.

In Oracle we can keep adding indexes to the same table and the queries / reports can use them. But in Redshift Changing sortkeys is through recreating the table. How do we make all queries which uses different filter columns and join columns on the same table have best performance?

Thanks

nevsv nevsv · Accepted Answer · 2017-10-09T09:03:12

Redshift is columnar database, and it doesn't have indexes in the same meaning as in Oracle at all.

You can think of Redshift's compound sort key (not interleaved) as IOT in Oracle (index organized table), with all the data sorted physically by this compound key.

If you create interleaved sort key on x columns, it will act as a separate index on each of x columns in some manner.

In any way, being columnar database, Redshift can outperform Oracle in many aggregation queries due to it's compression and data structure. The main factors that affect performance in Redshift are distribution style and key, sort key and columns encoding.

If you can't fit all your queries with one table structure, you can duplicate the table with different structure, but the same data. This approach is widely used in big data columnar databases (for example projections in Vertica) and helps to achieve performance with storage being the cost.

Please review this page with several useful tips about Redshift performance: https://aws.amazon.com/blogs/big-data/top-10-performance-tuning-techniques-for-amazon-redshift/

Oracle index to AWS Redshift Sortkey

2 Answers