Redshift Archives - Page 5 of 5

Redshift data ingestion deduplication / upsert ( delete / insert ) using a staging table – and a little tuning

thg / July 15, 2018

Why would you do this? In Redshift blocks are immutable – and are re-written completely – no partial block writes. […]

References to AWS Docs. and git / github For Redshift tuning table design distribution styles and distribution keys and much more…

thg / July 14, 2018

References to AWS Docs. For Redshift tuning table design distribution styles and distribution keys and much more… https://aws.amazon.com/blogs/big-data/amazon-redshift-engineerings-advanced-table-design-playbook-distribution-styles-and-distribution-keys/ https://aws.amazon.com/blogs/big-data/top-10-performance-tuning-techniques-for-amazon-redshift/ https://docs.aws.amazon.com/redshift/latest/dg/c_best-practices-best-dist-key.html

Redshift

redshift diststyle and distkey combination define slice distributions

thg / July 13, 2018

diststyle and distkey combination define slice distributions diststyle defaults to “even” when not defined distkey has no default

Redshift

Setting up Aginity Workbench Desktop Client to connect to Redshift

thg / July 4, 2018

Download Aginity Workbench Desktop Client Here – I believe it is a Windows only tool. http://www.aginity.com/redshift/ I had to

Redshift

redshift show slices from system table stv_slices

thg / July 3, 2018

select * from stv_slices;

Amazon AWS, Big Data, Redshift

Create Volume Test Data Using TPCH-KIT or TPCDS (tpcsds-kit) on Linux and Copy the Data Into Redshift

thg / July 1, 2018

General description of the process Generate a fairly large volume of test data using the tpch-kit – (setup required described

Redshift

Protected: AWS Big Data References EMR / Redshift / Kinesis / Spark / Hadoop / and Use Cases

thg / May 25, 2018

There is no excerpt because this is a protected post.

Redshift

Five part blog on tuning Redshift – starting with determining most frequency scanned tables and segment / table size

thg / May 13, 2018

Reference: https://aws.amazon.com/blogs/big-data/amazon-redshift-engineerings-advanced-table-design-playbook-preamble-prerequisites-and-prioritization/ Important info on scan frequency and table size… and a lot more. This guy is pretty good. Read all

Amazon AWS, Redshift

Maintaining a tables organization for optimal execution plans

thg / May 13, 2018

First Off Note: Constraint Enforcement Redshift uniqueness constraints, primary key constraints, and foreign key constraints are not enforced Unfortunately Redshift uniqueness,

Amazon AWS, Redshift

AWS Redshift Customizing Storage Across Your Cluster / Becoming Performant

thg / May 4, 2018

AWS Redshift Customizing Storage Across Your Cluster / Becoming Performant AWS Redshift is a highly tunable data warehousing environment that