Redshift

Amazon Redshift is a fully-managed petabyte-scale cloud-based data warehouse product designed for large scale data set storage and analysis.
Can be used for online analytical processing
Amazon Redshift mostly only supports Single-AZ deployments:
- some clusters are compatible with Multi-AZ
Redshift is based on PostgreSQL
- It’s not used for online transaction processing
- Rather it is OLAP
  - Online analytical processing
    - analytics and data warehousing
10x better performance than other data warehouses
Scale to PBs of data
Columnar storage of data (rather than row-based)
Parallel query engine
Has SQL interface for performing the queries
Any business intelligence tools integrate with it, such as:
- QuickSight
- Tableau Software

Redshift vs Athena

in redshift, you must load the data
Redshift has indexes (Athena doesn’t). Thus,
- Redshift is going to have much faster queries
- Redshift can do much faster joins
- Faster integration

Large inserts are much better
- Large batches of data = efficient
- one row at a time = wildly inefficient