Inspecting Flights within the U.S. with AWS and Energy BI | by Aashish Nair | Jul, 2023


Information Warehousing with AWS Redshift

With AWS Glue, the information that was initially in a flat mannequin can now be represented with a extra becoming star schema in a knowledge warehouse.

The cloud knowledge warehouse for this knowledge can be created with AWS Redshift Serverless. This entails making a namespace named flights-namespace in addition to a database named dev. As well as, it requires a workgroup named flights-workgroup, which can be used to write down SQL queries.

Observe: The workgroup has been configured to permit units outdoors of the VPC to entry the database. This can be helpful when creating the visualization with Energy BI

Workgroup (Created by Creator)

Now, we are able to open the question editor in Redshift and begin creating the actual fact and dimension tables within the dev database.

Question Editor (Created by Creator)

First, the 4 tables within the schema must be created within the warehouse utilizing the next instructions:

Created Tables (Created by Creator)

The 4 tables are actually within the knowledge warehouse, however they’re all empty for the reason that knowledge continues to be within the flights-data-processed bucket.

The info could be copied into this knowledge warehouse utilizing the COPY command.

For example, the information in flights.csv could be copied into the flights desk utilizing the next command syntax:

Observe: the iam_role variable needs to be assigned no matter iam position is was chosen when creating the workgroup.

By executing the COPY command for every of the csv recordsdata within the flights-data-processed bucket, the 4 tables needs to be full of the required knowledge.

For example, here’s a preview of the airport desk:

Question Output (Created by Creator)


Related Articles

Leave a Reply

Your email address will not be published. Required fields are marked *

Back to top button