Amazon Athena, Amazon S3, and VPC Flow Logs

In this hands-on lab, you will use Amazon Athena, Amazon S3, and VPC Flow Logs to build an analytics platform for VPC network traffic that can be searched with SQL-like queries.

Successfully complete this lab by achieving the following learning objectives.

1. Create the Amazon S3 Bucket

  • Create a new S3 bucket that is prefixed with csaa-hol-
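
The bucket step above can also be scripted. The following is a minimal sketch, assuming Python with boto3 installed and AWS credentials already configured; the helper names, the random suffix, and the region values are illustrative assumptions, not part of the lab. Only the csaa-hol- prefix comes from the lab instructions.

```python
import uuid

BUCKET_PREFIX = "csaa-hol-"  # prefix required by the lab


def make_bucket_name(prefix: str = BUCKET_PREFIX) -> str:
    """Return the lab prefix plus a random suffix (hypothetical helper).

    S3 bucket names are globally unique, so a random suffix reduces collisions.
    """
    return f"{prefix}{uuid.uuid4().hex[:12]}"


def create_bucket_kwargs(name: str, region: str) -> dict:
    """Build keyword arguments for the boto3 S3 client's create_bucket call."""
    kwargs = {"Bucket": name}
    # us-east-1 is the default location and must not be sent as a LocationConstraint.
    if region != "us-east-1":
        kwargs["CreateBucketConfiguration"] = {"LocationConstraint": region}
    return kwargs


def create_lab_bucket(region: str = "us-east-1") -> str:
    """Create the bucket and return its name. Requires boto3 and AWS credentials."""
    import boto3  # imported lazily so the helpers above work without boto3

    name = make_bucket_name()
    s3 = boto3.client("s3", region_name=region)
    s3.create_bucket(**create_bucket_kwargs(name, region))
    return name
```

Calling create_lab_bucket() with the region your lab VPC runs in (an assumption you need to check in the console) returns the generated bucket name for use in the flow log step.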

2. Create the VPC Flow Log and Generate Records

  • Create a brand new VPC Flow Log for the entire VPC.

  • Name the flow log vpc-to-s3

  • Set the filter to All

  • Set the maximum aggregation interval to 1 minute

  • Configure the flow log to be sent to your new Amazon S3 bucket

  • Use the AWS default format

  • Use the Parquet log file format

  • Enable Hive-compatible S3 prefixes

  • Partition logs every 1 hour

  • Browse to the DNS entry for OurApplicationLoadBalancer using HTTP and refresh a couple of times to generate traffic

  • Wait a few minutes, then refresh the S3 object list until new objects appear before moving on
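
The flow log settings listed above map onto the parameters of the EC2 CreateFlowLogs API. The following is a minimal sketch, assuming Python with boto3 and configured AWS credentials; the function names and the example VPC ID and bucket name are hypothetical, and the bucket ARN assumes the standard aws partition. The AWS default format is selected by leaving LogFormat unset.

```python
def flow_log_kwargs(vpc_id: str, bucket_name: str) -> dict:
    """Build keyword arguments for the boto3 EC2 client's create_flow_logs call."""
    return {
        "ResourceIds": [vpc_id],
        "ResourceType": "VPC",          # log the entire VPC
        "TrafficType": "ALL",           # filter: All
        "MaxAggregationInterval": 60,   # seconds, i.e. 1 minute
        "LogDestinationType": "s3",
        "LogDestination": f"arn:aws:s3:::{bucket_name}",
        # LogFormat is omitted on purpose so the AWS default format is used.
        "DestinationOptions": {
            "FileFormat": "parquet",
            "HiveCompatiblePartitions": True,
            "PerHourPartition": True,   # partition logs every 1 hour
        },
        "TagSpecifications": [
            {
                "ResourceType": "vpc-flow-log",
                "Tags": [{"Key": "Name", "Value": "vpc-to-s3"}],
            }
        ],
    }


def create_vpc_flow_log(vpc_id: str, bucket_name: str) -> list:
    """Create the flow log and return its IDs. Requires boto3 and AWS credentials."""
    import boto3  # imported lazily so flow_log_kwargs works without boto3

    ec2 = boto3.client("ec2")
    response = ec2.create_flow_logs(**flow_log_kwargs(vpc_id, bucket_name))
    return response["FlowLogIds"]
```

For example, create_vpc_flow_log("vpc-0123456789abcdef0", "csaa-hol-example") would request the flow log for a placeholder VPC and bucket; substitute the real values from your lab account. Generating traffic against OurApplicationLoadBalancer and checking the bucket for objects remain manual steps in this sketch.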

3. Set up Amazon Athena
