A quick look at Data Lake

AWS S3 + Glue + Athena

For example, we can dump all of the unstructured raw data into Amazon S3, which provides virtually unlimited storage capacity.

Partition for query performance

With query performance in mind, it’s probably not a good idea to just throw all the data into one big bucket.
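One common way to avoid a single flat bucket is to encode partition values in the object key, using the Hive-style `key=value` path layout that Glue and Athena can recognize as partitions. The sketch below is only an illustration: the table name `events`, the year/month/day partition scheme, and the helper `partitioned_key` are all assumptions, not something prescribed by this article.

```python
from datetime import date


def partitioned_key(table: str, day: date, filename: str) -> str:
    """Build a Hive-style partitioned S3 object key.

    Hypothetical layout: <table>/year=YYYY/month=MM/day=DD/<filename>
    """
    return (
        f"{table}/year={day.year:04d}/month={day.month:02d}/"
        f"day={day.day:02d}/{filename}"
    )


# Example: where one day's raw file would land under this assumed layout.
key = partitioned_key("events", date(2021, 3, 7), "part-0000.json")
print(key)  # events/year=2021/month=03/day=07/part-0000.json
```

With a layout like this, a query that filters on `year`, `month`, and `day` only needs to read the objects under the matching prefixes instead of scanning the whole bucket.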
