Disclaimer: I’m assuming you are more or less familiar with the main AWS concepts behind EMR, IAM, and S3, and that you know what Zeppelin and Spark are. So, how do you start a big data project at your enterprise?