└── README.md


/README.md:
--------------------------------------------------------------------------------
# DevOps Monitoring Challenge
------

Congratulations on making it to the Monitoring Challenge! Companies today are building monitoring
solutions to gain better insight into the health and performance of their computing systems. Your challenge
today is to set up a monitoring stack to capture data points from two live EC2 instances!

**All tools and AWS services are** **FREE!**
**Submission equals invitation to interview!** **Incomplete solutions are welcome!**

**Submit a single PDF containing an image of your dashboard in action and notes on your process.**
We will ask for the live dashboard during the interview!

**The challenge is written with respect to AWS, but GCP and Azure solutions are welcome!**

#### Tips On Getting Started:

* Dive deep into tools! Then build fast!
* Find resources online.
* Keep it in the cloud.
* Automate simulations.

#### Monitoring Challenge + Starting Resources:

1. Set up 3 t2.micro AWS EC2 instances.
    * What is AWS? [(link)](https://youtu.be/qcY-uiEHhn0)
    * How to create an AWS account. [(link)](https://aws.amazon.com/premiumsupport/knowledge-center/create-and-activate-aws-account/)
    * Launching and ssh-ing into your first EC2 instance. [(link)](https://docs.aws.amazon.com/quickstarts/latest/vmlaunch/step-1-launch-instance.html) [(link)](https://youtu.be/BCM9aaaWvR0)
    * Bash commands and scripting 101 [(link)](https://linuxconfig.org/bash-scripting-tutorial-for-beginners)
2. Configure Prometheus to monitor 2 EC2 instances.
    * What is Prometheus? [(link)](https://prometheus.io/docs/introduction/overview/)
3. Connect Prometheus to Grafana.
    * What is Grafana? [(link)](https://grafana.com/grafana/)
4. Collect and display CPU, memory, and disk space usage.
5. Simulate high CPU, memory, and disk usage on monitored instances.

#### Optional:
* Collect and display network traffic rate, upload rate, and download rate.
* Use [DockProm](https://github.com/stefanprodan/dockprom) instead of a manual implementation of Prometheus + Grafana.
* Launch DockProm into a Kubernetes cluster [[1]](https://prometheus.io/docs/prometheus/latest/configuration/configuration/#%3Ckubernetes_sd_config)[[2]](https://www.weave.works/blog/prometheus-and-kubernetes-monitoring-your-applications/)

#### Support:

* Please email monitoring-challenge@insightdevops.com if you need help!

---------

##### Please have Grafana easily visible and ready to run before the interview.



--------------------------------------------------------------------------------
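
For step 2 (configuring Prometheus to monitor two instances), one possible starting point is to generate the scrape configuration from a list of hosts. The sketch below is not part of the challenge itself. It assumes each monitored instance runs node_exporter on its default port 9100, and the IP addresses and the helper name `render_prometheus_config` are hypothetical placeholders.

```python
"""Render a minimal prometheus.yml that scrapes node_exporter on a list of hosts."""

# Assumption: every monitored instance runs node_exporter on its default port.
NODE_EXPORTER_PORT = 9100


def render_prometheus_config(hosts, scrape_interval="15s", job_name="node"):
    """Return prometheus.yml text containing one static scrape job for `hosts`."""
    if not hosts:
        raise ValueError("at least one host is required")
    # Each target is written as 'host:port', quoted for YAML.
    targets = ", ".join(f"'{host}:{NODE_EXPORTER_PORT}'" for host in hosts)
    lines = [
        "global:",
        f"  scrape_interval: {scrape_interval}",
        "",
        "scrape_configs:",
        f"  - job_name: '{job_name}'",
        "    static_configs:",
        f"      - targets: [{targets}]",
    ]
    return "\n".join(lines) + "\n"


# Hypothetical private IPs; replace them with the addresses of your own instances.
EXAMPLE_HOSTS = ["10.0.0.11", "10.0.0.12"]

if __name__ == "__main__":
    print(render_prometheus_config(EXAMPLE_HOSTS), end="")
```

Redirecting the output to `prometheus.yml` on the third instance gives Prometheus a static target list. The security group of the monitored instances would also need to allow inbound traffic on the exporter port from the Prometheus host.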
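
For step 4 (collecting CPU, memory, and disk usage), the following sketch shows PromQL expressions that could back three Grafana panels, plus a small client for the Prometheus HTTP query endpoint to check them from a script. It assumes the metrics come from node_exporter and that the root filesystem is mounted at `/`. The dictionary keys, function names, and the `http://localhost:9090` address are illustrative assumptions.

```python
"""Query Prometheus for CPU, memory, and disk usage percentages per instance."""
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumption: metric names are the ones exposed by node_exporter.
QUERIES = {
    "cpu_used_percent": (
        '100 - (avg by (instance) '
        '(rate(node_cpu_seconds_total{mode="idle"}[5m])) * 100)'
    ),
    "memory_used_percent": (
        "(1 - node_memory_MemAvailable_bytes / node_memory_MemTotal_bytes) * 100"
    ),
    "disk_used_percent": (
        '(1 - node_filesystem_avail_bytes{mountpoint="/"} '
        '/ node_filesystem_size_bytes{mountpoint="/"}) * 100'
    ),
}


def query_url(base_url, promql):
    """Build the instant-query URL for a PromQL expression."""
    return f"{base_url.rstrip('/')}/api/v1/query?{urlencode({'query': promql})}"


def parse_instant_vector(payload):
    """Map each instance label to its sample value from an instant-query response."""
    if payload.get("status") != "success":
        raise RuntimeError(f"query failed: {payload}")
    readings = {}
    for series in payload["data"]["result"]:
        instance = series["metric"].get("instance", "unknown")
        _timestamp, value = series["value"]  # the value arrives as a string
        readings[instance] = float(value)
    return readings


def fetch(base_url, promql, timeout=5):
    """Run one instant query against a live Prometheus server."""
    with urlopen(query_url(base_url, promql), timeout=timeout) as response:
        return parse_instant_vector(json.load(response))


if __name__ == "__main__":
    # Hypothetical address: Prometheus running locally on its default port.
    for name, promql in QUERIES.items():
        print(name, fetch("http://localhost:9090", promql))
```

The same expressions can be pasted into a Grafana panel query once Prometheus is added as a data source.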
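
For step 5 (simulating high CPU, memory, and disk usage), a dedicated stress tool is one option. The sketch below is a dependency-free alternative that can be run on a monitored instance to make the dashboard move. The function names and sizes are illustrative assumptions, and the values should be kept well below the instance's limits.

```python
"""Generate short bursts of CPU, memory, and disk load on the local machine."""
import os
import tempfile
import time

MIB = 1024 * 1024
PAGE_SIZE = 4096  # assumption: typical page size on x86 Linux


def burn_cpu(seconds):
    """Busy-loop on one core for roughly `seconds`; return the iteration count."""
    deadline = time.monotonic() + seconds
    iterations = 0
    while time.monotonic() < deadline:
        iterations += 1
    return iterations


def hold_memory(megabytes, seconds):
    """Allocate `megabytes` MiB, keep it for `seconds`, and return the bytes held."""
    block = bytearray(megabytes * MIB)
    # Write one byte per page so the allocation is actually backed by memory.
    for offset in range(0, len(block), PAGE_SIZE):
        block[offset] = 1
    time.sleep(seconds)
    return len(block)


def fill_disk(directory, megabytes):
    """Write a `megabytes` MiB file into `directory` and return its path."""
    chunk = b"\0" * MIB
    fd, path = tempfile.mkstemp(prefix="load-", dir=directory)
    with os.fdopen(fd, "wb") as handle:
        for _ in range(megabytes):
            handle.write(chunk)
        handle.flush()
        os.fsync(handle.fileno())  # push the data to disk before returning
    return path


if __name__ == "__main__":
    # Illustrative sizes only; remove the file afterwards to free the space.
    print("loop iterations:", burn_cpu(60))
    print("bytes held:", hold_memory(256, 60))
    leftover = fill_disk(tempfile.gettempdir(), 512)
    print("wrote:", leftover)
    os.remove(leftover)
```

A t2.micro has a single vCPU, so one busy loop should be enough to saturate it. Larger instance types would need one process per core.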