ECS / Docker NFS Issue hunt downThis post appeared initially on the Scout24 Engineering Blog Who doesn’t know this situation: Sometime something weird is going on in your setup, but everything is so vague that you don’t know where to start. This is a story about a bug we experienced in a production setup and how we found out what was the root cause. The problem Imagine a ECS cluster with multiple tasks running with an NFS (EFS in this case) backed persistent root volume. Continue reading
SREcon17 EMEA Dulin recap
What is SRE?
Site reliability engineering (SRE) is a discipline that incorporates aspects of software engineering and applies that to operations whose goals are to create ultra-scalable and highly reliable software systems. A more detailed explanation can be found at Googles SRE page
AWS multi account infrastructure
Back in October 2016 I held a talk in the Munich AWS User Group about managing AWS Multi account infrastructure at glomex. This post is summarizing the talk and appeared initially on the glomex techblog.Continue reading