Berlin, Permanent
You’ll be joining at a time when the concept of SRE is being revolutionized within the company. You will be part of a team of 5 and will own the products around observability and monitoring. You will work with SLIs, SLOs, and SLAs, and will help define the SRE strategy and roadmap.

You’ll initially be working with K8s on AWS/Azure, and will build an internal cluster to provide distributed tracing. Part of your role will be to ensure that critical services have well-configured monitoring and logging. You’ll share your expertise with the development teams and write code to improve the performance of services. You will act as Incident Commander for services provided to end customers at large scale: you will steer the investigation, run debriefs, analyse preventative measures, and get hands-on to improve the TTR.

The SRE setup in this company is fairly mature, but there are still improvements to be made before it reaches its goal of embedding SREs into product teams on a project basis. You’ll have the opportunity to take the initiative, build things, have an impact, and play a pivotal role in helping achieve this goal.