Site Reliability Engineering Manager | SRE

at Distributed
Location Dar es Salaam, Tanzania, United Republic of
Date Posted July 18, 2023
Category Engineering
Management
Job Type Full-time
Currency TZS

Description

About this role 

You'll be creating the brand-new SRE Squad within the Platforms team within Enterprise Digital whose goal is to make life for the development teams in other parts of the business easier by providing a set of low-friction, fully managed cloud capabilities and abstracting these complexities away from the developer – allowing them to focus on code and not infrastructure.

As a hands-on Engineering Manager of the Operations and SRE team, your code will lead by example. Your team will also be responsible for the stability, security, availability and performance of these cloud capabilities which will be built by other teams within the Platforms team. You and your team will also collaborate closely with engineering teams across the tribes -building AI OPS tooling to support the business.

Your Responsibilities

  • Lead and grow a team of engineers and SRE’s in ensuring our platform and the applications running on it are stable and secure ensuring systems remain available with no drops in performance
  • Create and Lead strategy for planned outages and DR exercises
  • Implement monitoring and self-healing capabilities for systems to minimise downtime
  • Provide strategic and operational oversight for Enterprise software product development
  • Work closely with business leaders to develop short and long-term strategies
  • Develop and drive execution on 6 months and 1-year road maps
  • Drive innovation, establish new approaches in improving productivity
  • Establish a metrics-based organisation, develop critical operational metrics and push for continuous improvement
  • Remove manual implementation from workflows and look to automate as much as possible

Required profile for job ad : Site Reliability Engineering Manager | SREAbout You

We’re looking for passionate technologists who enjoy working in collaborative agile teams. You’ll need to be a clear, concise & engaging communicator with people on your team. We enjoy the big picture and the detail; we want people who excel at both.

  • 10+ years of hands-on software engineering experience delivering and managing software in production in a commercial setting, ideally in enterprise environments
  • A good level of commercial expertise in core Java and Spring Boot
  • 4+ years of experience in building software products in JavaScript/Typescript. Java is a bonus
  • 5+ years of experience in an SRE/DevOps or another similar role
  • 5+ years of experience with agile systems development methodologies
  • 3+ years' experience with cloud computing on AWS
  • Passion for automation with a reluctance for manual implementation
  • Experience with DevOps tools, processes, and culture
  • Experience setting and managing Service Level Objectives (SLOs) and Service Level Agreements (SLAs)
  • Experience with ITIL processes is a bonus
Drop files here browse files ...