Senior Site Reliability Engineer for PaaS

Location: Woking, GB

Senior Site Reliability Engineer for PaaS

Reporting to the Head of SW and Engineering  

McLaren Applied

McLaren Applied works at the intersection of technology, data and human ingenuity to deliver quantifiable performance advantage to key sectors undergoing large-scale disruptive change; Motorsport, Automotive, Transport, IoT and Health. 

Our Team

Our product development teams have a diverse range of skilled professionals across Software, Data Science, Simulation, Systems and other engineering disciplines. These teams collaborate to build products that solve challenges across a wide range of industries.

Our teams use a data-led approach to solve difficult business problems. They employ modern software tools and practises with cloud-based infrastructure and services to deliver scalable solutions that provide:

  • Insight, condition monitoring and predictions of system behaviour.
  • Communications and data gathering services with supporting edge and cloud services.
  • Virtual product development lifecycle, including the use of our industry leading simulator.

The Role

This is an opportunity to join our Software Engineering community as a Site Reliability Engineer (SRE) to work on initiatives that:

  • Lead strategic imperatives that create and improve the McLaren Applied ‘Platform as a Service’ offering to the product delivery teams.
    • Enable product development teams to deliver software products on immutable infrastructure.
    • Develop and facilitate production and development infrastructure and associated tooling.
    • Integrate third party managed services used for delivery and development lifecycle.
  • Lead on implementation of the McLaren Applied cloud governance policy in the form of a PaaS and be a key participant in maintaining and updating the policy in accordance with technology shifts, customer feedback and product development feedback.

You will need to be an open-minded technologist, who values a collaborative work environment and is willing to learn and explore as the fast-paced industry evolves and changes.

Your skills will be utilised for:

  • Leadership of the Site Reliability Engineering (SRE) community.
  • Automated infrastructure creation, monitoring and deployment.
  • Evaluation and implementation of tools for monitoring, security and automation.
  • Forward thinking by evaluating the technology horizon and maintaining our tech radar.
  • Best practises as related to risk mitigation of downstream scalability, performance, resilience, reliability and security.
  • Evaluation, monitoring and development of cost models.
  • Participation in estimation sessions.
  • DevOps culture coaching and knowledge sharing with the wider Software and Analytics community.

Key Responsibilities:

  • Key contributor to the Technical Roadmap for the McLaren Cloud Platform (PaaS).
  • Development and maintenance of the McLaren Cloud Platform (PaaS)
    • Collaborate and consult with software engineers and data scientists to help design and implement robust and scalable software products.
    • Knowledge sharing and education of team members to enable our DevOps culture.
    • Proactively monitor costs and security posture of the PaaS and products running on it.
  • Define and implement tooling to continually improve our software development, release and maintenance processes.

Example scenarios you will be helping us with

  • Design and implement infrastructure for sensitive workloads where our platform infrastructure needs to be isolated with no / limited internet access (inbound and outbound).
  • Design and implement monitoring infrastructure for real-time data streams.
  • Design network and access to allow software engineers and data scientists to access services in AWS while keeping the services and data safe and secure.
  • Enable and collaborate with teams to automate the entire delivery of a product. From a single web application to the configuration of a cloud account.
  • Design and implement security and access management so that users and roles have access only to resources they need within the AWS account.

Experience

  • Experience working with delivery teams deploying software on the cloud.
  • Experienced in strategic technical leadership.
  • Evidence of tailored and contextual communication in all directions to realise value as feedback and enquiry.
  • Hands on experience in delivering production quality services.

Required:

  • Experience with a major cloud vendor, preferably AWS.
  • Expertise in containerization and container orchestration (preferably Docker and Kubernetes) and related managed services (e.g. EKS).
  • Experience with Identity as a Service and SSO integrations.
  • Experience securing cloud environments.
  • Experienced in:
    • Infrastructure as code (preferably Terraform and Terragrunt).
    • Desired State Configuration (Ansible)
    • Python, Golang
    • Test driven methodologies.
    • Implementation of automated build and release pipelines.
    • Infrastructure diagnostics and monitoring (e.g. ELK, TIGK, TICK)

Desirable

  • Experience in designing, developing and deploying data intensive (high frequency or big data) solutions.
  • Familiarity with AWS Well-Architected Framework.
  • AWS Certified Solutions Architect – Professional or AWS Certified DevOps Engineer – Professional.

Work Location

You will be based at Victoria Gate in Woking. 

Recruitment Process

If our technical recruiters find your application is suitable for the role, you will be asked to complete a technical challenge followed by a phone interview.