Principal SRE - Site Reliability

Posted 21 hours 53 minutes ago by CBSbutler Ltd.

Permanent
Part Time
Other
Berkshire, Wokingham, United Kingdom, RG401
Job Description

Role: Principal Site Reliability Engineer (Platform / DevOps)

Location: Wokingham (Reading) - Hybrid (60% remote / 40% onsite)

Duration: 6 Months+

Rate: £500-£520 per day

Clearance: Active SC Clearance required (mandatory)

Overview

We are seeking an experienced Principal SRE / Platform Engineer to lead platform-first initiatives focused on scalability, reliability, and performance across distributed systems. This role requires strong DevOps expertise and the ability to design and maintain resilient cloud-based infrastructure.

Key Responsibilities
  • Lead platform-first engineering initiatives to enhance scalability and reliability
  • Design, build, and maintain resilient infrastructure for distributed systems
  • Implement monitoring and alerting solutions to ensure high availability
  • Collaborate with engineering teams to improve system reliability and mitigate risks
  • Develop and maintain CI/CD pipelines to support efficient deployments
  • Recommend ongoing improvements to platform architecture and processes
  • Ensure compliance with security, governance, and regulatory standards
Required Skills & Experience
  • Strong background in software engineering for large-scale distributed systems
  • Proficiency in Golang, Java, or Python
  • Hands-on experience with AWS, Azure, or GCP
  • Deep knowledge of Kubernetes and container orchestration
  • Proven experience with CI/CD and infrastructure automation
  • Excellent troubleshooting and communication skills