Leave us your email address and we'll send you all the new jobs according to your preferences.
Cloud Site Reliability Engineer
Posted 7 hours 50 minutes ago by GBST Holdings Limited
At GBST, we're inspiring wealth innovation for wealth management and advice organisations globally. Our commitment to excellence, track record of continued and successful delivery, hard work and product excellence has earned us the trust and partnership of many of the world's leading financial services organisations.
We've invested heavily in transforming our technology stack to bring a truly immersive and digital experience to the front and back-office.
We're now on the lookout for a Cloud Site Reliability Engineer to strengthen our Technology team that is working on delivering robust, scalable, and reliable cloud infrastructure and services. In your role as Cloud Site Reliability Engineer, you'll work at the heart of our platform operations, ensuring high availability, reliability, and performance of our cloud-based systems. You'll be responsible for automating infrastructure, implementing resilience strategies, and supporting our global client base with best-in-class reliability engineering.
This is a London based role collaborates with production support, development, cloud platform, and architecture teams to deliver operational excellence and continuous improvement.
What U will do:- Manage and optimise cloud infrastructure to ensure high availability and system reliability.
- Design, deploy, and maintain scalable infrastructure on AWS using Kubernetes, Docker, and Infrastructure as Code (Terraform, CloudFormation).
- Implement and automate resilience testing strategies using chaos engineering tools (e.g., AWS Fault Injection, Gremlin, Chaos Monkey, LitmusChaos).
- Monitor and observe systems using tools such as Prometheus, Grafana, Datadog, New Relic, and Elastic Stack.
- Automate operational processes using scripting languages (Python, Go, Shell, Ruby, Java).
- Participate in incident response, triage, mitigation, and root cause analysis, ensuring minimal downtime and continuous improvement.
- Develop playbooks for common incidents, reducing Mean Time to Resolution (MTTR).
- Design and test disaster recovery strategies, conduct DR drills, and implement multi-region failover and data replication.
- Define and manage Service Level Objectives (SLOs), Service Level Agreements (SLAs), and Service Level Indicators (SLIs).
- Collaborate across teams to improve platform resilience and performance, and mentor others in SRE best practices.
- Ensure compliance with GBST policies, statutory requirements, and industry standards (e.g., PCI DSS, GDPR, ISO 27001).
- Deliver 24/7 support via on call rotation for after hours issues.
- ITIL Foundation Certification, AWS Certified Cloud Practitioner (CCP), and Terraform Associate.
- Hands on experience with AWS cloud administration and automation technologies.
- Skilled in observability tooling (infrastructure monitoring, log aggregation, analytics, APM, Synthetic/RUM).
- Proficient with BitBucket (GIT source code management and CI).
- Experience with observability suites (DataDog, New Relic, Dynatrace, Splunk, Sumo Logic)
- Strong problem solving and debugging abilities.
- Clear communicator and effective collaborator.
- Proactive, organised, and able to manage multiple priorities in a fast paced team.
- AWS SysOps Administrator Certification.
- Experience with zero downtime deployment strategies.
- Background in highly available, secure, and performant production systems.
- Disaster recovery planning, failure injection, and mentoring experience.
- Flexible/hybrid working arrangements
- Instant savings and discounts at major retailers across the country
- Private Health Insurance including Dental and Optical Cover
- Non contributory Pension Scheme
- Salary Sacrifice Schemes - Car, Cycle to Work, and Additional Pension Contributions
- Additional GBST & U day off every year
- Employee Assistance Program (EAP)
- LinkedIn Learning access
If you're looking for a role where you can be a part of exciting innovation, we want to hear from you! apply now or reach out to if you have any questions.
Please note: Due to a high volume of applications received, we are only able to contact applicants progressing to the next stage. We are currently managing recruitment internally and do not require support from external agencies.
GBST Holdings Limited
Related Jobs
Property Services Planner/Scheduler
- £33,185 Annual
- London, United Kingdom
Senior Trade Mark Administrator with Arabic - 100% Remote Working Available
- London, United Kingdom
Clinical Negligence Solicitor
- £45,000 - £65,000 Annual
- Birmingham, United Kingdom
Deputy Head of Operations
- £90,000 - £120,000 Annual
- London, United Kingdom
Tax Disputes Assistant Manager
- £48,000 - £60,000 Annual
- London, United Kingdom