Leave us your email address and we'll send you all the new jobs according to your preferences.

Site Reliability Engineer

Posted 15 days 2 hours ago by Searchability

£70,000 - £75,000 Annual
Permanent
Not Specified
Other
Cheshire, Chester, United Kingdom, CH1 1
Job Description

Site Reliability Engineer

Role Description:

An opportunity for an experienced site reliability engineer to work for a globally recognised company in the heart of Chester on a hybrid working basis has arisen. You will join a team who are responsible for building a suite of observability tools while working closely with other members of the Network Services team to ensure one of the largest network infrastructures in the world is highly available, resilient, and secure.

Company Benefits

  • 25 days annual leave plus bank holidays and additional dayS going forward
  • Private healthcare for you and your family
  • Competitive pension plan, life assurance and group income protection cover
  • The ability to change your core benefits as well as the option of selecting a variety of flexible benefits to suit your personal circumstances including access to a wellbeing account, travel insurance, critical illness
  • Use of a flex fund to use towards benefits
  • Wellbeing helpline, mental health first aiders and virtual GP service

Main Responsibilities of a Site Reliability Engineer:

  • Maintain and enhance network monitoring, orchestration, and automation solutions, encompassing tasks such as inventory reconciliation, workflow automation, network configuration validation, health monitoring, alert management, and incident resolution.
  • Conduct audits on Network Infrastructure to uphold best practices and standards.
  • Work collaboratively with teams to troubleshoot and address network issues.
  • Develop API-driven services for seamless integration with other systems.
  • Take charge of automating routine tasks and collaborate with colleagues to design and deploy tools aimed at streamlining internal processes and automating end-to-end workflows within the network infrastructure.
  • Create and maintain automated test frameworks and comprehensive documentation.
  • Take personal responsibility for implementing build and release pipelines, overseeing deployment scheduling, and managing issues, risks, and impediments.
  • Collaborate with stakeholders to prioritize and deliver solutions, ensuring successful project outcomes.
  • Plan and execute releases while providing leadership and management to the team.
  • Foster innovation and process improvement through collaboration within the team.
  • Generate reports to identify and address network inventory gaps, ensuring compliance with standards and best practices.
  • Identify vulnerabilities and implement measures to maintain a secure network environment.

Required Skills:

  • Proficiency in Splunk Search Processing Language
  • Strong programming skills with practical experience in Python
  • Hands-on expertise in automation and orchestration tools like Ansible, Itential, or similar platforms
  • Practical experience with network monitoring tools
  • Ability to develop API-based services
  • Solid understanding of Network Domain fundamentals, including expertise in Network Asset and Configuration management processes
  • Familiarity with the Software Development Life Cycle and proficiency in Agile methodologies, utilizing tools such as Bitbucket, JIRA, and Jenkins
  • Analytical and problem-solving abilities to manage multiple project factors concurrently.
  • Excellent communication skills, both verbal and written
Email this Job