Senior Site Reliability Engineer

Posted 3 days 6 hours ago by Caspian One Ltd

125 000,00 € - 175 000,00 € Annual
Permanent
Not Specified, Ireland
Job Description

Senior Systems Reliability Engineer (SRE)

Employment Type: Full-time, Permanent
Location: Ireland (Remote)
Salary: 125,000-175,000 Euros

About the company:

A global fintech organisation operating mission-critical trading technology is expanding its engineering presence in Ireland. The team builds and supports high-performance, low-latency systems used across financial markets, with a strong engineering culture and focus on reliability, fairness and technical excellence.

They are now hiring their next Senior Systems Reliability Engineer to support their mission-critical trading systems.

The Role

As a Senior SRE, you'll join a highly technical, engineering-driven environment responsible for the reliability, performance, and operational excellence of a large-scale, bare-metal trading platform. This is a hybrid role combining systems engineering, observability, automation, and real-time operational support.

You'll work across the full stack (Linux, networking, applications, hardware) and play a key role in building a follow-the-sun support model with teams in the U.S. and Europe.

What You'll Do

  • Own the technical operations of trading systems running on bare-metal infrastructure
  • Monitor, troubleshoot, and resolve issues across OS, network, hardware, and application layers
  • Build and improve automation, tooling, and configuration management (Ansible or similar)
  • Develop and maintain observability dashboards, alerts, and telemetry pipelines
  • Participate in deployments, start-up/shutdown procedures, and change management
  • Contribute to engineering projects such as OS tuning, kernel-level optimisation, and performance improvements
  • Collaborate with platform, development, and market operations teams
  • Participate in the on-call rotation (1 week on; occasional Saturdays for industry-wide testing)
  • Document processes, mentor teammates, and promote operational best practices

What You Will Bring

Must-Have Technical Skills

  • Strong Linux experience (comfortable with system processes, logs, services, troubleshooting)
  • Hands-on scripting with Python or Bash
  • Experience with Ansible or similar configuration management tools
  • Solid understanding of networking fundamentals: TCP/IP, routing, multicast
  • Experience supporting large, distributed, or high-availability systems
  • Observability skills: Prometheus, Grafana, Splunk, Graylog, telemetry, alerting systems (e.g. Alertmanager), log pipelines

Nice-to-Have

  • Experience with bare-metal deployments
  • Kernel tuning and kernel-bypass techniques
  • KDB experience
  • Familiarity with Arista/Cisco switches, Corvil, Solarflare/Mellanox NICs
  • Understanding of trading systems

Who You Are

This matters as much as the tech.

You are someone who:

  • Works well independently and in distributed teams
  • Communicates clearly and calmly
  • Is collaborative, low-ego, and easy to work with
  • Can follow processes while still thinking critically
  • Learns quickly and enjoys understanding complex systems
  • Thrives in a high-trust, engineering-focused culture

Why Join?

  • Work on mission-critical, low-latency trading systems
  • Highly technical environment with deep engineering challenges
  • Exposure to kernel tuning, networking, automation, and performance optimisation
  • Flexible working arrangements with opportunity for travel