Site Reliability Engineer (SRE) - Platform Infrastructure Team

Posted 12 days 16 hours ago by OnHires

100 000,00 € - 125 000,00 € Annual
Permanent
Paris, France
Job Description

Location: Remote (EU time zone)

Start Date: ASAP

About the Company

Our client is a fast-growing, fully independent SaaS product company with a portfolio of 10+ B2C digital products - and more on the way. They build their own products, use them internally, and scale them globally. No external clients, no investors, and no startup risk.

The team includes 90+ professionals working remotely from over 20 countries across Europe and the Middle East. In 2025, the company plans to launch up to 10 new products beyond the MVP stage.

About the Role

Our client is looking for a Site Reliability Engineer (SRE) to join their Platform Infrastructure team.

In this role, you'll take ownership of reliability, scalability, and observability across their systems. You'll work closely with DevOps and product engineers to ensure smooth deployments, robust CI/CD pipelines, and a secure, highly available infrastructure that performs under high load.

Responsibilities

  • Design, implement, and maintain secure, scalable infrastructure on AWS

  • Improve system observability (metrics, logs, traces, alerts)

  • Define and implement SLIs/SLOs across services

  • Enhance and manage Kubernetes clusters and multi-environment deployments

  • Collaborate with product teams to ensure apps (Next.js, NestJS) are production-ready

  • Support and improve CI/CD pipelines (GitHub Actions, Infrastructure as Code, automation)

  • Contribute to incident response, post-mortems, and capacity planning

  • Build automation for system self-healing and recovery

  • Promote and implement SRE best practices across the company

Requirements

  • 3+ years in SRE, DevOps, or Platform Engineering roles

  • Strong experience with AWS and Kubernetes in production environments

  • Good understanding of modern web app stacks (experience with Next.js and NestJS is a plus)

  • Deep expertise in monitoring, alerting, and observability tools

  • Experience defining and managing SLIs/SLOs

  • Hands-on experience with high-load, high-throughput systems

  • Proficiency with Terraform or similar Infrastructure-as-Code tools

  • Scripting skills to automate manual processes

  • Solid troubleshooting mindset and strong systems thinking

  • Familiarity with CI/CD tooling (e.g., GitHub Actions)

  • Team-oriented, startup-friendly mindset

Nice-to-Have

  • Experience working in fast-paced startup environments

  • Exposure to multi-product SaaS platforms

Key Performance Indicators

  • Service Uptime

  • MTTR / MTBF

  • Monitoring Coverage

What our client offers

  • 22 paid vacation days + public holidays based on your country

  • Fully remote role with flexible working hours (7:00-18:00 GMT+2)

  • Annual performance bonus

  • Sponsored upskilling and development

  • Annual company retreats in Europe (fully covered)

  • High-caliber, senior-level team

  • No startup chaos - only focused, scalable product work

Ready to engineer systems that power a growing portfolio of B2C products? Apply now!