Site Reliability Engineer
Posted 3 days 11 hours ago by Grid Dynamics International, Inc.
Hybrid position with on-calls
We are seeking a highly motivated and skilled Site Reliability Engineer (SRE) to ensure the reliability, performance, and scalability of the client's critical Data Platform solutions. In this role, you will be instrumental in providing dedicated support and maintaining the health of the data infrastructure.
This position involves on-call responsibilities to address critical incidents and maintain system availability.
Essential functions
- Provide dedicated support and ensure the stability and reliability of our Data Platform solutions.
- Participate in an on-call rotation to address and resolve production incidents promptly.
- Utilize infrastructure-as-code (IaC) tools such as CloudFormation and Terraform to manage and provision cloud resources on AWS and GCP.
- Implement and enforce robust data security practices within cloud-based applications.
- Apply a strong understanding of cloud networking concepts (e.g., VPCs, Route 53, Security Groups, NLB/ALB on AWS) to ensure secure and efficient data flow.
- Design, build, and maintain CI/CD pipelines for large-scale applications on AWS, promoting automation and efficient deployments.
- Support and manage applications deployed on Kubernetes, ensuring their scalability and resilience.
- Leverage skills in Go and/or Python for automation and operational tasks.
- Apply excellent analytical and problem-solving skills to troubleshoot complex issues and implement effective solutions.
Qualifications
- Proven experience managing applications, data engineering, machine learning, or data science workloads on AWS and/or GCP.
- Hands-on experience with infrastructure templating tools like CloudFormation and Terraform.
- Solid understanding of data security principles and their application in cloud environments.
- Familiarity with cloud networking concepts, specifically within AWS (e.g., VPCs, Route 53, Security Groups, NLB/ALB).
- Experience in building and maintaining CI/CD pipelines for large-scale applications on AWS.
- Experience with migrating and supporting applications in Kubernetes environments.
- Proficiency in at least one programming language such as Go or Python.
- Exceptional analytical and problem-solving abilities with a proactive approach.
We offer
- Opportunity to work on bleeding-edge projects
- Work with a highly motivated and dedicated team
- Benefits package: medical insurance, sports
- Corporate social events
- Well-equipped office
About us
Grid Dynamics (NASDAQ: GDYN) is a leading provider of technology consulting, platform and product engineering, AI, and advanced analytics services. Fusing technical vision with business acumen, we solve the most pressing technical challenges and enable positive business outcomes for enterprise companies undergoing business transformation. A key differentiator for Grid Dynamics is our 8 years of experience and leadership in enterprise AI, supported by profound expertise and ongoing investment in data, analytics, cloud & DevOps, application modernization, and customer experience. Founded in 2006, Grid Dynamics is headquartered in Silicon Valley with offices across the Americas, Europe, and India.