Leave us your email address and we'll send you all the new jobs according to your preferences.

Stability Governance Program Manager - Vice President

Posted 2 hours 22 minutes ago by Citigroup Inc.

Permanent
Full Time
Executive Jobs
Belfast, City, United Kingdom, BT1 1
Job Description
Overview

Engineer the future of global finance. At Citi, our Tech team doesn't just support finance - we are helping to redefine it. Every day, $5 trillion crosses through our network. We do business in 180+ countries operating at a scale few can match. From deploying advanced AI to helping shape global markets, we build systems that matter. Look to join a team where your work helps influence economies, your ideas can drive innovation and outcomes, and your growth is backed by mentorship, continuous learning and flexibility with potential hybrid work opportunities. Help solve real-world challenges that touch millions and get the opportunity to build the future of finance with Citi Tech.

Responsibilities
  • As a Program Manager for Strategic Initiatives at the Vice President level, you will play a critical role in executing firm-wide strategies, with a focus on enterprise resiliency and recoverability. You will be responsible for leading and driving key workstreams, ensuring the successful delivery of projects that enhance the resilience of critical applications and support broader strategic goals. This role governs the implementation of enhanced testing, recovery, and reporting capabilities, ensuring critical business services remain within defined impact tolerances and minimizing client impact.

  • Lead and govern the implementation and execution of Production Swing testing for critical applications, ensuring applications run from their alternate site for a minimum of 5 days.

  • Drive the implementation and oversight of Data Recovery testing, ensuring applications can recover critical data from backup solutions within the defined Impact Tolerance (ITOL).

  • Drive the onboarding of critical applications to the One-Touch Recovery orchestration solution.

  • Develop and execute strategies to minimize the Recovery Time Actual (TRTA) for critical applications.

  • Serve as a key champion for resilient application design, advocating for and integrating resiliency principles into architectures, and driving the adoption of established resiliency patterns.

  • Leverage cloud-native services and features to enhance application resiliency. This includes services for auto-scaling, load balancing, and disaster recovery.

  • Explore and implement chaos engineering practices to proactively identify and address system weaknesses under stress.

  • Partner with IO owners and platform teams to expand OTR capabilities across diverse technology stacks through API development and integration.

  • Proactively identify vulnerabilities through regular architecture reviews, comprehensive scenario testing, and foundational testing.

  • Document and demonstrate mitigation efforts for all discovered vulnerabilities. This includes developing remediation plans, implementing necessary changes, and validating the effectiveness of mitigations.

  • Ensure that all identified vulnerabilities have remediation plans scheduled.

  • Govern and ensure that all critical applications adhere to operational resilience testing and recovery requirements.

  • Collaborate with relevant stakeholders to define and maintain appropriate impact tolerances for critical business services.

  • Ensure adherence to regulatory requirements for operational resilience including MAS, OCC, and other jurisdictional mandates.

  • Monitor and report on key resilience metrics, including the number of applications executing production swing tests, the number of applications on One-Touch Recovery, recovery times and adherence to operational resilience requirements.

  • Provide regular updates to senior management on the status of resilience initiatives and key performance indicators.

  • Drive the development of resiliency dashboards and self-service reporting capabilities to provide transparency into program progress and application resiliency posture.

Key Qualifications
  • Experience in software engineering, site reliability engineering (SRE), or technology risk and controls.

  • Experience in a program or project management role, delivering complex, cross-functional technology initiatives.

  • Proven expertise in analyzing complex application, database, network, and OS issues across distributed, large-scale, customer-facing systems.

  • Strong understanding of resiliency principles, including disaster recovery, data recovery, and high-availability architecture.

  • Excellent communication skills and a proven ability to work effectively across multiple business and technical teams.

  • Bachelor's degree in Computer Science, Engineering, or an equivalent field.

What we'll provide you

By joining Citi, you will not only be part of a business casual workplace with a hybrid working model (up to 2 days working at home per week), but also receive a competitive base salary (which is annually reviewed), and enjoy a whole host of additional benefits such as:

  • 27 days annual leave (plus bank holidays)

  • A discretionary annual performance related bonus

  • Private Medical Care & Life Insurance

  • Employee Assistance Program

  • Pension Plan

  • Paid Parental Leave

  • Special discounts for employees, family, and friends

  • Access to an array of learning and development resources

Alongside these benefits Citi is committed to ensuring our workplace is where everyone feels comfortable coming to work as their whole self, every day. We want the best talent around the world to be energized to join us, motivated to stay and empowered to thrive. Citi is an equal opportunity employer. If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity review Accessibility at Citi. View Citi's EEO Policy Statement and the Know Your Rights poster.

Email this Job