Leave us your email address and we'll send you all the new jobs according to your preferences.
AI & Model Evaluation Product Director
Posted 2 hours 16 minutes ago by London Stock Exchange Group
Permanent
Full Time
Other
London, United Kingdom
Job Description
LSEG (London Stock Exchange Group) is more than a diversified global financial markets infrastructure and data business. We are dedicated, open-access partners with a commitment to excellence in delivering the services our customers expect from us. With extensive experience, deep knowledge, and worldwide presence across financial markets, we enable businesses and economies around the world to fund innovation, manage risk and create jobs. It's how we've contributed to supporting the financial stability and growth of communities and economies globally for more than 300 years.Through a comprehensive suite of trusted financial market infrastructure services - and our open-access model - we provide the flexibility, stability and trust that enable our customers to pursue their ambitions with confidence and clarity.LSEG is headquartered in the United Kingdom, with significant operations in 70 countries across EMEA, North America, Latin America and Asia Pacific. We employ 25,000 people globally, more than half located in Asia Pacific. LSEG's ticker symbol is LSEG. Our People: People are at the heart of what we do and drive the success of our business. Our culture of connecting, creating opportunity and delivering excellence shape how we think, how we do things and how we help our people fulfil their potential.We embrace diversity and actively seek to attract individuals with unique backgrounds and perspectives. We break down barriers and encourage teamwork, enabling innovation and rapid development of solutions that make a difference. Our workplace generates an enriching and rewarding experience for our people and customers alike. Our vision is to build an inclusive culture in which everyone feels encouraged to fulfil their potential.We know that real personal growth cannot be achieved by simply climbing a career ladder - which is why we encourage and enable a wealth of avenues and interesting opportunities for everyone to broaden and deepen their skills and expertise.As a global organisation spanning 70 countries and one rooted in a culture of growth, opportunity, diversity and innovation, LSEG is a place where everyone can grow, develop and fulfil your potential with meaningful careers. Role Purpose: We are seeking an experienced AI & Model Evaluation Manager to lead the evaluation, validation, and governance of advanced AI, machine learning, and statistical models across our Active Data Layer programme of development.This role blends technical depth, strategic leadership, and strong stakeholder management, ensuring that our models are accurate, reliable, safe, and aligned with regulatory and organisational standards.You will oversee the end-to-end lifecycle of model evaluation - ranging from large language models (LLMs), agentic systems, and machine learning models used across Risk Intelligence to source, determine, match, and resolve model-driven data tasks within our Financial Crime and Screening domains. to end lifecycle of model evaluation.You will act as the product owner of the AILab and associated analytics infrastructure supporting our ADL models, working in close partnership with technology leads on the development and implementation of these new capabilities.As a senior member of the Data & Product team, you will partner closely with engineering and architecture teams, including Data Science, AIOps, as well as the wider business functions such as Risk & Compliance, Legal, and Content Operations to ensure our data capabilities are accurate, fair, robust, auditable, and business-effective.In addition, you will use your experience to shape best practices, drive innovation, and influence broader AI and model governance frameworks. Key Responsibilities: AI & ML Model Evaluation Lead the design and execution of evaluation methodologies for LLMs, multimodal systems, AI agents, and traditional ML models. Oversee scenario based testing, regression suites, multiturn agent simulations, and automated evaluation systems such as LLMasJudge and hybrid scoring approaches. based testing, regression suites, multi turn agent simulations, and automated evaluation systems such as LLM as Judge and hybrid scoring approaches. Build, refine, and maintain frameworks that assess model quality, robustness, performance, safety, explainability, and reliability at scale. Model Validation & Governance Direct the independent review and validation of models across teams, ensuring compliance with internal LSEG governance standards and processes, and relevant regulatory expectations. Maintain a robust model inventory, validation documentation, and version controlled evidence supporting approval and audit requirements. controlled evidence supporting approval and audit requirements. Serve as a subject matter expert on model risk & decision methodologies (as applicable), AI evaluation patterns, and modelling frameworks. Experimentation & Monitoring Oversee the development and operation of online and offline experimentation platforms, including A/B testing, shadow deployments, canary releases, and continuous monitoring. Embed evaluation and experimentation into CI/CD pipelines, enabling automated quality gates and reliable release processes for model-driven products. Implement observability practices that track model drift, degradation, safety issues, and agent behaviour over time. Leadership & Strategy Work collaboratively within a cross-functional high-performing team, fostering innovation, technical excellence, and a collaborative culture. Define and execute strategic direction for AI evaluation, model risk management, and model governance frameworks. Partner with product, engineering, research, risk, compliance, and senior leadership across the organisation to influence AI development practices and decision-making. Represent the function in internal and external audits, regulatory engagements, and cross-functional governance forums. functional governance forums. Stakeholder Engagement Act as a trusted advisor to model owners, developers, and business leaders-translating complex technical findings into actionable insights. Support change management across the organisation to drive consistency in evaluation standards, documentation quality, and responsible AI adoption. Required Experience & Qualifications Bachelor's, Master's, or PhD or equivalent experience in Computer Science, Machine Learning, Applied Mathematics, Statistics, Financial Engineering, or a related quantitative field. Significant professional experience (typically 7-12+ years) in AI/ML product development / management, model validation, quantitative research, risk modelling, or related areas. Demonstrated success working with technical teams or senior specialists in high stakes modelling environments. Deep understanding of AI/ML systems-including LLMs, agentic architectures, RAG pipelines, credit or pricing models, or risk modelling techniques. Hands-on experience developing or validating models, performing statistical testing, and analysing model assumptions, limitations, and risks. Familiarity with model evaluation tooling, experimentation frameworks, and modern ML infrastructure. Excellent communication skills, with the ability to present complex findings clearly to both technical and non technical audiences. Preferred Experience & Qualifications: Experience in B2B data or RegTech environments. Experience managing AI systems in production environments or high scale data and ML platforms. Experience in working with teams in MLOps, DevOps, or large scale compute environments (e.g., GPU clusters, cloud orchestration, Kubernetes). Experience with Generative AI evaluation, agent testing, or AI safety frameworks. Track record of partnering with regulatory bodies or leading audit readiness efforts. What Success Looks Like Models across the organisation consistently meet high standards
London Stock Exchange Group
Related Jobs
Senior Live Learning Manager
- £53,000 - £56,500 Annual
- London, United Kingdom
DevSecOps Engineer
- £40,000 - £50,000 Annual
- Hertfordshire, Stevenage, United Kingdom, SG1 1
ER Advisor 3-Month FTC
- £22 - £23 Hourly
- Hampshire, Portsmouth, United Kingdom, PO1 1
Solicitor - Commercial Property
- Dublin, Dublin, Ireland
Strategic Director of Housing , Assets and Investments ( Local Government
- £700 - £800 Daily
- London, United Kingdom