QA Engineer, AI

Posted 7 days 20 hours ago by TORTUS

£80,000 - £100,000 Annual
Permanent
Full Time
Other
London, United Kingdom
Job Description
Who We're Looking For: The One Thing That Matters

You are the last line of defence between our code and patient care.

TORTUS runs in hospitals, GP surgeries, ambulances, and across dozens of clinical environments - on iPads, in Epic, Cerner, Surgery Connect, EMIS, and multiple browsers. You'll own quality end-to-end: building the automated test infrastructure, running AI output evaluations, managing releases, and ensuring nothing ships that could break in any of those places. You'll be managing AI coding agents, building simulation suites for clinical AI outputs, and pioneering what quality engineering looks like when development is AI-accelerated.

We're looking for a technically strong QA engineer who is obsessed with AI tooling. You'll be building and maintaining automated test suites, but you'll also be running multiple coding agent instances, designing Playwright harnesses at scale, and creating entirely new kinds of tests - like evaluating whether our AI-generated clinical notes are correct. You'll also need to be comfortable rolling up your sleeves for hands-on manual and exploratory testing - complex clinical workflows across multiple deployment surfaces demand human judgement, not just automation. If you're the kind of person who sees AI agents as force multipliers rather than novelties, we want to talk to you.

What Else Would Be Great
  • 2-5 years of experience in QA engineering, software testing, or SDET roles
  • Hands-on experience with automated testing frameworks (Playwright, Jest, Vitest, or similar)
  • Genuine enthusiasm for AI-assisted development - you're already using Claude Code, Cursor, Codex, or similar tools to ship faster
  • Familiarity with CI/CD pipelines and integrating quality gates into automated workflows
  • Understanding of requirements traceability and how to maintain requirement-to-test-case mapping
  • Experience writing test plans, protocols, and verification reports
  • Comfort testing across multiple platforms and environments (web, mobile, iPad, embedded in third-party systems)
  • Exposure to compliance standards (IEC 62304, ISO 13485, ISO 14971) or Quality Management Systems is a strong plus - but not required
  • Strong written communication - you'll be producing documentation that goes into our regulatory technical file
What You'll Do
Automated Testing & Infrastructure
  • Own and expand the end-to-end test suite using Playwright, Jest, and Vitest - covering unit, integration, system, and E2E testing
  • Build automated test harnesses for all deployment surfaces: web app, iPad, Epic, Cerner, Surgery Connect, EMIS, and multiple browsers
  • Design test cases covering functional requirements, edge cases, and failure modes
  • Integrate tests into CI pipelines to gate every PR with passing tests
  • Use AI coding agents to accelerate test creation and maintenance
AI Output Evaluation & Simulation
  • Build end-to-end simulation suites for our clinical AI pipeline: upload known audio, then verify that the resulting transcripts, clinical codes, and guidelines are correct
  • Create smoke tests for AI outputs
  • Collaborate with the ML team on AI evaluations - both deterministic tests and non-deterministic model output evaluations
  • Detect quality regressions when prompts, models, or pipelines change
Release Management & QA Gating
  • Own the QA process for versioned releases
  • Run regression and manual exploratory testing across all supported environments before promoting to production
  • Manage the release checklist - ensure nothing ships without verification on all target platforms
Traceability & Compliance
  • Maintain the requirements traceability matrix, linking software requirements to test cases and results
  • Ensure verification coverage with documented evidence for our regulatory technical file
  • Support ISO 13485 audits and Class IIa certification with testing evidence
  • Automate compliance workflows where possible
Defect Management
  • Log, triage, and track defects with clear severity classification
  • Work with developers to reproduce issues, verify fixes, and close the loop
  • Ensure no high-severity defects ship without resolution and re-testing
What Does Wild Success Look Like?

After 12-18 months, you've transformed how TORTUS ships software. The automated test suite covers all major deployment surfaces, and developers trust it to catch issues before they reach staging. You've built a simulation suite that catches AI regressions before they reach clinicians - we know within minutes if a release has broken clinical coding or note generation. The traceability matrix is immaculate, and auditors breeze through our verification evidence. Releases are versioned, documented, and predictable. You've reduced manual regression time by 70%+ through automation and AI-assisted test generation. The engineering team sees you as the person who makes it safe to move fast.

Where Does This Role Take You?

This is a foundational role in a fast-growing, safety-critical AI company. High performers will have opportunities to:

  • Lead the QA function as the team scales, potentially growing into a QA Lead or Engineering Manager role
  • Specialise in AI evaluation and clinical safety validation - a genuinely novel discipline
  • Shape the intersection of AI-accelerated development and medical device compliance
  • Build out a quality engineering team as TORTUS grows internationally
Our Stack
  • Google Cloud Platform (GCP), Google Kubernetes Engine (GKE)
  • Terraform, GitOps & ArgoCD
  • PostgreSQL + Redis
  • TypeScript across our stack (React, NestJS), Python/FastAPI for ML workloads
  • Playwright for E2E testing, Jest/Vitest for unit and integration tests
  • Self-hosted ML models, MLflow for experiment tracking
What We Offer
  • Competitive salary + equity
  • 9-day fortnight (every other Friday off)
  • 25 days holiday + bank holidays
  • Mon & Fri WFH optional - office-first culture
  • Latest MacBook + the right equipment to keep you at your best
  • The chance to genuinely transform healthcare
About TORTUS

TORTUS was founded to address one of the most fundamental and persistent problems in healthcare: human error driven by cognitive overload and administrative burden. Modern clinicians are overwhelmed by documentation, compliance, and fragmented digital systems, leaving less time and attention for patient care.

Our mission is to eliminate avoidable human error in medicine by augmenting clinicians with real-time, agentic AI. An AI co-pilot for every clinician.

The core product is a real-time AI system that operates inside live patient consultations. It transcribes and structures conversations, surfaces relevant clinical context and guidelines, and executes downstream actions such as documentation, prescribing workflows, and follow-ups. Doctors and AI collaborate in real time, with the clinician always in control.

Traction
  • 500,000+ paid consultations processed
  • 10x year-on-year growth
  • 60 NHS hospitals deployed
  • 60-80% daily adoption when rolled out
  • 25% time savings per clinician
  • Regulated medical device (Class I today, progressing to Class IIa)
Location

We are an office-first company based in the historic Holborn Town Hall. Where possible, we aim to be in the office at least 3 days a week (typically Tuesday, Wednesday, and Thursday).