Our organisation is a regional healthcare network comprising 12 hospital sites, 3 specialist centres, and a group of 47 affiliated outpatient clinics. We employ approximately 8,400 staff and manage data across a diverse technology estate that reflects nearly three decades of system acquisitions, integrations, and migrations.
Our data function is undergoing a structured modernisation programme aligned with the NHS Data Strategy. The objective of the programme is to consolidate our data infrastructure onto a unified cloud-based platform, improve data quality and lineage documentation, and enable the safe and governed use of patient data for operational analytics and clinical research.
We are seeking a Senior Data Engineer to join the data platform team at a pivotal phase of this programme. The role carries significant technical responsibility and requires someone who can navigate the complexity of a large, heterogeneous data environment with patience and methodical discipline. Relevant healthcare data standards (HL7 FHIR, SNOMED CT, ICD-10) and UK information governance frameworks are central to this environment, and familiarity with them is a meaningful advantage.
Responsibilities
Design and implement data pipelines that migrate data from legacy on-premises systems to our Azure-based unified data platform
Build and maintain data quality monitoring and lineage documentation across all migrated datasets
Collaborate with clinical informatics, information governance, and infosec teams to ensure pipelines meet NHS DSP Toolkit requirements
Define and enforce data engineering standards: pipeline structure, testing, documentation, and deployment process
Mentor two junior data engineers through code review and structured technical development
Requirements
6+ years of data engineering experience, including at least three years in large or complex organisations (NHS, financial services, or similarly regulated environments preferred)
Strong Python and SQL — you build pipelines that are reliable, monitored, and understandable by the team that runs them after you leave
Apache Spark for large-scale data processing in a clinical or enterprise context
Azure Data Factory or equivalent cloud orchestration platform — ADF preferred given our stack
Delta Lake or Apache Iceberg for managing large tabular datasets with ACID guarantees
Familiarity with data governance tooling: data cataloguing, lineage tracking, and access control frameworks
Experience with HL7 FHIR or equivalent healthcare data interoperability standards is a significant advantage
Benefits
NHS pension scheme — one of the most valuable public sector benefits available in the UK
Hybrid working: three days remote, two days at our data centre in Bristol
£65,000 – £78,000 per annum, Agenda for Change Band 8a equivalent
Access to NHS Learning Hub and £1,800 annual CPD budget
27 days annual leave plus bank holidays, increasing with service