Principal Data Engineer (Azure, PySpark, Databricks)

PEXA
Leeds
3 weeks ago
Create job alert

Get AI-powered advice on this job and more exclusive features.

This range is provided by PEXA. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more.

Base pay range

Hi, we’re Smoove, part of the PEXA Group. Our vision is to simplify and revolutionise the home moving and ownership experience for everyone. We are on a mission to deliver products and services that remove the pain, frustration, uncertainty, friction and stress that the current process creates. We are a leading provider of tech in the property sector. Founded in 2003, our product focus has been our conveyancer two-sided marketplace, connecting consumers with a range of quality conveyancers to choose from at competitive prices via our easy-to-use tech platform. We are now building out our ecosystem so consumers can benefit from our services either via their Estate Agent or their Mortgage Broker, through smarter conveyancing platforms, making the home buying or selling process easier, quicker, safer and more transparent.

Role We are seeking an experienced Principal Data Engineer to define, lead, and scale the technical strategy of our data platform. This is a senior, hands-on leadership role at the intersection of architecture, governance, and engineering excellence, where you will shape how data is collected, processed, and delivered across the organisation. You will own the end-to-end quality, performance, and scalability of our data systems — from raw ingestion through to trusted datasets powering business-critical analytics and reporting. This includes setting standard and influencing the strategic roadmap for data infrastructure. Our stack is built on both AWS and Azure, using Databricks across data domains and you will lead the evolution of this ecosystem to meet future business needs. You’ll ensure that data is secure, compliant, discoverable, and business-ready, enabling analysts, data scientists, and stakeholders to make confident, data-driven decisions. This role is ideal for a highly technical leader who thrives at both the strategic and execution levels: someone equally comfortable defining architecture with executives, mentoring senior engineers, and optimising distributed pipelines at scale.

Role Responsibilities

  • Design and oversee scalable, performant, and secure architectures on Databricks and distributed systems.
  • Anticipate scaling challenges and ensure platforms are future-proof.
  • Lead the design and development of robust, high-performance data pipelines using PySpark and Databricks.
  • Define and ensure testing frameworks for data workflows.
  • Ensure end-to-end data quality from raw ingestion to curated, trusted datasets powering analytics.
  • Establish and enforce best practices for data governance, lineage, metadata, and security controls.
  • Ensure compliance with GDPR and other regulatory frameworks.
  • Act as a technical authority and mentor, guiding data engineers.
  • Influence cross-functional teams to align on data strategy, standards, and practices.
  • Partner with product, engineering, and business leaders to prioritise and deliver high-impact data initiatives.
  • Build a culture of data trust, ensuring downstream analytics and reporting are always accurate and consistent.
  • Evaluate and recommend emerging technologies where they add value to the ecosystem.

Skills & Experience Required

  • Broad experience as a Data Engineer including technical leadership
  • Broad cloud experience, ideally both Azure and AWS
  • Deep expertise in PySpark and distributed data processing at scale.
  • Extensive experience designing and optimising in Databricks.
  • Advanced SQL optimisation and schema design for analytical workloads.
  • Strong understanding of data security, privacy, and GDPR/PII compliance.
  • Experience implementing and leading data governance frameworks.
  • Proven experience leading the design and operation of a complex data platform.
  • Track record of mentoring engineers and raising technical standards.
  • Ability to influence senior stakeholders and align data initiatives with wider business goals.
  • Strategic mindset with a holistic view of data reliability, scalability, and business value.

Sound like you? We at Smoove are ready so if this role sounds like you, apply today.

To be conducted as part of post offer employment checks: The personal information we have collected from you will be shared with Cifas who will use it to prevent fraud, other unlawful or dishonest conduct, malpractice, and other seriously improper conduct. If any of these are detected, you could be refused certain services or employment. Your personal information will also be used to verify your identity. Further details of how your information will be used by us and Cifas, and your data protection rights, can be found at [Cifas].

GDPR Compliance: Digital Completion UK Limited (trading name “PEXA”), Optima Legal Services Limited (trading name "Optima Legal") and Smoove Limited are owned by DigCom UK Holdings Limited, a subsidiary of PEXA Group. When we process your applicant personal data for recruitment purposes, we do so as a controller. If as part of the recruitment process we share your personal data with another company within the PEXA Group, that company may process your personal data as either an independent controller or, in certain circumstances, a joint controller. By applying for this role, you consent to us processing your personal data in accordance with the UK GDPR and the Data Protection Act 2018, and further information can be found in our privacy notice.

Seniority level

Not Applicable

Employment type

Full-time

Job function

Information Technology

Industries: Information Services, Financial Services, and IT Services and IT Consulting

Referrals increase your chances of interviewing at PEXA by 2x


#J-18808-Ljbffr

Related Jobs

View all jobs

Principal Data Engineer

Principal Data Engineer

Principal Data Engineer

Principal Data Engineer

Principal Data Engineer

Principal Data Engineer

Subscribe to Future Tech Insights for the latest jobs & insights, direct to your inbox.

By subscribing, you agree to our privacy policy and terms of service.

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

Data Science Jobs for Career Switchers in Their 30s, 40s & 50s (UK Reality Check)

Thinking about switching into data science in your 30s, 40s or 50s? You’re far from alone. Across the UK, businesses are investing in data science talent to turn data into insight, support better decisions and unlock competitive advantage. But with all the hype about machine learning, Python, AI and data unicorns, it can be hard to separate real opportunities from noise. This article gives you a practical, UK-focused reality check on data science careers for mid-life career switchers — what roles really exist, what skills employers really hire for, how long retraining typically takes, what UK recruiters actually look for and how to craft a compelling career pivot story. Whether you come from finance, marketing, operations, research, project management or another field entirely, there are meaningful pathways into data science — and age itself is not the barrier many people fear.

How to Write a Data Science Job Ad That Attracts the Right People

Data science plays a critical role in how organisations across the UK make decisions, build products and gain competitive advantage. From forecasting and personalisation to risk modelling and experimentation, data scientists help translate data into insight and action. Yet many employers struggle to attract the right data science candidates. Job adverts often generate high volumes of applications, but few applicants have the mix of analytical skill, business understanding and communication ability the role actually requires. At the same time, experienced data scientists skip over adverts that feel vague, inflated or misaligned with real data science work. In most cases, the issue is not a lack of talent — it is the quality and clarity of the job advert. Data scientists are analytical, sceptical of hype and highly selective. A poorly written job ad signals unclear expectations and immature data practices. A well-written one signals credibility, focus and serious intent. This guide explains how to write a data science job ad that attracts the right people, improves applicant quality and positions your organisation as a strong data employer.

Maths for Data Science Jobs: The Only Topics You Actually Need (& How to Learn Them)

If you are applying for data science jobs in the UK, the maths can feel like a moving target. Job descriptions say “strong statistical knowledge” or “solid ML fundamentals” but they rarely tell you which topics you will actually use day to day. Here’s the truth: most UK data science roles do not require advanced pure maths. What they do require is confidence with a tight set of practical topics that come up repeatedly in modelling, experimentation, forecasting, evaluation, stakeholder comms & decision-making. This guide focuses on the only maths most data scientists keep using: Statistics for decision making (confidence intervals, hypothesis tests, power, uncertainty) Probability for real-world data (base rates, noise, sampling, Bayesian intuition) Linear algebra essentials (vectors, matrices, projections, PCA intuition) Calculus & gradients (enough to understand optimisation & backprop) Optimisation & model evaluation (loss functions, cross-validation, metrics, thresholds) You’ll also get a 6-week plan, portfolio projects & a resources section you can follow without getting pulled into unnecessary theory.