Engineer the Quantum RevolutionYour expertise can help us shape the future of quantum computing at Oxford Ionics.

View Open Roles

Senior Data Engineer (Data Governance, Databricks, PySpark)

PEXA Group Limited
Leeds
1 month ago
Create job alert

Hi, we’re Smoove, part of the PEXA Group.

Our vision is to simplify and revolutionise the home moving and ownership experience for everyone. We are on a mission to deliver products and services that remove the pain, frustration, uncertainty, frictionand stress that the current process creates.

We are a leading provider of tech in the property sector - founded in 2003, our product focus has been our conveyancer two-sided marketplace, connecting consumers with a range of quality conveyancers to choose from at competitive prices via our easy-to-use tech platform. We are now building out our ecosystem so consumers can benefit from our services either via their Estate Agent or their Mortgage Broker,throughsmarter conveyancing platforms, making the home buying or selling process easier, quicker, safer and more transparent

Why join Smoove?

Great question! We pride ourselves on attracting, developing and retaining a diverse range of people in an equally diverse range of roles and specialisms – who together achieve outstanding results. Our transparent approach and open-door policy make Smoove a great place to work and as our business expands, we are looking for ambitious, talented people to join us.

We are looking for a technically proficient Senior Data Engineer to join our growing Data team.

Your primary focus will be on ensuring data quality, stability, and reliability — from the moment data arrives in its rawest form to when it is used in decision-making dashboards and customer-facing reports.

You will optimise the transformation pipeline from start to finish, guaranteeing that datasets are robust, tested, secure, and business-ready.

Our data platform is built using Databricks, with data pipelines written in PySpark and orchestrated using Airflow. You will be expected to challenge and improve current transformations, ensuring they meet our performance, scalability, and data governance needs.

This includes work with complex, nested data structures, ensuring they are reliably parsed and transformed. Experience in managing sensitive data (PII) and implementing GDPR policies is required.

You’ll work closely with analysts, engineers, and business stakeholders to ensure that datasets are not only accurate but also trusted.

You will collaborate with product and engineering teams to incorporate data from new products into our core business datasets, ensuring that these new sources meet our data standards and are quickly usable for business intelligence.

You’ll help put controls in place — such as access policies, metadata layers, and automated data quality checks — to ensure long-term stability. Experience with a data governance platform like Alation is desirable.

While predominantly remote / home based the team meet up to 20-25 days per year for meaningful collaboration in either Leeds or Thame.

Key Responsibilities

  • Ensure end-to-end data quality, from raw ingested data to business-ready datasets
  • Optimise PySpark-based data transformation logic for performance and reliability
  • Build scalable and maintainable pipelines in Databricks and Airflow
  • Implement and uphold GDPR-compliant processes around PII data
  • Collaborate with stakeholders to define what "business-ready" means, and confidently sign off datasets as fit for consumption
  • Put testing strategies in place to detect data issues early and often
  • Contribute to access management, metadata management, and wider data governance practices
  • Help shape our approach to reliable data delivery for internal and external customers

Skills & Experience Required

  • Extensive hands-on experience with PySpark, including performance optimisation
  • Deep working knowledge of Databricks (development, architecture, and operations)
  • Proven experience working with Airflow for orchestration
  • Proven track record in managing and securing PII data, with GDPR compliance in mind
  • Experience in data governance processes; Alation experience preferred, but similar toolswelcome
  • Strong SQL skills and experience optimising complex queries
  • Strong experience in handling and transforming semi-structured data
  • High competency in programming, with a focus on clean, efficient, and production-quality code
  • Demonstrated ability to work with stakeholders to understand data needs and guide the validation and delivery process
  • Experience implementing and maintaining data quality tests and monitoring solutions
  • Strong verbal and written communication skills
  • Ability to think holistically about data reliability and how it serves business decisions
£65,000 - £75,000 a year

Sound like you?

We at Smoove are ready so if this role sounds like you, apply today.

GDPR Compliance

Digital Completion UK Limited (trading name “PEXA”), Optima Legal Services Limited (trading name "Optima Legal") and Smoove Limited(a holding company which comprises of the following wholly owned trading Subsidiary companies: United Legal Services Limited, United Home Services Limited, Legal-Eye Limited, and Amity Law Limited) are all owned directly by DigCom UK Holdings Limited, which is a wholly owned Subsidiary of PEXA Group Limited in Australia (ACN 140 677 792; ASX: PXA) (referred tocollectively as“PEXA Group”).

When we processyour applicant personal data for recruitment purposes, we do so as a controller. If as part of the recruitment process, we share your personal data with anothercompany within thePEXA Group, that company may process your personal data as either an independent controller or, in certain circumstances, a joint controller. By applying for this role, you consent to us processing your personal data in accordance with the UK General Data Protection Regulation ("UK GDPR") and the Data Protection Act 2018, and further information can be found in our privacy noticehttps://pexa.co.uk/applicant-policy/.


#J-18808-Ljbffr

Related Jobs

View all jobs

Senior Data Engineer

Senior Data Engineer

Senior Data Engineer

Senior Data Engineer

Senior Data Engineer

Senior Data Engineer

Subscribe to Future Tech Insights for the latest jobs & insights, direct to your inbox.

By subscribing, you agree to our privacy policy and terms of service.

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

Pre-Employment Checks for Data Science Jobs: DBS, References & Right-to-Work and more Explained

Pre-employment screening in data science reflects the discipline's unique position at the intersection of statistical analysis, machine learning innovation, and strategic business intelligence. Data scientists often have privileged access to comprehensive datasets, proprietary algorithms, and business-critical insights that form the foundation of organisational strategy and competitive positioning. The data science industry operates within complex regulatory frameworks spanning GDPR, sector-specific data protection requirements, and emerging AI governance regulations. Data scientists must demonstrate not only technical competence in statistical modelling and machine learning but also deep understanding of research ethics, data privacy principles, and the societal implications of algorithmic decision-making. Modern data science roles frequently involve analysing personally identifiable information, financial data, healthcare records, and sensitive business intelligence across multiple jurisdictions and regulatory frameworks simultaneously. The combination of analytical privilege, predictive capabilities, and strategic influence makes thorough candidate verification essential for maintaining compliance, security, and public trust in data-driven insights and automated systems.

Why Now Is the Perfect Time to Launch Your Career in Data Science: The UK's Analytics Revolution

The United Kingdom stands at the forefront of a data science revolution that's reshaping how businesses make decisions, governments craft policies, and society tackles its greatest challenges. From the machine learning algorithms powering London's fintech innovation to the predictive models guiding Manchester's smart city initiatives, Britain's transformation into a data-driven economy has created an unprecedented demand for skilled data scientists that far outstrips the available talent. If you've been contemplating a career transition or seeking to position yourself at the cutting edge of the digital economy, data science represents one of the most intellectually stimulating, financially rewarding, and socially impactful career paths available today. The convergence of big data maturation, artificial intelligence mainstream adoption, business intelligence evolution, and cross-industry digital transformation has created the perfect conditions for data science career success.

Automate Your Data Science Jobs Search: Using ChatGPT, RSS & Alerts to Save Hours Each Week

Data science roles land daily across banks, product companies, consultancies, scaleups & the public sector—often buried in ATS portals or duplicated across boards. The fix: put discovery on rails with keyword-rich alerts, RSS feeds & a reusable ChatGPT workflow that triages listings, ranks fit, & tailors your CV in minutes. This copy-paste playbook is for www.datascience-jobs.co.uk readers. It’s UK-centric, practical, & designed to save you hours each week. What You’ll Have Working In 30 Minutes A role & keyword map spanning Core DS, Applied/Research, Product/Decision Science, NLP/CV, Causal/Experimentation, Time Series/Forecasting, MLOps-adjacent & Analytics Engineering overlaps. Shareable Boolean searches for Google & job boards that strip out noise. Always-on alerts & RSS feeds that bring fresh UK roles to you. A ChatGPT “Data Science Job Scout” prompt that deduplicates, scores match & outputs ready-to-paste actions. A simple pipeline tracker so deadlines & follow-ups never slip.