National AI Awards 2025Discover AI's trailblazers! Join us to celebrate innovation and nominate industry leaders.

Nominate & Attend

Databricks Engineer

London
3 weeks ago
Create job alert

Data Pipeline Development:

Design and implement end-to-end data pipelines in Azure Databricks, handling ingestion from various data sources, performing complex transformations, and publishing data to Azure Data Lake or other storage services.
Write efficient and standardized Spark SQL and PySpark code for data transformations, ensuring data integrity and accuracy across the pipeline.
Automate pipeline orchestration using Databricks Workflows or integration with external tools (e.g., Apache Airflow, Azure Data Factory).
Data Ingestion & Transformation:

Build scalable data ingestion processes to handle structured, semi-structured, and unstructured data from various sources (APIs, databases, file systems).
Implement data transformation logic using Spark, ensuring data is cleaned, transformed, and enriched according to business requirements.
Leverage Databricks features such as Delta Lake to manage and track changes to data, enabling better versioning and performance for incremental data loads.
Data Publishing & Integration:

Publish clean, transformed data to Azure Data Lake or other cloud storage solutions for consumption by analytics and reporting tools.
Define and document best practices for managing and maintaining robust, scalable data pipelines.
Data Governance & Security:

Implement and maintain data governance policies using Unity Catalog, ensuring proper organization, access control, and metadata management across data assets.
Ensure data security best practices, such as encryption at rest and in transit, and role-based access control (RBAC) within Azure Databricks and Azure services.
Performance Tuning & Optimization:

Optimize Spark jobs for performance by tuning configurations, partitioning data, and caching intermediate results to minimize processing time and resource consumption.
Continuously monitor and improve pipeline performance, addressing bottlenecks and optimizing for cost efficiency in Azure.
Automation & Monitoring:

Automate data pipeline deployment and management using tools like Terraform, ensuring consistency across environments.
Set up monitoring and alerting mechanisms for pipelines using Databricks built-in features and Azure Monitor to detect and resolve issues proactively.
Requirements

Data Pipeline Expertise: Extensive experience in designing and implementing scalable ETL/ELT data pipelines in Azure Databricks, transforming raw data into usable datasets for analysis.
Azure Databricks Proficiency: Strong knowledge of Spark (SQL, PySpark) for data transformation and processing within Databricks, along with experience building workflows and automation using Databricks Workflows.
Azure Data Services: Hands-on experience with Azure services like Azure Data Lake, Azure Blob Storage, and Azure Synapse for data storage, processing, and publication.

Data Governance & Security: Familiarity with managing data governance and security using Databricks Unity Catalog, ensuring data is appropriately organized, secured, and accessible to authorized users.
Optimization & Performance Tuning: Proven experience in optimizing data pipelines for performance, cost-efficiency, and scalability, including partitioning, caching, and tuning Spark jobs.
Cloud Architecture & Automation: Strong understanding of Azure cloud architecture, including best practices for infrastructure-as-code, automation, and monitoring in data environments

Related Jobs

View all jobs

Data Engineer (Databricks Champion)

Data Engineer (Databricks Champion)

Data Engineer (Databricks Champion)

Data Engineer (Databricks Champion)

Data Engineer (Databricks Champion)

Data Engineer (Databricks Champion)

National AI Awards 2025

Subscribe to Future Tech Insights for the latest jobs & insights, direct to your inbox.

By subscribing, you agree to our privacy policy and terms of service.

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

How to Get a Better Data Science Job After a Lay-Off or Redundancy

Redundancy can be tough to face, especially in a competitive field like data science. But it’s important to know: your experience, analytical thinking, and modelling skills are still in demand. Across sectors like healthcare, finance, e-commerce, government and AI startups, UK employers continue to seek data scientists who can deliver value through insight, prediction, and automation. This guide will walk you through how to bounce back from redundancy with purpose and clarity—whether you're a data analyst looking to step up, a mid-level data scientist, or a machine learning specialist seeking a better-aligned opportunity.

Data Science Jobs Salary Calculator 2025: Find Out What You Should Earn in the UK

Why last year’s pay survey is already out of date for UK data scientists “Am I being paid enough?” Every data professional eventually asks that question—often after a teammate reveals a hefty counter‑offer, a recruiter shares a six‑figure opening, or a headline trumpets the latest multimillion‑pound AI investment. Yet salary guides published even twelve months ago belong in a museum. Generative‑AI hype re‑priced Machine‑Learning Engineer roles, LLM fine‑tuning turned Prompt Engineering into a genuine career path, & fresh regulation forced companies to hire Responsible‑AI Officers on senior‑scientist money. To cut through the noise, DataScience‑Jobs.co.uk distilled a transparent, three‑factor formula. Insert your role, your region, & your seniority, and you’ll get a realistic 2025 salary benchmark—no stale averages, no vague ranges. This article walks you through the formula, examines the forces pushing data‑science pay ever higher, and offers five concrete actions to boost your market value within ninety days.

How to Present Data Science Solutions to Non-Technical Audiences: A Public Speaking Guide for Job Seekers

The ability to communicate clearly is now just as important as knowing how to build a predictive model or fine-tune a neural network. In fact, many UK data science job interviews are now designed to test your ability to explain your work to non-technical audiences—not just your technical competence. Whether you’re applying for your first data science role or moving into a lead or consultancy position, this guide will show you how to structure your presentation, simplify technical content, design effective visuals, and confidently answer stakeholder questions.