Lead Data Engineer

Capgemini
London
4 weeks ago
Applications closed

Get The Future You Want!

Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you'd like, where you'll be supported and inspired by a collaborative community of colleagues around the world, and where you'll be able to reimagine what's possible. Join us and help the world's leading organizations unlock the value of technology and build a more sustainable, more inclusive world.

Your Role:

We are seeking a highly experienced Lead Data Engineer with a strong background in working with cross-functional teams, particularly Data Science, with expertise in the Azure Modern Data Platform and knowledge of GCP. The ideal candidate will take ownership of solutioning and mentoring the team, ensuring a smooth transition from Jupyter notebooks to Azure Databricks. Expertise in writing ETL frameworks with PySpark, together with Python coding skills for data migration and transformation (on-premises or cloud), is essential. This role requires a deep understanding of both traditional and NoSQL databases, distributed data processing, and data transformation techniques.

  • Data Ingestion: Manage the ingestion of varied and unstructured data from REST APIs, as well as structured data from transactional databases. Experience in the design and development of Medallion architecture with Databricks and Unity Catalog governance principles is required.
  • Data Analysis and Transformation: Demonstrate excellent data analysis skills with strong SQL knowledge, along with robust reporting and data transformation capabilities, especially using PySpark throughout the development.
  • Validation and Verification: Perform thorough validation and verification using automation techniques by writing efficient unit test cases to ensure data quality.
  • Project and Stage Planning: Develop specific, targeted, and well-detailed project and stage plans.
  • Tool Proficiency: Utilize tools such as Jira, ADO, and Confluence effectively.
  • Coordination and Communication: Exhibit strong coordination and communication skills, reporting back on progress and alignment with the implementation strategy for data pipelines and production deployments.
  • Defect Management: Ensure transparency in defect management and work towards resolving defects promptly, irrespective of the defect owner.
  • Mentorship: Mentor and guide the team, ensuring they can hit the ground running and effectively collaborate with the Data Science and DevOps teams.

Your Profile:

  • Proven experience with Azure Data Factory and Azure Databricks with Unity Catalog.
  • Strong proficiency in PySpark and Python for data processing.
  • Strong SQL knowledge.
  • Good understanding of GCP services such as BigQuery.
  • Good understanding of Airflow scheduling.
  • Knowledge of Terraform for infrastructure management.
  • Experience with CI/CD tools and practices.
  • Understanding of data security, access controls, and compliance.
  • Excellent problem-solving skills and attention to detail.
  • Strong communication and collaboration skills, with leadership qualities to mentor and lead the team in providing data solutions.
  • Ability to work independently and as part of a team.
  • Experience with other cloud platforms and data engineering tools.
  • Proven experience with data warehousing concepts and best practices.
  • Certification in relevant technologies (e.g., Azure, GCP).

About Capgemini

Capgemini is a global business and technology transformation partner, helping organizations to accelerate their dual transition to a digital and sustainable world while creating tangible impact for enterprises and society. It is a responsible and diverse group of 350,000 team members in more than 50 countries. With a heritage of more than 55 years, Capgemini is trusted by its clients to unlock the value of technology to address the entire breadth of their business needs. It delivers end-to-end services and solutions leveraging strengths from strategy and design to engineering, all fueled by its market-leading capabilities in AI, cloud, and data, combined with its deep industry expertise and partner ecosystem. The Group reported 2023 global revenues of €22.5 billion.

Get The Future You Want | www.capgemini.com
