Python Developer with PySpark

N Consulting Limited
Northampton
2 months ago
Applications closed

Job Title: Python Developer with PySpark

Location: Northampton

Job Type: Contract

About the Role:
We are seeking a skilled Python Developer with expertise in PySpark to join our dynamic team. The ideal candidate will have strong experience in building and optimizing large-scale data processing pipelines and a deep understanding of distributed data systems. You will play a key role in designing and implementing data solutions that drive critical business decisions.

Key Responsibilities:

  • Develop, optimize, and maintain large-scale data pipelines using PySpark and Python.
  • Collaborate with data engineers, analysts, and stakeholders to gather requirements and implement data solutions.
  • Perform ETL (Extract, Transform, Load) processes on large datasets and ensure efficient data workflows.
  • Analyze and debug data processing issues to ensure accuracy and reliability of pipelines.
  • Work with distributed computing frameworks to handle large datasets efficiently.
  • Develop reusable components, libraries, and frameworks for data processing.
  • Optimize PySpark jobs for performance and scalability.
  • Integrate data pipelines with cloud platforms like AWS, Azure, or Google Cloud (if applicable).
  • Monitor and troubleshoot production data pipelines to minimize downtime and data issues.

Key Skills and Qualifications:

Technical Skills:

  • Strong programming skills in Python with hands-on experience in PySpark.
  • Experience with distributed data processing frameworks (e.g., Spark).
  • Proficiency in SQL for querying and transforming data.
  • Understanding of data partitioning, serialization formats (Parquet, ORC, Avro), and data compression techniques.
  • Familiarity with Big Data technologies such as Hadoop, Hive, and Kafka (preferred but not required).

Cloud Platforms (Preferred):

  • Hands-on experience with AWS services like S3, EMR, Glue, or Redshift.
  • Knowledge of Azure Data Lake, Databricks, or Google BigQuery is a plus.

Additional Tools and Frameworks:

  • Familiarity with CI/CD pipelines and version control tools (Git, Jenkins).
  • Experience with orchestration tools like Apache Airflow or Luigi.
  • Understanding of containerization and orchestration tools like Docker and Kubernetes (preferred).

Education and Experience:

  • Bachelor’s or Master’s degree in Computer Science, Data Engineering, or a related field.
  • 5+ years of experience in Python programming.
  • 4+ years of hands-on experience with PySpark.
  • Experience with Big Data ecosystems and tools.
