Jobs

Python Developer with Pyspark


Job details
  • N Consulting Ltd
  • 1 week ago

Job Title:Python Developer with PySpark

Location:Northompton

Job Type:Contract

About the Role:
We are seeking a skilled Python Developer with expertise in PySpark to join our dynamic team. The ideal candidate will have strong experience in building and optimizing large-scale data processing pipelines and a deep understanding of distributed data systems. You will play a key role in designing and implementing data solutions that drive critical business decisions.

Key Responsibilities:

  • Develop, optimize, and maintain large-scale data pipelines using PySpark and Python.
  • Collaborate with data engineers, analysts, and stakeholders to gather requirements and implement data solutions.
  • Perform ETL (Extract, Transform, Load) processes on large datasets and ensure efficient data workflows.
  • Analyze and debug data processing issues to ensure accuracy and reliability of pipelines.
  • Work with distributed computing frameworks to handle large datasets efficiently.
  • Develop reusable components, libraries, and frameworks for data processing.
  • Optimize PySpark jobs for performance and scalability.
  • Integrate data pipelines with cloud platforms like AWS, Azure, or Google Cloud (if applicable).
  • Monitor and troubleshoot production data pipelines to minimize downtime and data issues.

Key Skills and Qualifications:

Technical Skills:

  • Strong programming skills inPythonwith hands-on experience inPySpark.
  • Experience with distributed data processing frameworks (e.g., Spark).
  • Proficiency in SQL for querying and transforming data.
  • Understanding of data partitioning, serialization formats (Parquet, ORC, Avro), and data compression techniques.
  • Familiarity with Big Data technologies such as Hadoop, Hive, and Kafka (optional but preferred).

Cloud Platforms (Preferred):

  • Hands-on experience with AWS services like S3, EMR, Glue, or Redshift.
  • Knowledge of Azure Data Lake, Databricks, or Google BigQuery is a plus.

Additional Tools and Frameworks:

  • Familiarity with CI/CD pipelines and version control tools (Git, Jenkins).
  • Experience with orchestration tools like Apache Airflow or Luigi.
  • Understanding of containerization and orchestration tools like Docker and Kubernetes (preferred).

Experience:

  • Bachelor’s or Master’s degree in Computer Science, Data Engineering, or a related field.
  • 5+ years of experience in Python programming.
  • 4+ years of hands-on experience with PySpark.
  • Experience with Big Data ecosystems and tools.

Sign up for our newsletter

The latest news, articles, and resources, sent to your inbox weekly.

Similar Jobs

Senior Data Engineer - London - AWS - £80,000 + Benefits

Data Engineer - London - Hybrid - £70,000 - £80,000 + BenefitsCompany Overview:My client is a modern data consultancy that empowers data-driven organisations to realise the full value of its partnered technologies such as Amazon Web Services (AWS), Snowflake and more! They provide consulting and managed services in data engineering,...

City of London

Senior Data Engineer - Remote - £60k

Senior Data Engineer - Remote - £60kExciting opportunity for an experienced cloud data engineer to join an expanding data team who are using data in an exciting and advanced way. They will support your learning and development from a technical and leadership perspective as you help them design, build and...

Newcastle upon Tyne

Senior Data Engineer

Senior Data Engineer - Python / Data Pipelines / Data Platform / AWS - is required by fast growing, highly successful and tech focused organisation.About the jobYou will play a crucial role in designing, building, and maintaining their data platform, with a strong emphasis on streaming data, cloud infrastructure, and...

Cramlington

Senior Data Engineer

Senior Data Engineer - Python / Data Pipelines / Data Platform / AWS - is required by fast growing, highly successful and and tech focused organisation.About the jobYou will play a crucial role in designing, building, and maintaining their data platform, with a strong emphasis on streaming data, cloud infrastructure,...

Tech4 Northumberland

AWS Data Engineer

A world market research company are looking for a passionate Data Engineer to come and join their team.You will be working in the Data Engineering team whose main function is developing maintaining and improving the end-to-end data pipeline that includes real-time data processing; extract, transform load jobs artificial intelligence and...

Trafford Park

Senior Azure Data Engineer - Remote - £60,000

Senior Azure Data Engineer - Remote - £60,000I am working with a data driven Microsoft partnered consultancy who are looking for a Senior Data Engineer to join their growing team. You will have the opportunity to work with some of the latest Microsoft technologies such as Microsoft Fabric and Databricks.You...

Tenth Revolution Group Luton