Responsibilities
:
Build and Scale Data Infrastructure: Develop and maintain scalable data pipelines to support AI, analytics, and trading systems-particularly for time-series, unstructured, and text-heavy datasets.Collaborate with AI and Research Teams: Partner with data scientists and researchers to support model development, fine-tune data retrieval processes, and operationalise RAG systems.Support Experimentation and Prototyping: Contribute to flexible data systems that enable rapid experimentation and smooth transitions from prototype to production.Automate and Optimise Workflows: Streamline ETL processes, reduce manual overhead, and improve the performance and reliability of data operations.Ensure Data Quality and Monitoring: Implement validation frameworks, monitoring tools, and alerting systems to maintain high data integrity and availability.Contribute to Best Practices: Help shape documentation standards, coding practices, and dataernance processes.
Requirements:
5+ years of experience in data engineering, with a strong focus on building scalable data platforms. Proficiency in Python and modern data libraries ( Pandas, PySpark, Dask). Strong SQL skills and experience with cloud-native data tools (AWS, GCP, or Azure). Hands-on experience with tools like Airflow, Spark, Kafka, or Snowflake. Experience working with unstructured data, NLP pipelines, and time-series databases. Familiarity with deploying AI/ML models and supporting MLOps workflows. Interest in or experience with Retrieval-Augmented Generation (RAG) systems. Strongmunication skills and a collaborative, proactive mindset.
Nice to Have:
Experience with LLM pipelines and vector databases ( Pinecone, FAISS). Familiarity with data versioning and experiment tracking tools ( DVC, MLflow). Background in supporting AI/ML research teams or trading environments.
On Offer:
A role contributing to the development of next-generation AI and trading systems. Exposure to a high-calibre team of engineers, researchers, and data scientists.
If you're a data engineer looking to work on impactful systems in a collaborative, forward-thinking environment, we'd love to hear from you.
Job ID PR/550866