
Artificial Intelligence Engineer - Distributed Inference

Job details

Danucore
Birmingham
1 day ago


AI Engineer - Distributed Inference Specialist

Do you want to be a spectator or a player as the world races to develop AGI?

Are you ready to be a pioneer of AI?

Why join us?

At Danucore, we are on the hunt for BRILLIANT MINDS to join a team of visionaries and innovators dedicated to building distributed supercomputers and AI systems which are:

Faster ⚡️ – From building and deploying AI datacentres at speed to optimising the AI workloads that run on them, we want to be the fastest.

Cheaper – AI should be accessible to all. We lower the costs of AI deployment with careful hardware deployments and software systems that ensure efficient resource utilisation.

Kinder – Our systems are designed to benefit humanity. We do not allow our systems to participate in military, gambling or pornography applications.

Greener – We optimise energy consumption with an integrated hardware and software solution that leverages renewable energy and optimises heat recovery, all running under energy-aware orchestration systems that optimise workloads.

Cleverer – We develop agentic AI systems to make our systems intelligent and constantly improving.

Help us build systems to ensure the power of frontier AI remains accessible and give users sovereignty over their AI systems.

Join us in ensuring that the most transformative technology in human history remains in the hands of humanity itself. Let's make AI development transparent, accessible, and aligned with the interests of humanity, not just the profits of a few. ⚡

About the Role

This role is for those obsessed with pushing the boundaries of AI model performance.

We're looking for someone who gets excited about shaving milliseconds off inference time, gaining every percentage point of GPU utilisation, and knowing how many watts were consumed to achieve it. ⚡️

You'll work directly with cutting-edge models, from LLMs to multimodal systems, and with large GPU clusters, finding innovative ways to make them run faster, more efficiently, and more accessibly on diverse hardware setups.

What We're Looking For

In team members:

Passion for AI: A strong desire to influence the future of technology and its societal impact.
Willingness to Learn: We're looking for future experts with curious minds and a growth mindset.
Open-Mindedness: Ready to challenge the norm and think outside the box?

and for the role:

Evidence of deploying and optimising AI models in multi-GPU and multi-node systems
Good working knowledge of leading AI runtimes: PyTorch, vLLM, TensorRT, ONNX Runtime, Llama.cpp
Experience with distributed inference engines: Ray Serve, Triton Inference Server, vLLM, SLURM
Knowledge of AI compilers: OpenXLA, torch.compile, OpenAI's Triton, MLIR, Mojo, TVM, MLC-LLM ⚙️
Good working knowledge of inter-process communication: message queues, MPI, NCCL, gRPC
Good working knowledge of high performance networking: RDMA, RoCE, InfiniBand, NVIDIA GPUDirect, NVLink, NVIDIA DOCA, Magnum IO, DPDK, SPDK
Experience with model quantisation, pruning, and sparsity techniques for performance optimisation.

And bonus points if you have:

a homelab, blog, or a collection of git repos showcasing your talents and interests
made contributions to open-source projects or publications in the field of AI/ML systems optimisation

Let us know in your cover letter which of the above you have worked with or consider relevant! ✨

Key Responsibilities

Design and implement high-performance distributed inference systems for running large language models and multimodal AI models at scale
Optimise model serving infrastructure for maximum throughput, minimal latency, and optimal power efficiency ⚡
Develop and maintain deployment pipelines for efficient model serving and monitoring in production
Research and implement cutting-edge techniques in model optimisation, including pruning, quantisation, and sparsity methods
Design, build and configure experimental hardware setups for model serving and optimisation
Design and implement robust testing frameworks to ensure reliable model serving ✅
Collaborate with the team to build and improve our distributed inference platform, making it more accessible and efficient for users
Monitor, optimise and document system performance metrics, including latency, throughput, power consumption and benchmark scores

How Can We Tempt You?

Exceptional Financial Package: Enjoy a competitive compensation structure, including an enticing EMI scheme that rewards your brilliance.

Envious Compute Power: Gain access to a vast array of cutting-edge computing resources to bring your ideas to life!

Support for Your Vision: We believe that the brightest minds often have their own innovative projects. Let's collaborate! Share your ideas, and work with our team and support network to make them happen!

Make an Impact: Join a passionate team dedicated to creating positive change in the world. The future is ours to shape, and together we can ensure it's for the better.

Dynamic Start-Up Culture: Dive in from day one! Experience the thrill of a start-up environment where you can roll up your sleeves and make a real difference right away.

How to Apply

Email your cover letter and CV to with subject "AI Engineer - Distributed Inference"

In your cover letter, please include details of:

what parts or technologies mentioned in this job advert you have experience with and can add value with
links to any public work, e.g. GitHub profile, blogs or papers
