Senior Data Scientist - AI/ML (CADD)

Chemify Ltd
Glasgow
2 months ago
Applications closed

Related Jobs

View all jobs

Senior Data Scientist

Senior Data Scientist

Senior Data Scientist (GenAI)

Senior Data Scientist

Senior Data Scientist

Senior Data Scientist

About Chemify

Chemify is revolutionising chemistry. We are creating a future where the synthesis of previously unimaginable molecules, drugs, and materials is instantly accessible. By combining AI, robotics, and the world's largest continually expanding database of chemical programs, we are accelerating chemical discovery to improve quality of life and extend the reach of humanity.


Job Description

We seek a talented and motivated Senior AI/ML Data Scientist to pioneer the development and application of cutting‑edge machine learning models for computer‑aided drug design (CADD) and small molecule discovery.


You will be joining a dynamic, cross‑disciplinary team of computational scientists, medicinal chemists, and engineers. Your primary focus will be on architecting, training, and deploying sophisticated models to predict molecular properties, generate novel compounds, and ultimately accelerate our drug discovery pipelines.


To be successful in this role, you’ll need deep expertise in modern machine learning, particularly generative AI (Transformers, Diffusion Models), Graph Neural Networks, and predictive modeling. You will leverage your skills to tackle complex scientific challenges, working with vast and diverse chemical and biological datasets.


If you are passionate about applying state‑of‑the‑art AI to solve fundamental challenges in chemistry and are driven to see your work make a real‑world impact on discovering new medicines, we’d love to have you join our team.


Key Responsibilities

  • Design, develop, and optimize state‑of‑the‑art generative models (e.g., Transformers, GNNs, Diffusion Models) for robotic‑assisted synthetic route design.
  • Architect and implement scalable MLOps pipelines for preprocessing large‑scale chemical and biological datasets, model training, and rigorous evaluation.
  • Translate cutting‑edge research in AI/ML into practical solutions that address critical challenges such as property prediction (ADMET/QSAR), reaction prediction, and binding affinity prediction.
  • Collaborate closely with computational chemists, medicinal chemists, and software engineers to define project goals, interpret model outputs, and integrate AI‑driven insights into our discovery platform.
  • Design and execute robust experiments to evaluate model performance, focusing on chemical validity, novelty, synthesizability, and predictive accuracy against experimental data.
  • Clearly communicate complex technical concepts, model results, and strategic recommendations to both technical and non‑technical stakeholders.
  • Stay at the forefront of AI for drug discovery, foundation models for science, and multimodal learning, continuously identifying and championing opportunities to enhance our capabilities.

What You’ll Bring

  • MSc or PhD with 5+ years of industry or academic experience in Computer Science, Machine Learning, Computational Chemistry/Biology, or a closely related field.
  • Demonstrated proficiency in Python and deep learning frameworks such as PyTorch or TensorFlow.
  • Deep theoretical and practical knowledge of modern machine learning architectures, including Transformers, Graph Neural Networks (GNNs), and generative models (VAEs, GANs, Diffusion Models) as applied to scientific problems.
  • Proven ability to lead complex AI/ML projects from concept to deployment in a scientific or drug‑discovery context.
  • Extensive experience working with large‑scale molecular datasets (e.g., SMILES, 3D conformations), biological data (e.g., protein sequences, assay data), and other scientific data formats.
  • Experience with efficient model training and fine‑tuning techniques, such as LoRA, quantization, distillation, and model pruning.
  • Strong background or hands‑on experience applying ML to problems involving protein structures, small‑molecule interactions, or related biological data.
  • Familiarity with scalable computing environments, GPU acceleration, and distributed training.
  • Excellent communication and interpersonal skills for effective collaboration in a multidisciplinary team.
  • Collaborative mindset, strong communication skills, and ability to work effectively within a cross‑disciplinary team.
  • Excellent problem‑solving skills and a proactive, can‑do attitude.
  • Eagerness to learn new scientific concepts, computational methods, and software engineering practices from experienced mentors.
  • Good understanding of version control with Git.

Beneficial Skills

  • Hands‑on experience with cheminformatics toolkits such as RDKit.
  • Experience with Retrieval‑Augmented Generation (RAG) systems, including vector databases (e.g., Redis, FAISS, Milvus, Pinecone) for querying large chemical or biological databases.
  • Experience with Protein/DNA language models (e.g., ProtBERT, ESM, Evo) or protein structure prediction models (e.g., AlphaFold‑like approaches).
  • Experience with evaluation frameworks for reaction and synthetic route design, including human‑in‑the‑loop assessment and metrics for novelty, diversity, and feasibility of synthetic pathways.
  • Strong experience with relational and non‑relational databases (SQL/NoSQL), including data modeling and efficient querying for large‑scale AI workflows.
  • A portfolio of projects or open‑source contributions (e.g., a GitHub profile) that demonstrates your skills and passion for AI/ML development.


#J-18808-Ljbffr

Subscribe to Future Tech Insights for the latest jobs & insights, direct to your inbox.

By subscribing, you agree to our privacy policy and terms of service.

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

How Many Data Science Tools Do You Need to Know to Get a Data Science Job?

If you’re trying to break into data science — or progress your career — it can feel like you are drowning in names: Python, R, TensorFlow, PyTorch, SQL, Spark, AWS, Scikit-learn, Jupyter, Tableau, Power BI…the list just keeps going. With every job advert listing a different combination of tools, many applicants fall into a trap: they try to learn everything. The result? Long tool lists that sound impressive — but little depth to back them up. Here’s the straight-talk version most hiring managers won’t explicitly tell you: 👉 You don’t need to know every data science tool to get hired. 👉 You need to know the right ones — deeply — and know how to use them to solve real problems. Tools matter, but only in service of outcomes. So how many data science tools do you actually need to know to get a job? For most job seekers, the answer is not “27” — it’s more like 8–12, thoughtfully chosen and well understood. This guide explains what employers really value, which tools are core, which are role-specific, and how to focus your toolbox so your CV and interviews shine.

What Hiring Managers Look for First in Data Science Job Applications (UK Guide)

If you’re applying for data science roles in the UK, it’s crucial to understand what hiring managers focus on before they dive into your full CV. In competitive markets, recruiters and hiring managers often make their first decisions in the first 10–20 seconds of scanning an application — and in data science, there are specific signals they look for first. Data science isn’t just about coding or statistics — it’s about producing insights, shipping models, collaborating with teams, and solving real business problems. This guide helps you understand exactly what hiring managers look for first in data science applications — and how to structure your CV, portfolio and cover letter so you leap to the top of the shortlist.

The Skills Gap in Data Science Jobs: What Universities Aren’t Teaching

Data science has become one of the most visible and sought-after careers in the UK technology market. From financial services and retail to healthcare, media, government and sport, organisations increasingly rely on data scientists to extract insight, guide decisions and build predictive models. Universities have responded quickly. Degrees in data science, analytics and artificial intelligence have expanded rapidly, and many computer science courses now include data-focused pathways. And yet, despite the volume of graduates entering the market, employers across the UK consistently report the same problem: Many data science candidates are not job-ready. Vacancies remain open. Hiring processes drag on. Candidates with impressive academic backgrounds fail interviews or struggle once hired. The issue is not intelligence or effort. It is a persistent skills gap between university education and real-world data science roles. This article explores that gap in depth: what universities teach well, what they often miss, why the gap exists, what employers actually want, and how jobseekers can bridge the divide to build successful careers in data science.