About Us
We are a business identity platform that enables the onboarding of businesses of any size easily and instantly. Our products create trust for financial service providers (our customers) to seamlessly verify, onboard, and monitor businesses (their clients).
Position Overview
We are looking for a Founding Machine Learning Engineer (Natural Language Processing) to join our team. You will be an early shaper of the design and architecture of the core AI technology powering the product, and work closely with the founding team to deliver 10x value to our clients.
As an early-stage startup, we pride ourselves on our ability to roll up our sleeves and build from scratch. We will be shipping code frequently and learning as we go while satisfying all critical security, performance, and uptime requirements for our clients.
We all love to build, scale, and grow: Our leadership team has experience building products, scaling teams, and optimizing for growth while working with ambitious engineers from tech unicorns.
If you’re passionate about building products that make a complex problem easy this job is for you. As one of the first engineering hires, you’ll have the opportunity to shape both the product and the engineering culture. You should be ready to move fast, embrace uncertainty, and wear different hats.
Responsibilities
- Research and experiment with the core algorithms and ML models that will power our platform (intelligent document processing, web data parsing and classification, search and matching)
- Move results of your research to production, owning training and deployment pipelines, as well as inference backend services
- Impact and shape the data science culture
- Continuously improve data science practices and processes
Qualifications
- Effective and self-driven builder
- Bachelor's or Master's degree in Computer Science or a related field
- 5+ years of experience in machine learning and/or data science
- 3+ years of experience in Natural Language Processing - LLMs, classical NLP methods, NER models, syntax parsers, text embeddings & approximate nearest neighbors search
- Experience with any of the following NLP tasks - named entity recognition, intelligent document processing, website parsing & classification, sentiment analysis, information retrieval, entity matching & linking, spelling correction
- Strong knowledge of Mathematical Statistics, Algorithms & Data Structures, ML Theory
- Strong knowledge of Python & SQL
- Experience training & fine-tuning Transformers or Large Language Models (Huggingface Transformers, OpenAI, Llama 2, Langchain)
- Experience serving Neural Networks in production (PyTorch, ONNX, TorchServe, Triton)
- Experience developing & maintaining production ML services
- Experience with ad-hoc analytics, data visualisation, and BI tools (Superset, Redash, Metabase)
- Experience with workflow orchestration tools (Airflow, Prefect)
- Experience writing data processing pipelines & ETL (Python, Apache Spark)
- Excellent communication skills and ability to work collaboratively in a team environment
- Experience with web scraping
Perks & Benefits
- Competitive salary package (including equity)
- Flexible working policy (hybrid)
- Standard benefits
- 25 vacation days
- Sick days and compassionate leave as needed