Job VC
Strong Data Scientist (NLP / GenAI)
Technologies
Description
🚀 Who we are
Adaptiq
is a technology hub helping fast-growing product companies build and scale high-performing R&D teams. We partner with innovative startups and established tech businesses to deliver cutting-edge solutions across industries.
🧠 About the Product
Our platform is a cloud-based AI-driven workspace that automates statistical analysis validation and generation for clinical research. It serves large pharmaceutical and biotech clients by extracting, validating, and producing complex tabular outputs and regulatory deliverables.
The system handles high volumes of hierarchical tables, figures and listings, applying both classical and generative NLP to accelerate review cycles, reduce manual double-programming and maintain a full audit trail.
🎯 Your Role
We are looking for a
Strong Data Scientist with a focus on NLP and Generative AI
to drive the development of intelligent systems that automate complex analytical workflows.
This is a
research-driven role
, where you will take ownership of problems end-to-end — from understanding the data, to experimenting with approaches and delivering solutions that can be integrated into production.
🔧 What you’ll do
Define and drive the AI research roadmap, mentoring peers on practical implementation.
Design, develop and evaluate NLP and tabular-data algorithms using GenAI, retrieval-augmented generation (RAG), deep learning, classical ML, NER and rule-based methods.
Explore large clinical datasets, perform data cleaning and feature engineering for downstream model training.
Build and maintain data pipelines for extraction, transformation and preprocessing of structured and semi-structured inputs.
Stay current on state-of-the-art techniques in NLP, generative AI and tabular-data analysis, and integrate best practices.
Collaborate with cross-functional teams, including software developers, DevOps engineers to integrate the research solutions into production.
✅ What we’re looking for
3–5 years of industry experience with
NLP as a core focus
Experience working with
structured or tabular data
combined with NLP
2+ years of hands-on experience with deep learning methods and frameworks (e.g. PyTorch, TensorFlow).
Strong hands-on experience with
Generative AI / LLM-based systems
(e.g. RAG, structured output generation, text-to-SQL)
Solid understanding of
classical NLP techniques
(NER, parsing, rule-based methods)
Proven ability to
build and experiment with different approaches
, not just implement predefined solutions
Experience delivering
AI solutions in real-world / production environments
Strong Python skills for
data analysis, experimentation, and prototyping
MSc or PhD
in CS, ML, Data Science, or related field
Strong English communication skills
⭐️ Nice to have
Familiarity with cloud-based NLP platforms and MLOps tooling.
Experience with large-scale table analytics or regulatory statistical outputs.
🎁 What we offer
20 working days of paid vacation + public holidays
Full accounting & legal support
Fully remote setup + co-working option
High-performance equipment
Competitive compensation with
regular performance reviews
💡 Why this role is interesting
Work at the intersection of
NLP, GenAI, and real-world healthcare impact
Solve
non-trivial data problems
(tables + text + regulations)
Influence a
live product used by global pharma companies
Strong focus on
research + practical implementation
Adaptiq
is a technology hub helping fast-growing product companies build and scale high-performing R&D teams. We partner with innovative startups and established tech businesses to deliver cutting-edge solutions across industries.
🧠 About the Product
Our platform is a cloud-based AI-driven workspace that automates statistical analysis validation and generation for clinical research. It serves large pharmaceutical and biotech clients by extracting, validating, and producing complex tabular outputs and regulatory deliverables.
The system handles high volumes of hierarchical tables, figures and listings, applying both classical and generative NLP to accelerate review cycles, reduce manual double-programming and maintain a full audit trail.
🎯 Your Role
We are looking for a
Strong Data Scientist with a focus on NLP and Generative AI
to drive the development of intelligent systems that automate complex analytical workflows.
This is a
research-driven role
, where you will take ownership of problems end-to-end — from understanding the data, to experimenting with approaches and delivering solutions that can be integrated into production.
🔧 What you’ll do
Define and drive the AI research roadmap, mentoring peers on practical implementation.
Design, develop and evaluate NLP and tabular-data algorithms using GenAI, retrieval-augmented generation (RAG), deep learning, classical ML, NER and rule-based methods.
Explore large clinical datasets, perform data cleaning and feature engineering for downstream model training.
Build and maintain data pipelines for extraction, transformation and preprocessing of structured and semi-structured inputs.
Stay current on state-of-the-art techniques in NLP, generative AI and tabular-data analysis, and integrate best practices.
Collaborate with cross-functional teams, including software developers, DevOps engineers to integrate the research solutions into production.
✅ What we’re looking for
3–5 years of industry experience with
NLP as a core focus
Experience working with
structured or tabular data
combined with NLP
2+ years of hands-on experience with deep learning methods and frameworks (e.g. PyTorch, TensorFlow).
Strong hands-on experience with
Generative AI / LLM-based systems
(e.g. RAG, structured output generation, text-to-SQL)
Solid understanding of
classical NLP techniques
(NER, parsing, rule-based methods)
Proven ability to
build and experiment with different approaches
, not just implement predefined solutions
Experience delivering
AI solutions in real-world / production environments
Strong Python skills for
data analysis, experimentation, and prototyping
MSc or PhD
in CS, ML, Data Science, or related field
Strong English communication skills
⭐️ Nice to have
Familiarity with cloud-based NLP platforms and MLOps tooling.
Experience with large-scale table analytics or regulatory statistical outputs.
🎁 What we offer
20 working days of paid vacation + public holidays
Full accounting & legal support
Fully remote setup + co-working option
High-performance equipment
Competitive compensation with
regular performance reviews
💡 Why this role is interesting
Work at the intersection of
NLP, GenAI, and real-world healthcare impact
Solve
non-trivial data problems
(tables + text + regulations)
Influence a
live product used by global pharma companies
Strong focus on
research + practical implementation