Antoine Lucas

Antoine Lucas

Research Engineer · ML Systems & Scientific AI

Paris


I trained as an agronomy engineer, specialized in applied statistics, and somehow ended up building machine learning systems for Chanel’s fragrance R&D — and fine-tuning LLMs for scientific and regulated contexts. The path made sense at each step, even if it doesn’t look obvious on paper.

For the past 5+ years I’ve been embedded in R&D and clinical teams at L’Oréal, Sanofi, Abolis (microbiome biotech), and now Chanel Parfums Beauté — building ML systems at the intersection of data and experimental science.

Currently Data Scientist & ML Engineer at Chanel Parfums Beauté R&D, on mission via Astek / IT&M Stats. Building ML and computer vision tools for fragrance and cosmetics research.

What I do

Machine learning & deep learning

Computer vision models, NLP pipelines, and LLM fine-tuning for scientific applications. From texture classification and olfactory descriptor mining to training LoRA adapters on domain-specific datasets. Production-grade tools that go into daily research workflows.

LLM pipelines & agentic systems

End-to-end pipelines combining ASR, structured extraction (Instructor + Pydantic), and domain validation — turning unstructured scientific inputs into validated, auditable records. Local-first, compliance-aware architecture.

Statistical modeling & analysis

From design of experiments and mixed models to biostatistics in Good Practices (GxP) environments. I cover the full analytical chain — from raw assay data to interpretation for regulatory submissions.

Reproducible data science & MLOps

Everything in Git, containerized, runnable in 6 months with uv sync or renv::restore(). Drift monitoring, CI/CD pipelines, and model versioning in GxP-adjacent environments. I’m opinionated about this because I’ve seen what happens when you’re not.

Skills

Languages

Python R SQL Bash

Deep learning & LLM

PyTorch Hugging Face PEFT / LoRA Instructor faster-whisper sentence-transformers

ML & Python ecosystem

scikit-learn OpenCV Pandas FastAPI uv

Statistics & methods

DoE Mixed Models Bayesian Optimisation PLS Survival Analysis SPC

R ecosystem

Shiny Tidyverse ggplot2 renv Quarto

MLOps & infrastructure

Docker GitHub Actions Azure ML Databricks Posit Connect GxP / Regulated

Education

  • Diplôme d’Ingénieur — Statistiques appliquées aux Sciences de la VieInstitut Agro - Agrocampus Ouest, Rennes · 2021
  • Master — Mathématiques Appliquées · StatistiquesINSA Rennes · Agrocampus Ouest · Université Rennes 2 · ENSAI · ENSAE (joint programme) · 2021

Back to top