
Curriculum Vitae

Antoine Lucas

R / Shiny Engineer · Scientific Software & Data Products

Pharma · Cosmetics · Biotech · Paris, France

antoine.lucas.fra@gmail.com · linkedin.com/in/antoinelucasdata · github.com/antoinelucasfra

Research Engineer & Data Scientist with 5+ years building production-grade software and data products in pharma, cosmetics, and biotech. Focused on R / Shiny delivery, reproducible analysis, stakeholder-facing architecture, and technical ownership from discovery through deployment. Background spans statistical modelling, scientific computing, internal platform tooling, and ML systems, with a strong interest in maintainable delivery: documentation, validation, CI/CD, and handover for teams that need to keep using the product long after the first release.

Experience

Data Scientist & ML Engineer Chanel Parfums Beauté R&D · via Astek / IT&M Stats

Sep 2025 – present

  • Managed data science platform tooling across GitForge, Azure ML, Posit Connect, and Databricks, including deployment, access governance, and lifecycle management
  • Responsible for a catalog of 20+ R Shiny applications covering clinical data visualisation, formulation process automation, and enhanced data lifecycle management for researchers
  • Engineered reproducible MLOps and delivery infrastructure using Docker and Databricks
  • Designed and deployed computer vision models (object segmentation) for cosmetics formulation and regulatory R&D applications

Data Scientist — Internal & Consulting Astek / IT&M Stats

May – Sep 2025

  • Led upskilling training on Bayesian statistics with R for internal consultants
  • Developed reusable internal R packages and Quarto automated reporting templates
  • Contributed to internal data science projects by quickly adapting to the team's existing processes and tools (Shiny app development, Python module tooling, data analysis)
  • Fine-tuned Microsoft Phi-3-mini-4k-instruct (3.8B) using LoRA (PEFT) on the ClimateCheck dataset for claim verification (full pipeline from data preparation to evaluation)

Data Scientist — Multi-omics Abolis · Microbiome Studio · via Astek / IT&M Stats

Dec 2024 – May 2025

  • Delivered end-to-end pipelines in R and Python for tabular and large-scale omics data (genomics, transcriptomics, metabolomics)
  • Embedded as the sole data scientist in a fast-paced 8-person startup team building a SaaS product (FastAPI Python backend, Vue.js front-end)
  • Participated in scientific discussions with various agri-food partners, covering experimental design, data analysis, and ML approaches for multi-omics integration

Data Scientist Sanofi R&D · Manufacturing Chain · via Astek / IT&M Stats

Jul 2023 – Nov 2024

  • Owned end-to-end delivery as sole R Shiny developer and Scrum Master, managing backlog, sprint ceremonies, and stakeholder alignment across researchers, managers, and project leads
  • Built R and Python applications to predict and simulate manufacturing plant resource capacity across global sites, driving drug production planning and decision-making from requirements through deployment

Biostatistician L’Oréal R&I · Scientific Computing · via Astek / IT&M Stats

Nov 2021 – Jul 2023

  • Supported a senior statistician on exposome and clinical datasets, including biophysical, omics, and microbiome data
  • Characterised clinical phenotypes with multi-block, integrative, multivariate, and variable-selection approaches
  • Contributed to data-visualisation work, including Bayesian-network and other graphical approaches, while proposing suitable statistical methods
  • Defined analysis methodologies for proof-of-concept clinical studies; wrote statistical analysis plans, supported blind-review meetings, programmed analyses in R, and delivered reports (100+ clinical studies analysed)

Data Scientist Intern L’Oréal R&I · Augmented Beauty

Feb 2021 – Aug 2021

Modelled hair customer data for personalised recommendations, combining literature review, data management, statistical analyses, machine learning, and communication to Research & Innovation teams.

Biostatistics Project Officer Intern Da Volterra · Metagenomics Team

Feb 2020 – Aug 2020

Performed data management and non-parametric statistical analyses for gut-microbiome clinical studies, then prepared and presented results to biotechnology research teams.

Data Scientist Intern Flinders University · Digital Health Research Center

Sep 2019 – Jan 2020

Supported digital-health and wearables studies in R through questionnaire design, workshop preparation, statistical evaluation, and presentation of results to researchers.

Education

Diplôme d’Ingénieur — Statistiques appliquées aux Sciences de la Vie Institut Agro - Agrocampus Ouest, Rennes

2021

Master — Mathématiques Appliquées · Statistiques INSA Rennes · Agrocampus Ouest · Université Rennes 2 · ENSAI · ENSAE (joint programme)

2021

Technical Skills

Languages: Python · R · SQL · Bash
Deep learning & LLM: PyTorch · Hugging Face (transformers, PEFT, datasets) · LoRA / QLoRA · Instructor · faster-whisper · sentence-transformers · Ollama
ML & Stats: scikit-learn · tidymodels · XGBoost · Bayesian optimisation · DoE · PLS · PERMANOVA · survival analysis
Apps & Viz: R Shiny (golem) · FastAPI · ggplot2 · Plotly · Quarto · Streamlit
MLOps: Docker · GitHub Actions · Azure ML · Databricks · Posit Connect · renv · uv
Engineering: Git · CI/CD · automated testing · documentation · code review · architecture design
Domains: GxP · IQ/OQ/PQ · Clinical biostatistics · Multi-omics · Cosmetic R&D

Certifications

GitHub Foundations — GitHub (2024)

Scrum Master — Agilbee (2023)

Languages

French (native) · English (fluent, professional)


Powered by Quarto.

© 2026 Antoine Lucas.


License: CC BY-NC-SA 4.0.