Mots-Clés
Ontologies
Programming
Databases
Knowledge graph
Biology
Description
As a global key player in yeasts and fermentation, Lesaffre designs, manufactures and markets innovative solutions for Baking, Food taste & pleasure, Health care and Biotechnology. Born in northern France in 1853, Lesaffre is now a multi-national and multicultural company committed to working with confidence towards better nourishing and protecting the planet. Lesaffre is well-positioned to nourish 9B people by 2050 by making the most of planet resources. Its capabilities to explore and reveal the infinite potential of microorganisms, as well as its leading fermentation knowledge bring Lesaffre as one of the most promising answers to this challenge. At the heart of this dynamic, the BioData team plays a strategic role by supporting the digital transformation of research activities. As an innovative and multidisciplinary team, BioData develops solutions to enhance scientific data, strengthen its quality, ensure compliance, and establish responsible and collaborative data governance.
Internship description:
The project is part of a broader initiative of data management (set of rules and processes to manage data) and knowledge management (structuring and enhancement of knowledge). Under the supervision of a data engineer and a data steward, you will work in collaboration with the biological research teams and the data teams to strengthen the quality control of biological data related to strains and explore new approaches in knowledge management.
The objectives:
- Define and implement rules for validating biological data and make quality dashboards available to business teams.
- Identify relevant ontologies, establish a mapping between internal attributes and the concepts of these ontologies.
- Define a model for enriching Lesaffre’s biological data with public data.
- Use ontology concepts to create a knowledge graph.
Profile:
- Master’s or engineering student (computer science or software engineering).
- Good programming skills in Python
- Strong knowledge of databases, code versioning and CI/CD practices
- Notions of ontologies, SHACL, OWL
- Analytical mindset, rigor and autonomy
- Good communication skills and fluency in English
- Interest in open science and biological data governance
The position is open for a duration of 6 months starting in January 2026 and not later than March 2026.