In the field of modern biology, the advent of high-throughput technologies has led to an avalanche of data, the management of which remains a challenge for the academic world.
Implementing ICT in agriculture relies on data produced by both public and private sectors along the whole value chain of agrifood systems. Developing sustainable ICT requires data workflows that maximize data and associated metadata access, minimizing carbon footprint, storage hazards, and data loss. Thanks to an ongoing 5-year project (BREIF project funded by the France 2030 Agroecology and ICT program, https://www.pepr-agroeconum.fr/pepr-agroeconum-eng/funded-projects/data-processing-and-modeling/infrastructures/breif), we are designing an operational continuum for data support from various facilities in charge of the data production to thematic sharing warehouses. We propose this proof of concept project as a case study of the partnership, to share expertise on this crucial issue of sustainable data workflows. We would like to share and compare experiences about the following topics
- co-development of naming rules for projects and files with all stakeholders, by specifying the role of each stakeholder for data management
- description of the fluxes of data across stakeholders
- development of templates for metadata collection, following international standards
- design of workflow, ensuring proper data documentation with validation steps, e.g. obtention of a BioSample unique ID of the biological samples before allowing said sample to be sequenced
- compliance to given files and directories naming convention for gaining access to the storage facility; naming convention facilitating data submission workflows to EMBL_EBI repositories
- development of guidelines dedicated to the project partners
INRAE :
The National Institute for Agricultural, Food and Environmental Research (INRAE) is France’s leading public research organization dedicated to sustainable agriculture, food systems and the environment. Its mission is to develop innovative solutions that promote durable farming practices, protect natural resources and enhance food security across Europe and beyond.
INRAE places a strong emphasis on open science: it operates the institutional HAL repository, provides tools for open‑access publishing, data sharing and collaborative research, and offers comprehensive guidance on open‑science best practices to its scientific community.
Through its procurement and partnership portals, INRAE also integrates responsible‑development and sustainability criteria into every stage of research and innovation, reinforcing its commitment to socially and environmentally responsible science.
Overall, INRAE combines cutting‑edge research with a transparent, open‑science culture to drive a more resilient and sustainable agricultural future.