Biodiversity services
Name | Description | ELIXIR Node |
---|---|---|
This study is divided into four Work Packages (WP). The first three aim to survey the biodiversity data domain and catalogue and assess relevant research data management elements (WP1), analysis tools, services, and infrastructures (WP2), as well as active and planned projects and stakeholders (WP3). WP4 aims to grow the ELIXIR Biodiversity Community by collating resources and materials for training and knowledge transfer amongst ELIXIR members and the wider research community. WP1: Research data management and data interoperabilityThe aim of WP1 is to survey and catalogue Research Data Management (RDM) elements relevant to the biodiversity domain, with a focus on molecular data but not excluding other relevant data collection activities. By focusing on data management and not data analysis it remains distinct from WP2. It is aligned with WP3 through the mapping of RDM-related activities being performed as part of ongoing projects (e.g. BGE, BiCIKL, TETTRIs), and will provide some of the substrate for discussions as part of WP3’s planned network event, contributing to creating connections with the wider community (particularly active ELIXIR Nodes). It will also highlight current capacities and gaps in RDM practices that will feed into WP4’s activities. Development and adoption of best practices in RDM for the biodiversity research domain remain extremely variable, hence the focus here on assessing the current status and collating existing best practices. This will strengthen ELIXIR’s position in the biodiversity domain by leveraging ELIXIR’s RDM-related strengths in other research domains including other ELIXIR Communities. LeadJosie Burgin (EMBL-EBI) Tasks
Milestones and deliverables
WP2: Services, tools, and analysisThe main aims of WP2 are to catalogue, review, and categorise tools, services, and analytical workflows currently in use by ELIXIR Nodes and the wider community, that process and analyse biodiversity-related data, with an emphasis on molecular data. This will include ELIXIR and non-ELIXIR tools. It remains distinct from WP1 by focusing on data analysis tools, services, and infrastructures and not on data management in research. It will benefit from WP3’s surveys of stakeholders/projects to help identify key tools and services that should be prioritised for review and cataloguing. These priorities can feed into WP4’s training activities by identifying community needs with respect to the usability of important tools and services. The outputs will ensure that ELIXIR is playing a leading role in identifying and exposing high-quality biodiversity-related analysis tools and services to the wider community of users, as well as fostering the development of such tools and services through connecting ELIXIR Nodes which are active in the domain. LeadToni Gabaldón and Matúš Kalaš Tasks
Milestones and deliverables
WP3: Outreach, networking, and communicationsBiodiversity is a very broad and diverse domain of research, in which many networks of infrastructures and initiatives are involved. The main aims of WP3 are to better describe the landscape of stakeholders ELIXIR is working with or needs to better engage with and to establish a “network of networks” for biodiversity research. It will feed into WP1 and WP2 by identifying ongoing or planned work directly relevant for improving RDM practices, and by exposing tools, services, and analysis workflows that the wider community are developing. It will also help to link training and knowledge transfer activities of various projects and stakeholders to the ELIXIR Training Platform and WP4. By leveraging ELIXIR’s federated structure, these network growing activities will support biodiversity research by improving connections between existing projects and communities, as well as by promoting synergies and aligning data and tools to the needs of researchers. This will also strengthen ELIXIR’s position as a partner in the planning and development of new projects and initiatives in the biodiversity data/research domain. LeadHenrik Lantz Tasks
Milestones and deliverables
WP4: Training and knowledge transferThe overarching aim of WP4 is to leverage the strengths of ELIXIR’s training experience to help support the growth of the ELIXIR Biodiversity Community through network-driven sharing of training experiences and knowledge transfer and materials. The work will be developed in close collaboration with the ELIXIR Training Platform and extensively use the methods already developed. Activities in Work Packages 1, 2, and 3 will help to connect to a wide range of relevant stakeholders and thereby feed WP4 with information about ongoing or planned training-related activities on the broader biodiversity community. LeadPatricia Palagi and Anne-Françoise Adam-Blondon Tasks
Milestones and deliverables
|
ELIXIR France, ELIXIR Greece, ELIXIR Italy, ELIXIR Germany, ELIXIR Portugal, ELIXIR Ireland, ELIXIR Norway, ELIXIR Spain, ELIXIR UK, ELIXIR Switzerland, ELIXIR Hungary, ELIXIR Sweden, EMBL-EBI, ELIXIR Czech Republic | |
Over the coming decade, Europe will face critical challenges in maintaining biodiversity, ensuring food security and combating pathogens. Our 2024–28 Programme will address these issues by mobilising and integrating molecular data, using successful coordination models from human genomics. Through strategic investments and collaboration in externally-funded projects, ELIXIR will enhance scientific services and support transnational research in these essential areas. The following projects have been selected as part of the ELIXIR 2024–28 Programme’s Biodiversity, food security and pathogens Science Tier:
With the declining cost of genome sequencing, the focus of plant researchers is shifting towards characterising the wide genomic diversity present within a species. Crop pan-genomes consist of the sequencing, comparison and integration of multiple different genomes from the same agriculturally important species such as wheat, rice and potatoes. Exploiting the information encoded within these pan-genomes can lead to the development of new cultivars more resilient to upcoming challenges like increased drought and heat stress. Multiple consortia are independently generating and integrating these pan-genomes, but there is currently little progress in streamlining and homogenising these efforts. While sequence quality is no longer a major issue, the completeness of both assembly and subsequent gene annotation are much harder to correctly quantify, while being the major drivers in explaining the adaptive differences between genotypes. Where there are efforts to visualise and browse pan-genomes, for example by using graph representations, the easy retrieval of gene Presence Absence Variation information or structural rearrangements is currently lacking, hampering knowledge learning. E-PAN aims to streamline the efforts of different research groups within the ELIXIR Plant Science Community. This encompasses the development of effective standards, computational pipelines and tutorials to assess the quality of pan-genomes and provide solutions to identified problems. We will also evaluate and integrate different approaches for data visualisation and browsing, which will be used by different partners sharing pan-genomics results. A one-day meeting and an online workshop will be organised to disseminate results and initiate new collaborative projects. These concerted efforts will lead to a standardised approach to be used in future pan-genome projects, a reduction in duplication efforts across consortia, and a set of tools to visualise and mine pan-genomics results. Nodes involved: ELIXIR Belgium, ELIXIR Germany, ELIXIR Portugal, ELIXIR Slovenia, ELIXIR UK Metagenomics Assembled Genomes (MAGs) are crucial for understanding biodiversity, enhancing food security and combating pathogens by providing insight on uncultured and unexplored genomes. This proposal outlines a comprehensive project aimed at advancing metagenomics research through the advancement, optimisation, evaluation and dissemination of robust FAIR workflows for building MAGs. Leveraging the Galaxy platform, our primary objectives include finalising a user-friendly state-of-the-art Galaxy workflow tailored for MAG construction, and ensuring its accessibility and reusability through integration with WorkflowHub. To support user adoption and proficiency, we will create FAIR educational materials hosted on the Galaxy Training Network (GTN), empowering researchers with the skills necessary to use the workflow effectively. The efficacy of the developed workflow will be rigorously evaluated by analysing MAGs generated from simulated and real-world data-spanning diverse environments: atmosphere, marine and cow gut microbiomes. This evaluation will provide valuable insights into the workflow's performance and its applicability across different sample types, complexities and ecosystems. We will also investigate the computational resources required for executing the assembly step of the workflow using data provided by several Galaxy servers and the MGnify team on various input datasets. The aim would be to optimise resource allocation to ensure efficient and cost-effective MAGs construction. A novel tool will be developed to facilitate this process, allowing researchers to accurately estimate and allocate resources for each step of the assembly pipeline. By addressing these objectives, our project aims to accelerate metagenomics research by providing researchers with a comprehensive and accessible framework for MAGs construction. This framework will not only streamline the workflow for building MAGs but also facilitate reproducibility, collaboration and innovation within the ELIXIR Microbiome Community. Nodes involved: ELIXIR France, ELIXIR Germany, ELIXIR Italy, EMBL-EBI The standardisation and accessibility of plant data is a major challenge for agricultural research. MIAPPE, which was developed as part of the transPLANT and ELIXIR-EXCELERATE projects, has made a decisive contribution to unifying data capturing. Also, the FONDUE Implementation Study facilitated the integration of phenotypic and genotypic data. Nevertheless, challenges persist in achieving full FAIRness of plant data. The development of guidelines and best practice documents within the Commissioned Service INCREASING has improved this. However, further enhancements are required, such as providing additional documentation and reference datasets. To address these needs, it is important to assess the practical effort required to FAIRify datasets using MIAPPE, ISA, ARC and RO-Crate standards. The idea is to provide biologist-friendly data documentation and at the same time introduce machine-actionable formats for bioinformaticians to use. A further challenge arises from the scattered nature of the information, as there is no single resource on which all the information is collated. In HARVEST, we aim to address these challenges by FAIRifying datasets (DROPS, AGENT) using the latest version of MIAPPE as a basis, which now covers more diverse and complex use cases. This process will include enriching the MIAPPE documentation in particular with example datasets, updating training material and refining mappings to other interoperable formats such as BrAPI, Bioschemas and ISA-Tab/JSON. We will also establish links using FAIDARE to repositories such as EMBL-EBI EVA, e!DAL-PGP, recherche.data.gouv and Zenodo, to enhance data sharing and reuse opportunities. An extension of the RDMkit Plant Sciences pages will be implemented to serve as a primary hub for information on FAIRification of plant data. Furthermore, we will be consolidating resources and improving accessibility through direct linking to the original web resources and recipes, also adding Jupyter notebooks to the FAIR Cookbook where possible. Nodes involved: ELIXIR Germany, ELIXIR France, ELIXIR Netherlands, ELIXIR UK, EMBL-EBI Understanding molecular biodiversity is essential for ecological conservation and sustainable development. While a vast array of molecular data awaits exploration, its lack of connectivity with other sources of data and metadata such as geographical reference, habitat, population size and phenotypic data often pose significant barriers to biodiversity research. This project proposal is about developing Odyssey, a web portal in the form of a user-friendly interface that will allow researchers, educators and citizens to navigate the world of molecular biodiversity using Greece and Norway as case studies – two countries with a characteristic and unique wealth of biodiversity, representative for Mediterranean and Nordic types of ecosystems respectively. Based on existing sources of information and prototype applications available for specific regions and taxa, this project aims to link actual efforts and develop a new interface to offer diverse functionalities for data exploration and analysis, such as descriptive statistics, graphs, maps, customisable data filters and dynamic visualisations. Through modular design, the application will ensure flexibility and scalability, enabling easy integration of new data sets and analytical tools in the future. This approach will be used for training and communication, inviting traditional biodiversity research groups to utilise new information concerning the spatial patterns of biodiversity and their connection with features that are important for designing conservation measures, such as habitat connectivity, representativity, population demographics, dynamics of adaptation and migration. Odyssey’s outcome will be a valuable tool for studying and, ultimately, offering a basis for managing and conserving the rich molecular biodiversity of Greece and Norway, as well as supporting the activities of the ELIXIR Biodiversity Community in the two Nodes and in Europe. This will promote collaboration, innovation and knowledge exchange in biodiversity research and beyond. This new tool will be developed and offered under an open-source licence, encouraging community participation and contribution to further enhance its capabilities and broaden its applications, fostering a robust network for biodiversity research in Greece and Norway. Nodes involved: ELIXIR Greece, ELIXIR Norway |
ELIXIR Belgium, ELIXIR France, ELIXIR Germany, ELIXIR Greece, ELIXIR Netherlands, ELIXIR Norway, ELIXIR Portugal, ELIXIR Slovenia, ELIXIR UK, EMBL-EBI, ELIXIR Italy | |
Cellular and molecular biology are fundamental to ELIXIR's mission. As part of our 2024–28 Programme, we are committed to advancing data services and software for research on nucleic acids, proteins and other biomolecules. This initiative will address new demands for multi-omics and multi-modal analyses, including imaging, by developing methods and partnerships. We will also expand expertise in reusable data and software to incorporate FAIR models, ensuring robust solutions for modelling at all scales. The following projects are key to connecting the latest developments with established data resources, unlocking the potential of cellular and molecular biology:
This project addresses the limitations of current ontologies in capturing the dynamic nature of disordered protein regions by pursuing several primary objectives. Firstly, novel structural and functional ontologies will be developed to accurately represent the structural heterogeneity and dynamic functional annotations of proteins. These ontologies will incorporate timescales, annotating the kinetics of structural transformations to elucidate molecular mechanisms and regulatory pathways governing protein dynamics. Collaborating with existing databases and consortia will ensure seamless integration of ontological resources and experimental data, fostering interoperability and accelerating discoveries. A standardised file format specification will also be developed in collaboration with the Human Proteome Organisation Proteomics Standards Initiative, facilitating the encoding of structural state transitions within disordered protein regions. This specification will enhance data interoperability and exchange among research groups and databases, providing a common language for describing structural transitions and advancing our understanding of the functional implications of protein dynamics in biological systems. Nodes involved: ELIXIR Belgium, ELIXIR Hungary, ELIXIR Italy, EMBL-EBI This project aims to strengthen the basis for a one-stop shop connecting databases, datasets and tools for the deployment of the engineering Design-Build-Test-Learn (DBTL) framework in biotechnology. It will do so by surveying the tools and data landscape, pinpointing gaps and opportunities, and establishing design patterns for task-specific workflows for analysis, integration and sharing of multimodal data. It will provide a resource that will allow users to navigate the complex landscape of biotechnology tooling and data, as well as to establish solutions that fit their specific DBTL requirements. Use cases from ongoing programmes in various communities will be used to ascertain and establish the pragmatic value of the solutions. The work will be carried out through hands-on activities, dedicated workshops and hackathons, providing training and resources, as well as fostering industrial engagement. The experience of the communities and platforms involved in systems biology, industrial biotechnology, metabolic modelling, metabolomics, enzymes, bioprospecting and data management will be particularly valuable in this respect, as well as their respective industrial relations. Accordingly, the project engages participants from seven ELIXIR nodes and connects researchers and their activities from six communities. The project outcomes will contribute to advancing the ambition of connecting the latest developments and established data resources across ELIXIR to realise the potential of cellular and molecular biology, particularly in the fields of industrial biotechnology and biomanufacturing. Nodes involved: ELIXIR Spain, ELIXIR Greece, ELIXIR France, ELIXIR Netherlands, ELIXIR Portugal, ELIXIR Slovenia, ELIXIR UK Spatial transcriptomics (ST) was named ‘Method of the Year 2020’ by Nature Methods and was more recently featured in Nature’s Seven technologies to watch in 2024. ST is now a prerequisite for researching transcriptional pathology at the cellular and molecular levels. Current use of ST is ubiquitously applied to multiple pathologies, including neurodegenerative disease, cancer, cardiomyopathy and nephrology. There is also an emerging application of ST in plant and microbiome research. While there are a plethora of spatial analysis applications, these are not unified or easily manageable by research scientists and they lack any hope of delivering FAIR and reproducible results. To address this challenge, we will implement Spatial2Galaxy (S2G) – a self-contained, reproducible, scalable FAIR spatial transcription analysis platform for researchers and bioinformaticians alike. We will develop S2G based on our success with developing Galaxy workflows, training materials and ST and single-cell analysis pipelines. S2G will provide state-of-the-art ST tools and workflows with proven high performance in benchmarking studies, ensuring the uptake of best practices. These tools will be demonstrated on datasets that connect various ST databases. This will consolidate community guidelines for integrative multi-modal single-cell omics and imaging analysis. Compared to non-spatial single-cell sequencing, presented as the Nature ‘Method of the Year 2013', it took six years until practical training and workflows for its analysis were FAIRified and available in Galaxy by 2019. In contrast, S2G aims to reduce this gap between technologies becoming relevant and provision of FAIR resources to the life science community for ST. Nodes involved: ELIXIR Germany, ELIXIR France, ELIXIR Netherlands, ELIXIR UK The ELIXIR metabolomics community relies on standards, formats and data treatment solutions development and adoption, but it remains challenging to ensure high-quality reported metadata, sufficiently contextualised results, interoperable and reusable datasets and to integrate these metabolomics data with other omics or studies. This project is designed to address these issues and aims to connect key international standards with ELIXIR resources, as well as creating associated community guidelines and training materials. Based on the FAIRification framework, activities in the project will: i) increase interoperability and reuse of public metabolomics datasets and workflows through enhanced and extended open data standards, resources and new semantic annotations, ii) define, ensure and establish quality control for study baselines in Metabolomics and Exposomics, and iii) facilitate metabolomic data interpretation and meta-analysis integration with multi-omics and systems biology studies. As a first necessary step, the project will create a Semantic Metabolomics Data Model to standardise metadata, ensuring unambiguous reuse of metabolomics projects. This model will focus on integrating key ontologies, providing open training initiative and enhancing the interoperability of metabolomics data through the production of open guidelines for annotation steps. By linking with ELIXIR’s Deposition databases, ISA Framework and other services, the project seeks to boost interconnection with ELIXIR platforms, other ELIXIR communities (Systems Biology, Food and Nutrition, Galaxy, Proteomics, Toxicology, Research Data Alliance Focus Group ...), the FAIR Cookbook and BioSchemas.org communities. Project outcomes are expected to promote the emergence of ambitious and innovative semantic-based solutions for inter-comparison of studies in healthcare, clinical and plant domains. Nodes involved: ELIXIR Czech Republic, ELIXIR Germany, ELIXIR Italy, ELIXIR Spain, ELIXIR France, ELIXIR Netherlands, ELIXIR Sweden, ELIXIR UK, EMBL-EBI |
ELIXIR Belgium, ELIXIR Czech Republic, ELIXIR France, ELIXIR Greece, ELIXIR Hungary, ELIXIR Italy, ELIXIR Netherlands, ELIXIR Portugal, ELIXIR Slovenia, ELIXIR Spain, ELIXIR Sweden, ELIXIR UK, EMBL-EBI | |