PhD on Multimodal Graphs for Biodiversity Intactness Understading

I looking for a high motivated PhD student to work on Multimodal Graphs for diversity. See more below.

Funding: DLA AI-Intervene, self-funding
Project Title: Multimodal Graphs for Biodiversity Intactness Index
Main Supervisor: Alan Guedes

Next-Generation Biodiversity Intactness Prediction Leveraging Multimodal Graphs

The escalating impact of human activities on biodiversity, intensified by climate change, and quantifiable by the Biodiversity Intactness Index (BII), demands new prediction-based methods. Current BII prediction methods primarily stem from an ecological perspective, often lacking the integration of recent advancements in machine learning. While a wealth of multimodal biodiversity data exists, from reports to environmental data (climate and land usage), these valuable sources remain isolated, preventing a holistic understanding of intricate ecological relationships. MM-BioGraph directly addresses BII by the integrating tinto graph-based structures and using existing knowledge bases (e.g. NHM Planetary Knowledge Base). This novel approach represents relationships among species and environmental variables, revealing patterns and uncovering hidden connections. This project is a strategic partnership with the Natural History Museum’s (NHM) Future Lab, leveraging their expertise on their BII dataset called PREDICTS. Central to our methodology is the innovative application of Graph Neural Networks (GNNs) and Large Language Models (LLMs). This approach will not only overcome the limitations of current ecology regression models but also provide graph-based explainability support. We will build predictive models capable of understanding these relationships in the context of BII prediction, identifying previously unknown ecological connections obscured by biodiversity data complexity. Therefore, this project is set to deliver highly impactful new methods for biodiversity conservation.

Our primary goals are to: 1) Establish a comprehensive multimodal biodiversity dataset encompassing knowedge graphs, ecological conversation, and land usage data to ensure robust graph representation, leveraging the PREDICTS dataset, followed by a GNN-based baseline model to identify species interactions and ecosystem dependencies. 2) Advance multimodal modeling with LLMs by developing cutting-edge models that integrate LLMs with visual data, surpassing the GNN baseline to predict complex relationships, delivering more accurate and comprehensive insights.

References

Datasets: