Please use this identifier to cite or link to this item: https://hdl.handle.net/2445/224395
Title: Learning the syntax of plant assemblages
Author: Leblanc, César
Bonnet, Pierre
Servajean, Maximilien
Thuiller, Wilfried
Chytrý, Milan
Aćić, Svetlana
Argagnon, Olivier
Biurrun, Idoia
Bonari, Gianmaria
Bruelheide, Helge
Campos, Juan Antonio
Čarni, Andraž
Ćušterevska, Renata
De Sanctis, Michele
Dengler, Jürgen
Dziuba, Tetiana
Garbolino, Emmanuel
Jandt, Ute
Jansen, Florian
Lenoir, Jonathan
Moeslund, Jesper Erenskjold
Pérez Haase, Aaron
Pielech, Remigiusz
Sibik, Jozef
Stančić, Zvjezdana
Uogintas, Domas
Wohlgemuth, Thomas
Joly, Alexis
Keywords: Associacions vegetals
Hàbitat (Ecologia)
Biodiversitat
Plant communities
Habitat (Ecology)
Biodiversity
Issue Date: 13-Oct-2025
Abstract: To address the urgent biodiversity crisis, it is crucial to understand the nature of plant assemblages. The distribution of plant species is shaped not only by their broad environmental requirements but also by micro-environmental conditions, dispersal limitations, and direct and indirect species interactions. While predicting species composition and habitat type is essential for conservation and restoration purposes, it remains challenging. In this study, we propose an approach inspired by advances in large language models to learn the ‘syntax’ of abundance-ordered plant species sequences in communities. Our method, which captures latent associations between species across diverse ecosystems, can be fine-tuned for diverse tasks. In particular, we show that our methodology is able to outperform other approaches to (1) predict species that might occur in an assemblage given the other listed species, despite being originally missing in the species list (16.53% higher accuracy in retrieving a plant species removed from an assemblage than co-occurrence matrices and 6.56% higher than neural networks), and (2) classify habitat types from species assemblages (5.54% higher accuracy in assigning a habitat type to an assemblage than expert system classifiers and 1.14% higher than tabular deep learning). The proposed application has a vocabulary that covers over 10,000 plant species from Europe and adjacent countries and provides a powerful methodology for improving biodiversity mapping, restoration and conservation biology. As ecologists begin to explore the use of artificial intelligence, such approaches open opportunities for rethinking how we model, monitor and understand nature.
Note: Reproducció del document publicat a: https://doi.org/10.1038/s41477-025-02105-7
It is part of: 2025, vol. 11, num.10, p. 2026-2040
URI: https://hdl.handle.net/2445/224395
Related resource: https://doi.org/10.1038/s41477-025-02105-7
ISSN: 2055-026X
Appears in Collections:Articles publicats en revistes (Biologia Evolutiva, Ecologia i Ciències Ambientals)
Articles publicats en revistes (Institut de Recerca de la Biodiversitat (IRBio))

Files in This Item:
File Description SizeFormat 
904360.pdf3.74 MBAdobe PDFView/Open


This item is licensed under a Creative Commons License Creative Commons