ForestForward: visualizing and accessing integrated world forest data from the last 50 years

Database (Oxford) 2025-03-12

Database (Oxford). 2025 Mar 3;2025:baaf018. doi: 10.1093/database/baaf018.

ABSTRACT

Mitigating the effects of environmental exploitation on forests requires robust data analysis tools to inform sustainable management strategies and enhance ecosystem resilience. Access to extensive, integrated plant biodiversity data, spanning decades, is essential for this purpose. However, such data are often fragmented across diverse datasets with varying standards, posing two key challenges: first, integrating these datasets into a unified, well-structured data warehouse, and second, handling the vast volume of data using big data technologies to analyze and monitor the temporal evolution of ecosystems. To address these challenges, we developed and used an extract, transform, and load (ETL) protocol that curated and integrates 4482 forestry datasets from around the world, dating back to the 18th century, into a 100-GB data warehouse containing over 172 million records sourced from the Global Biodiversity Information Facility repository. We implemented Python scripts and a NoSQL MongoDB database to streamline and automate the ETL process, using the data warehouse to create the ForestForward web platform. ForestForward is a free, user-friendly application developed using the Django framework, which enables users to consult, download, and visualize the curated data. The platform allows users to explore data layers by year and observe the temporal evolution of ecosystems through visual representations. Database URL: https://forestforward.udl.cat.

PMID:40037615 | PMC:PMC11879282 | DOI:10.1093/database/baaf018