Data Paper |
Corresponding author: Francesco Cozzoli ( francesco.cozzoli@cnr.it ) Academic editor: Tiziana Di Lorenzo
© 2025 Sarah Boulamail, Salvatore Inguscio, Sara Ventruti, Benedetta Barzaghi, Damiano Brognoli, Raffaele De Giorgi, Joachim Langeneck, Elia Lo Parrino, Emanuele Mancini, Raoul Manenti, Alejandro Martínez, Martina Pulieri, Emanuela Rossi, Francesco Cozzoli.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Boulamail S, Inguscio S, Ventruti S, Barzaghi B, Brognoli D, De Giorgi R, Langeneck J, Lo Parrino E, Mancini E, Manenti R, Martínez A, Pulieri M, Rossi E, Cozzoli F (2025) Integrating formal surveys and local knowledge: Insights into the subterranean fauna of Apulia. Subterranean Biology 52: 29-44. https://doi.org/10.3897/subtbiol.52.140693
|
The study of hypogean fauna is critical for preserving subterranean biodiversity and ecosystem functions despite climate change. However, underground habitats, often hosting unique and endemic species, present significant challenges for traditional biodiversity assessments owing to their inaccessibility and the specialised nature of their inhabitants, resulting in a paucity of academic studies. Furthermore, even when these studies exist, data are often held in personal notes and databases that are not interoperable according to current standards, making them less usable for research purposes. Reawakening this dormant data, standardising, and sharing it according to modern criteria offers enormous opportunities to expand existing knowledge and provide support for future studies.
In the Apulian region of Southern Italy, a biodiversity hotspot for subterranean life, the paucity of recent systematic surveys and reliance on expert knowledge poses both opportunities and challenges for ecological research. By integrating data from various sources, this study provides an overview of the subterranean faunal assemblages documented in this region. Overall, the dataset is comprehensive, comprising 109 species (29 of which are considered endemic to Apulia) and 224 sites, totalling 622 presence-only records. As our records have expanded over 93 years, this dataset represents a unique resource for elucidating the characteristics of Apulian subterranean ecosystems and potentially their changes over time. Integrating observations already published in previous studies with unpublished records, this is the first complete data collection on Apulian subterranean fauna organised according to modern standards of data sharing.
Apulia, biodiversity hotspot, historical data, presence-only, subterranean fauna
Groundwater ecosystems remain understudied due to the inherent challenges posed by subterranean habitats. The inaccessibility of these environments, combined with the specialised and often cryptic nature of their inhabitants, makes traditional biodiversity assessment methods difficult to implement and, in many cases, insufficient (
Apulia is one of the most interesting and fascinating European karst areas in terms of subterranean biodiversity (
The karst landscape of Apulia, covering approximately 8,600 km², includes over 2,300 natural caves, several of which are show caves or touristic sites, and holds historical significance with rock settlements, various hypogea, and numerous important Palaeolithic and Neolithic sites. The Alta Murgia Regional Park, which includes a large part of the corresponding aquifer, was designated as the 12th Italian UNESCO Geopark on 9 September 2024. However, as one of the southernmost major aquifers in Europe, southern Apulia’s groundwater temperatures are relatively high and potentially subjected to the effects of global warming (
Apulia is divided into six main hydrogeological units: Gargano, Tavoliere, Murge Plateau, Brindisi Plain, Ionian Arch of Taranto, and Salento (
The entire Apulia region, especially Salento, has undergone multiple marine regressions and transgressions throughout its geological history (
Although still noteworthy, the subterranean terrestrial fauna in Apulia is less diverse than the aquatic fauna. This significant disparity is likely due to the numerous marine transgressions that modified the distribution of terrestrial species (
The bibliographic dataset was compiled based on the authors’ personal knowledge of the limited literature existing on Apulian stygofauna, supplemented by: a) a systematic bibliographic search conducted using Google Scholar, SCOPUS, and ISI Web with the keywords “Apulia” ˄ (“Stygofauna” | “Groundwater fauna”) in both English and Italian; b) a substantial collection of unpublished records primarily contributed by Salvatore Inguscio and other amateur biospeleologists; c) the consultation of updated sources such as The New Checklist of Italian Fauna (Bologna et al. 2023). Owing to the incomplete and fragmentary nature of the available knowledge, our dataset does not claim to comprehensively cover the entire body of information on Apulian subterranean fauna, particularly regarding insects. However, the dataset will undergo continuous updates as part of the STIGE-CLIMAQUIFERI project. These updates incorporate both new observations and previously documented information that is currently not included.
Observations reported in the literature were collected using various methods, primarily visual surveys, baited traps, and plankton nets. Novel or repeated observations dated between 2020 and 2024 were collected mainly from the authors of this study using two different sampling techniques. The first method involved the use of a hand-operated net with a mesh size of 0.5 mm; three replicates of 10 shots each were taken at each sampling point. The second method consisted of the deployment of plastic cylindrical traps (length 30 cm and diameter 6 cm) baited with fish or meat fragments. The collected material was fixed in 85% ethanol and preserved in 75% ethanol. In the laboratory, the collected material was sorted, and the specimens were identified at the species level using a stereomicroscope (Olympus SZX-16) and an optical microscope (Leica DM2000LED).
Specimens have been identified morphologically mostly to the species level, with occasional classifications at the genus or subspecies level, and categorised based on taxonomy, lifestyle (aquatic or terrestrial), degree of adaptation to subterranean environments (stygo- and troglobites, stygo- and trogloxenes, and stygo- and troglophiles), and endemic status of the species within Apulian aquifers (TRUE or FALSE). These classifications were primarily based on
Currently, the dataset is available as Suppl. materials of this article accompanied by a summary of the bibliographic sources, recorded species, sampling sites, and metadata. Metadata are also available in the Metadata Catalogue of LifeWatch EIRC (
The dataset was organised and analysed within the R free software environment (
Dataset name: stygofauna_collection_Apulia_1948_2024.csv.
Format name: csv (separators=”,”; decimal=”.”).
Character encoding: UTF_8.
Distribution: The dataset is available as a Suppl. materials to this article.
Date of publication: 11/11/2024.
Date of last review: 05/11/2024.
Intellectual rights: This data package is released to the “public domain” under Creative Commons CC0 4.0 “No Rights Reserved”.
Language: English.
Database creators: Sarah Boulamail, Raffaele De Giorgi, Sara Ventruti, Francesco Cozzoli.
Metadata provider: Sara Ventruti, Martina Pulieri.
Temporal coverage: From 1931 to 2024.
Record basis: Literature records and unpublished personal observations.
Sampling design and methods: The data were collected using various methods, including visual surveys, baited traps, and plankton nets. The observations were georeferenced and integrated with previously published data, as well as unpublished records from field experts, primarily in caves and wells across the Apulian region.
Study area: Apulia, Italy.
Bounding box: 14.90000 W - 18.60000 E, 42.30000 N - 39.70000 S; WGS84 reference system.
Quality control for geographical data: Quality control was performed by displaying coordinates using the R free software environment (
Habitat type: Subterranean habitats, primarily caves and wells, spanning karstic environments. Habitat categories include aquatic and terrestrial zones within these subterranean ecosystems.
Quality control for ecological data: Habitat and species-specific ecological information were cross-checked with existing literature and expert knowledge on the biology and ecology of the recorded species.
Literature search method: A thorough review of literature on Apulian subterranean fauna was conducted, utilising academic databases and expert observations spanning over a century. Both published and unpublished data were incorporated.
Literature list: A total of 61 publications were referenced and used to gather data about species and the environments they inhabit. The dataset also integrates records from the ongoing PRIN 2022 STIGE-CLIMAQUIFERI project, which has added numerous recent observations, including new data on habitat conditions.
Quality control for literature data: The completeness of the literature was verified by cross-referencing bibliographies and datasets against known records and expert databases.
Taxonomic ranks: All extant species of Apulian stygofauna and troglofauna were included, spanning multiple genera and families of aquatic and terrestrial organisms.
Species names: The current accepted names for all species have been compiled in the dataset, following standard taxonomic resources and expert validation.
Taxonomic methods: Field-sampled species were identified to the lowest possible taxonomic rank using both classical morphological methods and expert knowledge from biospeleologists.
Taxonomic specialist: Salvatore Inguscio, Joachim Langeneck, Emanuele Mancini
Quality control for taxonomic data: All taxonomic ranks were verified using peer-reviewed literature and expert validation.
Currently, the dataset includes 622 records collected from over 224 sites. It is representative of 109 species, 72 genera, 49 families, and 26 orders of organisms observed in Apulian subterranean environments. Most of the observations were collected from caves (46 sites) and wells (149 sites). Stygobionts represent 30% of the 82 reported aquatic species, while troglobionts represent 60% of the 27 reported terrestrial species. Overall, 29 species were considered endemic to Apulia (Fig.
In terms of contributions to the dataset, the most influential studies are those of
The vast majority of the records pertain to Arthropoda, though there are also 41 records of Mollusca, 6 records of Platyhelminthes, 1 record of Annelida, and 1 notable record of Porifera, specifically the stygobitic sponge Higginsia ciccaresi Pansini & Pesce, 1998, found in the Zinzulusa caves. The recorded classes of Arthropoda included Copepoda (50 species), Malacostraca (19 species), and Ostracoda (6 species) for aquatic organisms and Arachnida (11 species) for terrestrial organisms (Fig.
The highest number of species (87) was reported in the Salento unit, which exhibited both higher intrinsic biodiversity and more intense investigations. A lower number of species was reported for the Murge Plateau (27), Gargano (23), and Tavoliere (12) units (Fig.
Species richness of the different sites included in the dataset, for the overall dataset and divided by the three most represented classes (Malacostraca, Copepoda and Gastropoda). “Others” include Arachnida, Clitellata, Collembola, Diplopoda, Diplura, Insecta, Ostracoda, Porifera, and Turbellaria.
By compiling observations from both published research and expert field records, this study fills a significant gap in our knowledge of subterranean biodiversity in Southern Italy. This comprehensive dataset offers a resource for understanding the rich and diverse stygofauna of the Apulian region, particularly its endemic species. However, it also highlights spatial and temporal gaps in knowledge, with some local areas (Tavoliere, Brindisi Plain, Ionian Arch of Taranto) unexplored or strongly underrepresented compared to others (mostly Salento, Gargano, and Murge Plateau). Although the data are not sufficient/appropriate to properly fit a Species Accumulation Curve, the high ratio between sampling effort and the number of observed species suggests that the biodiversity of the less sampled areas could be comparable to that of the more investigated ones. Moreover, many of the observations reported in our dataset were collected decades ago, and only a few of them have been recently repeated. Recent observations have confirmed the presence of notable species, such as S. bottazzii and T. salentina, at various historical sites. They also highlight the remarkable species richness of the L’Abisso site, although only a portion of the species recorded in the past was collected during recent surveys. However, the limited sampling efforts undertaken do not provide enough data to assess whether there has been a decline in the richness of subterranean species owing to recent environmental changes. While the relative abundance of past observations provides a crucial comparative baseline for interpreting the effects of recent changes, it might not represent the current-day biodiversity condition. Thus, there is a strong need to update these observations through a comprehensive sampling campaign covering the entire territory. Finally, it must be considered that the specimens documented in the literature have not been subjected to reanalysis; therefore, we adhered to the taxonomic classifications provided in the original publications or most recent checklists. Both recent and historical observation have relied on taxonomic identification based on morphological characteristics. Consequently, the presence of cryptic species cannot be ruled out. A reanalysis of these specimens using modern genetic tools could provide deeper insights into the true diversity of the taxa and reveal hidden evolutionary lineages, further enriching our understanding of subterranean biodiversity. The integration of modern data management principles and sharing through platforms such as LifeWatch ITA (https://dataportal.lifewatchitaly.eu/data) ensures that this dataset is accessible,and reusable for future research and conservation plans. As ongoing projects continue to provide new findings, we believe that this updatable database will form the basis for the study and conservation of Apulian stygofauna in the future.
We thank Dr. Alberto Potenza, Il Gruppo Speleologico Leccese ‘Ndronico, Il Gruppo Speleologico Tricase (GST) and TheMonumentsPeople for their irreplaceable logistical support and sharing of knowledge. We also thank Alessandro Silvestrini, Michele Bonfrate, Marcello Caramuscio, FORMICA S.R.L., and I.S.T.E. SUD for courtesy and assistance provided during the sampling operations. We extend our gratitude to Dr. Isabella Serena Liso for supporting the spatial analysis, Dr. Simone Cianfanelli for the malacological contribution, Dr. Tiziana Di Lorenzo and Dr. Fabio Stoch for their extremely valuable editorial comments and to Dr. Mattia De Cicco, as well as an anonymous reviewer, for their constructive feedback. Research funded by the project PRIN 2022 STIGE-CLIMAQUIFERI DTA.PN010.014 2022MM8P88_LS8_PRIN2022 - CUP:B53D23012170006 with the support of the project PRIN project “ANCHIALOS” (2022LLNF3N), funded by the Ministry of Universities and Research (Italy) Biodiversa+ (Project DarCo), the European Biodiversity Partnership under the 2021–2022 BiodivProtect joint call for research proposals, co-funded by the European Commission (GA N°101052342) and with the funding organisations Ministry of Universities and Research (Italy), Agencia Estatal de Investigación – Fundación Biodiversidad (Spain), Fundo Regional para a Ciência e Tecnologia (Azores, Portugal), Suomen Akatemia – Ministry of the Environment (Finland), Belgian Science Policy Office (Belgium), Agence Nationale de la Recherche (France), Deutsche Forschungsgemeinschaft e.V. (Germany), Schweizerischer Nationalfonds (Grant N° 31BD30_209583, Switzerland), Fonds zur Förderung der Wissenschaftlichen Forschung (Austria), Ministry of Higher Education, Science and Innovation (Slovenia), and the Executive Agency for Higher Education, Research, Development and Innovation Funding (Romania).
Dataset
Data type: csv
Species
Data type: csv
Sites
Data type: csv
Sources
Data type: csv