Sampling collections and metadata of planorbidae (Mollusca: Gastropoda) in Brazil: a comprehensive analysis of the Oswaldo Cruz Institute’s Mollusk Collection from 1948 to 2023

Planorbidae comprises approximately 40 genera of freshwater gastropods, including roughly 250 species. Among the Planorbidae subfamilies, the significance of Planorbinae is due to its genus Biomphalaria, whose species are intermediate hosts of the trematode Schistosoma mansoni Sambon, 1907, which causes schistosomiasis in humans and animals. Here, we present the analysis of the dataset of Planorbidae housed in the Collection of Mollusks of the Oswaldo Cruz Institute, with a special focus on Biomphalaria species. This dataset includes 7,267 lots originating from 55 countries, representing 20 genera and 75 species collected from 1948 to 2023. Collections were performed in all regions of Brazil, comprising specimens from 26 states and the Federal District, particularly from the Southeast and Northeast. Within the dataset, Biomphalaria includes 3,926 lots of 31 species from 42 countries. These records will help improve our comprehension of schistosomiasis transmission dynamics and the geographic distributions of these medically important species.

In this paper, we contributed a dataset derived from the Planorbidae species deposited in the Oswaldo Cruz Institute's Mollusk Collection, mainly from Brazil, but also from numerous other countries.This material was mainly the result of decades of study in freshwater ecosystems by Dr. Wladimir Lobato Paraense, known for his studies on the biology and taxonomy of Brazilian planorbids, and by his team in the laboratory of Malacology of the Oswaldo Cruz Institute.Many other specialists also contributed and examined several specimens from this collection.In 1948, when Dr. Wladimir Lobato Paraense worked at the Public Health Special Service, a Brazilian institution responsible for the control of parasitic diseases in the Rio Doce Valley in Minas Gerais, including schistosomiasis [6,7], he began his studies on the mollusks involved in the transmission of schistosomiasis in Brazil, creating the collection.
Currently, this collection includes mainly freshwater and land gastropods involved in the transmission of other parasitic diseases, such as fascioliasis and angiostrongyliasis, both cerebral and abdominal, but also includes gastropod species that cause economic losses in agriculture (mainly exotic species) and among native species from the Brazilian Biomes.The collection contributes to science, research, and education.It also serves as a repository of knowledge about Brazilian and global mollusk biodiversity [6].
The datasets presented in this study consist of metadata associated with each batch of Planorbidae specimens, featuring varying numbers of specimens.We filled in the obligatory fields and have successfully passed screening using the integrated publishing toolkit (IPT) of the Fundação Oswaldo Cruz (FIOCRUZ).For each lot of Planorbidae, our dataset includes fields providing, in Darwin Core Standard format, the following information: (i) taxonomy (kingdom, phylum, class, order, family, genus, specificEpithet, verbatimIdentification, infraspecificEpithet, scientificName, scientificNameAuthorship, taxonRank); (ii) collection details, including the collectors, collection date, collection site description (verbatimEventDate, eventTime, habitat, samplingProtocol); (iii) geolocation data (stateProvince, county, locality, locationRemarks, verbatimLatitude, verbatimLongitude, decimalLatitude, decimalLongitude, geodeticDatum); and (iv) catalog reference data (otherCatalogNumbers).This dataset is also available in the Sistema de Informação sobre a Biodiversidade Brasileira (SiBBr; i.e., Information System on the Brazilian Biodiversity), which integrates data and information, constituting the Brazilian Node of the Global Biodiversity Information Facility (GBIF), in an online platform for public use [8].

METHODS
This study included all reports of Planorbidae genera and species from the dataset obtained from the Oswaldo Cruz Institute's Mollusk Collection (CMIOC).These lots are mainly from Brazil, but also include material from more than 50 countries and continents (Figure 2).The temporal coverage of the Planorbidae dataset is from 1948 to 2023.
All mollusks were morphologically identified in the laboratory to their genus and species based on shell and anatomical characteristics through specimen dissections, literature, and comparison with the lots deposited in the CMIOC [2,[9][10][11][12].

DATA VALIDATION AND QUALITY CONTROL
Over the years, Dr. Lobato Paraense and his group published several works on planorbids from Brazil [9,.Their works described and redescribed species of Planorbinae, reinforcing the importance of planorbids in transmitting diseases.Data validation was also done via the GBIF data-validator tool upon data submission [8].
In Brazil, the geographical distribution covers all five territorial regions, spanning the 26 states and the Federal District.The present database presents records of the occurrence of representatives of the family Planorbidae in a total of 592 municipalities (Midwest: recorded in 32 municipalities; Northeast: 120 municipalities; North: 27 municipalities; Southeast: 339 municipalities; and South: 74 municipalities).Biomphalaria is the most well-represented genus in CMIOC, including lots also from Africa, Asia, and Latin and North Americas, with a total of 3,926 lots.These specimens are registered in the following countries: Angola, Antigua, Argentina, Barbados, Belize, Bolivia, Chile, China, Costa Rica, Cuba, Egypt, El Salvador, Ecuador, United States, Ghana, Guadeloupe, Guatemala, Guyana, French Guiana, Haiti, Hong Kong, Jamaica, Martinique, Mexico, Mozambique, Nicaragua, Panama, Paraguay, Peru, Puerto Rico, Dominican Republic, Santa Lucia, Suriname, Tanzania, Trinidad, Uganda, Uruguay, and Venezuela (Table 2).Another well-represented genus of Planorbinae is Drepanotrema, which includes twelve species from different countries, with 2,312 lots (Table 3).Considering lots from Brazil, the collection at CMIOC includes representatives from all 11 species of Biomphalaria that occur nationwide.Among them, three species play a significant role in the biological cycle of Brazil [33] and have a wide distribution.Together, these species account almost 40% of the total Planorbidae lots in the collection.Specifically, B. straminea is the most represented species with 1,257 lots, while B. tenagophila, and B. glabrata account for 811 and 654 lots, respectively.
The three species of Biomphalaria found in Brazil and recorded at CMIOC exhibit distinct geographic distributions within the country.However, there are certain states where these species overlap, indicating areas of coexistence (Figures 3-5).
B. straminea has the broadest geographic range among the three species in Brazil.It can be found in various states, from the North to the South of the country.Its distribution encompasses regions such as the Amazon, the Cerrado, the Caatinga, and the Atlantic Forest.
The extensive presence of B. straminea highlights its significance as an intermediate host for the parasitic trematode.For a complete record of B. straminea in Brazil [34], CMIOC only lacks samples from the states of Santa Catarina and Roraima (Figure 3).On the other hand,

REUSE POTENTIAL
The presented dataset is important because it provides information on the distribution of Planorbidae, Planorbinae, and Biomphalaria in Brazil based on a renowned collection of medical malacology (i.e., CMIOC), traditionally known for its studies within the country.This dataset can provide the basis for future studies in evolution, ecology, and epidemiology, among others, especially for species of medical interest from the public health perspective.
An important point of the current collection is the first recording of B. straminea in Amapá, serving as an important reference for research into the biodiversity of Planorbinae.
These data expand the distribution of these species and provide occurrence information on the other species of this genus in Brazil.In addition to supporting the surveillance and control of schistosomiasis in Brazil, these data also contribute to the knowledge of Biomphalaria biodiversity.They are also an important resource for managing the CMIOC (Figure 6).

Figure 2 .
Figure 2. Spatial coverage of the occurrence dataset of Planorbidae, from 1948 to 2023, based on the CMIOC, including mainly lots are from Brazil (6,372 lots).

B
. tenagophila has a more restricted distribution compared to B. straminea.B. tenagophila is commonly found in areas of the South and Southeast of Brazil, primarily encompassing the states of São Paulo, Paraná, Santa Catarina, and Rio Grande do Sul (Figure 4).Finally, B. glabrata has a less limited geographic distribution than B. tenagophila in Brazil.Specifically, B. glabrata is predominantly found in coastal areas, particularly in the Northeast region of the country (Figure5).

Figure 3 .
Figure 3. Spatial coverage of the occurrence dataset of B. straminea, from 1948 to 2023, based on the CMIOC, showing that most lots are from Rio Grande do Norte, with 256 lots.

Figure 4 .Figure 5 .
Figure 4. Spatial coverage of the occurrence dataset of B. tenagophila, from 1948 to 2023, based on the CMIOC, showing that most lots are from Rio de Janeiro State, with a total of 585 lots.The second more represented State is São Paulo, with 54 lots.

Table 1 .
Genera and species of the Planorbidae family deposited in the CMIOC.

Table 2 .
Origin of the lots of Biomphalaria deposited in the CMIOC.

Table 3 .
Origin of the lots of Drepanotrema deposited in the CMIOC.