diff --git a/DESCRIPTION b/DESCRIPTION index ed63e84..2078cd2 100644 --- a/DESCRIPTION +++ b/DESCRIPTION @@ -19,6 +19,7 @@ Imports: future, magrittr, rlang, + tidytext, vegan Suggests: ggforce, @@ -45,7 +46,6 @@ Suggests: themis, tidyr, tidyselect, - tidytext, tune, vip, workflows, diff --git a/docs/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-10-1.png b/docs/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-10-1.png index 5e0d20b..4be0f42 100644 Binary files a/docs/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-10-1.png and b/docs/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-10-1.png differ diff --git a/docs/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-11-1.png b/docs/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-11-1.png new file mode 100644 index 0000000..5e0d20b Binary files /dev/null and b/docs/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-11-1.png differ diff --git a/docs/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-2-1.png b/docs/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-2-1.png index 4c8b3d4..7f12c91 100644 Binary files a/docs/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-2-1.png and b/docs/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-2-1.png differ diff --git a/docs/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-3-1.png b/docs/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-3-1.png index d830721..5f14c72 100644 Binary files a/docs/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-3-1.png and b/docs/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-3-1.png differ diff --git 
a/docs/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-4-1.png b/docs/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-4-1.png index 5e6cd20..d830721 100644 Binary files a/docs/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-4-1.png and b/docs/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-4-1.png differ diff --git a/docs/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-5-1.png b/docs/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-5-1.png index b23f703..5e6cd20 100644 Binary files a/docs/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-5-1.png and b/docs/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-5-1.png differ diff --git a/docs/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-6-1.png b/docs/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-6-1.png new file mode 100644 index 0000000..b23f703 Binary files /dev/null and b/docs/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-6-1.png differ diff --git a/docs/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-8-1.png b/docs/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-8-1.png deleted file mode 100644 index d3b55b7..0000000 Binary files a/docs/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-8-1.png and /dev/null differ diff --git a/docs/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-9-1.png b/docs/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-9-1.png index 4be0f42..d3b55b7 100644 Binary files a/docs/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-9-1.png and 
b/docs/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-9-1.png differ diff --git a/docs/distribution.html b/docs/distribution.html index 4fe9b7b..34abce4 100644 --- a/docs/distribution.html +++ b/docs/distribution.html @@ -143,18 +143,24 @@

4 Nutrients distribution

4.1 Fish groups

-The bar chart illustrates the cumulative contribution of various marine food sources to the Recommended Nutrient Intake (RNI) for six essential nutrients, based on a 100g portion size. The x-axis is scaled in percentage terms, with the 100% mark indicating the complete RNI for a reproductive-age woman. Each horizontal bar is a stacked representation, segmented by color to denote the specific nutrient contributions from marine food sources. The marine food sources are labeled on the y-axis, which allows for a comparative visualization of their nutrient profiles, highlighting the diversity in nutrient density and emphasizing their potential significance in dietary nutrition. +The bar chart illustrates the contribution of a variety of marine food sources to the Recommended Nutrient Intake (RNI) for six fundamental nutrients, based on a 100g portion. The x-axis represents the proportion of RNI fulfilled, with the 100% benchmark signifying the complete RNI for an adult woman of reproductive age. Each bar is a color-segmented stacked visual, with distinct hues corresponding to individual nutrients, and white numbers within indicating the specific percentage contribution of each nutrient. The chart incorporates the total annual catch in metric tons for each marine species from 2018 to 2023, presented at the end of each bar, providing a view of both the nutritional value and the harvest volume of these essential food sources. The transparency of these values is adjusted to reflect each species' relative contribution to the overall catch

-Figure 4.1: The bar chart illustrates the cumulative contribution of various marine food sources to the Recommended Nutrient Intake (RNI) for six essential nutrients, based on a 100g portion size. The x-axis is scaled in percentage terms, with the 100% mark indicating the complete RNI for a reproductive-age woman. Each horizontal bar is a stacked representation, segmented by color to denote the specific nutrient contributions from marine food sources. The marine food sources are labeled on the y-axis, which allows for a comparative visualization of their nutrient profiles, highlighting the diversity in nutrient density and emphasizing their potential significance in dietary nutrition. +Figure 4.1: The bar chart illustrates the contribution of a variety of marine food sources to the Recommended Nutrient Intake (RNI) for six fundamental nutrients, based on a 100g portion. The x-axis represents the proportion of RNI fulfilled, with the 100% benchmark signifying the complete RNI for an adult woman of reproductive age. Each bar is a color-segmented stacked visual, with distinct hues corresponding to individual nutrients, and white numbers within indicating the specific percentage contribution of each nutrient. The chart incorporates the total annual catch in metric tons for each marine species from 2018 to 2023, presented at the end of each bar, providing a view of both the nutritional value and the harvest volume of these essential food sources. The transparency of these values is adjusted to reflect each species’ relative contribution to the overall catch +

+
+
+Distribution of nutritional content among different fish groups. This series of bar graphs delineates the contribution of various fish groups to the total nutrient stock, highlighting the top ten fish groups for calcium, omega-3, iron, protein, vitamin A, and zinc. Each graph is ordered to reflect the descending contribution of each fish group relative to each nutrient. +

+Figure 4.2: Distribution of nutritional content among different fish groups. This series of bar graphs delineates the contribution of various fish groups to the total nutrient stock, highlighting the top ten fish groups for calcium, omega-3, iron, protein, vitamin A, and zinc. Each graph is ordered to reflect the descending contribution of each fish group relative to each nutrient.

4.2 Habitat and gear type

-
-Sankey diagram showing the relative distribution of key nutrients across various marine habitats and the corresponding extraction by different fishing gear types used in Timor-Est small-scale fisheries. +
+Sankey diagram showing the relative distribution of key nutrients across various marine habitats and the corresponding extraction by different fishing gear types used in Timor-Est small-scale fisheries.

-Figure 4.2: Sankey diagram showing the relative distribution of key nutrients across various marine habitats and the corresponding extraction by different fishing gear types used in Timor-Est small-scale fisheries. +Figure 4.3: Sankey diagram showing the relative distribution of key nutrients across various marine habitats and the corresponding extraction by different fishing gear types used in Timor-Est small-scale fisheries.

diff --git a/docs/highlight.html b/docs/highlight.html index 80db5f9..1395e0a 100644 --- a/docs/highlight.html +++ b/docs/highlight.html @@ -185,8 +185,8 @@

3.1 Timor-Est SSF nutritional sce
  • POPULATION MEETING RNI REQUIREMENTS: Percentage of the population meeting the RNI requirements in each municipality: \(\frac{Number\ of\ people\ supplied\ daily}{Municipality\ population} \times 100\)
-
- +
+
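The two ratios behind this table can be checked by hand. Below is a minimal Python sketch, assuming the RNI is expressed in grams per day and that 20% of it is used as the reference, as the report describes. The function names, the annual supply, and the population figure are hypothetical and chosen only for illustration; only the protein RNI of 46 comes from the report.

```python
def people_supplied_daily(annual_supply_kg, rni_g_per_day, diet_share=0.20):
    """People whose daily reference intake is covered by the annual supply.

    Implements: annual supply (kg) / ((RNI * 0.20) / 1000) / 365
    """
    daily_need_kg = (rni_g_per_day * diet_share) / 1000  # grams -> kg
    return annual_supply_kg / daily_need_kg / 365


def percent_population_meeting_rni(people_supplied, population):
    """Share of the municipality population covered, as a percentage."""
    return people_supplied / population * 100


# Illustrative inputs only (not taken from the dataset).
protein_rni_g = 46          # protein RNI listed in the report
annual_supply_kg = 3358     # hypothetical annual protein supply
population = 50_000         # hypothetical municipality population

supplied = people_supplied_daily(annual_supply_kg, protein_rni_g)
share = percent_population_meeting_rni(supplied, population)
print(f"people supplied daily: {supplied:.0f}, population share: {share:.1f}%")
```

Keeping `diet_share` as a parameter makes it easy to see how sensitive the result is to the 20% assumption.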

diff --git a/docs/index.html b/docs/index.html index 581f9da..3cacf64 100644 --- a/docs/index.html +++ b/docs/index.html @@ -140,7 +140,7 @@

1 Content

diff --git a/docs/profiles.html b/docs/profiles.html index 017fd75..b32331b 100644 --- a/docs/profiles.html +++ b/docs/profiles.html @@ -169,23 +169,23 @@

5.2.1 Clusters -Distribution of nutrient adequacy across k-means clusters. The bar chart delineates the number of individuals meeting the Recommended Nutrient Intake (RNI) per 1kg of catch within identified k-means clusters. Each bar is categorized into six segments corresponding to the evaluated nutrients. The clusters are enumerated on the y-axis, each representing a group with a distinct nutritional profile as determined by the cluster analysis. The x-axis quantifies the count of individuals within each cluster that meet the RNI for the respective nutrients, underlining the variability in nutrient adequacy across clusters. Panels (a) through (d) compare these distributions across different fishing practices and locations, namely Atauro and the Mainland, using all gear types or exclusively gill nets. +
+Distribution of nutrient adequacy across k-means clusters. The bar chart delineates the number of individuals meeting the Recommended Nutrient Intake (RNI) per 1kg of catch within identified k-means clusters. Each bar is categorized into six segments corresponding to the evaluated nutrients. The clusters are enumerated on the y-axis, each representing a group with a distinct nutritional profile as determined by the cluster analysis. The x-axis quantifies the count of individuals within each cluster that meet the RNI for the respective nutrients, underlining the variability in nutrient adequacy across clusters. Panels (a) through (d) compare these distributions across different fishing practices and locations, namely Atauro and the Mainland, using all gear types or exclusively gill nets.

Figure 5.1: Distribution of nutrient adequacy across k-means clusters. The bar chart delineates the number of individuals meeting the Recommended Nutrient Intake (RNI) per 1kg of catch within identified k-means clusters. Each bar is categorized into six segments corresponding to the evaluated nutrients. The clusters are enumerated on the y-axis, each representing a group with a distinct nutritional profile as determined by the cluster analysis. The x-axis quantifies the count of individuals within each cluster that meet the RNI for the respective nutrients, underlining the variability in nutrient adequacy across clusters. Panels (a) through (d) compare these distributions across different fishing practices and locations, namely Atauro and the Mainland, using all gear types or exclusively gill nets.

The scatter plot from the k-means clustering (Figure 5.2) showed the distribution of nutrient profiles across different clusters in each data subset. The first two principal components explained a significant portion of the variance, indicating distinct groupings in nutrient profiles among the fishing trips.

-
-Nutritional profile clustering of fishing trips by region and gear type. Each plot presents a k-means clustering analysis of fishing trip observations, grouped by their nutritional contributions to the Recommended Nutrient Intake (RNI) for six nutrients. The four panels, labeled (a) through (d), display data subsets for Atauro and the Mainland, utilizing all gear types and gill nets specifically. The scatter plots within each panel are charted in a two-dimensional space defined by the first two principal components, with the axes denoting the percentage of explained variance. Points are color-coded to denote distinct nutritional profile clusters derived from the k-means algorithm. Convex hulls define the periphery of each cluster, providing insight into the cluster density and separation. Convex hulls around the clusters aid in visualizing the distribution and delineation of nutritional profile groupings across different fishing methods and geographic areas. +
+Nutritional profile clustering of fishing trips by region and gear type. Each plot presents a k-means clustering analysis of fishing trip observations, grouped by their nutritional contributions to the Recommended Nutrient Intake (RNI) for six nutrients. The four panels, labeled (a) through (d), display data subsets for Atauro and the Mainland, utilizing all gear types and gill nets specifically. The scatter plots within each panel are charted in a two-dimensional space defined by the first two principal components, with the axes denoting the percentage of explained variance. Points are color-coded to denote distinct nutritional profile clusters derived from the k-means algorithm. Convex hulls define the periphery of each cluster, providing insight into the cluster density and separation. Convex hulls around the clusters aid in visualizing the distribution and delineation of nutritional profile groupings across different fishing methods and geographic areas.

Figure 5.2: Nutritional profile clustering of fishing trips by region and gear type. Each plot presents a k-means clustering analysis of fishing trip observations, grouped by their nutritional contributions to the Recommended Nutrient Intake (RNI) for six nutrients. The four panels, labeled (a) through (d), display data subsets for Atauro and the Mainland, utilizing all gear types and gill nets specifically. The scatter plots within each panel are charted in a two-dimensional space defined by the first two principal components, with the axes denoting the percentage of explained variance. Points are color-coded to denote distinct nutritional profile clusters derived from the k-means algorithm. Convex hulls define the periphery of each cluster, providing insight into the cluster density and separation. Convex hulls around the clusters aid in visualizing the distribution and delineation of nutritional profile groupings across different fishing methods and geographic areas.

The PERMANOVA analyses (Table 5.1) revealed statistically significant differences between clusters, suggesting robust groupings based on the nutrient profiles. The pseudo-F statistics were high in all cases, indicating strong differentiation between clusters. Specifically, the R² values were 0.87, 0.88, 0.84, and 0.80 for Atauro AG, Atauro GN, Mainland AG, and Mainland GN respectively, indicating that between 80% and 88% of the variance in nutrient concentrations was explained by the clusters. The high R² values underscore the distinctness of the clusters, reinforcing the validity of the K-means clustering.

These findings were consistent across all the datasets, with p-values below 0.001, providing clear evidence to reject the null hypothesis of no difference between clusters. Hence, the PERMANOVA results robustly support the effectiveness of the K-means algorithm in capturing meaningful patterns in nutrient profiles.
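To make the pseudo-F, R² and permutation p-value concrete, here is a minimal, dependency-free Python sketch of a one-factor PERMANOVA on squared Euclidean distances. It is an illustration under stated assumptions, not the code used for the analysis: the function names are hypothetical, the two-dimensional toy points are invented, and the real analysis works on six nutrient concentrations per fishing trip.

```python
import random
from itertools import combinations


def sum_sq(points, idx):
    """Sum of squared Euclidean distances among points in idx, divided by len(idx)."""
    idx = list(idx)
    total = 0.0
    for i, j in combinations(idx, 2):
        total += sum((a - b) ** 2 for a, b in zip(points[i], points[j]))
    return total / len(idx)


def permanova(points, labels, n_perm=199, seed=1):
    """Return (pseudo-F, R2, permutation p-value) for one grouping factor."""
    n = len(points)
    groups = sorted(set(labels))
    ss_total = sum_sq(points, range(n))

    def stats(lab):
        # Within-group SS: summed over groups; between-group SS by difference.
        ss_within = sum(
            sum_sq(points, [i for i in range(n) if lab[i] == g]) for g in groups
        )
        ss_between = ss_total - ss_within
        f = (ss_between / (len(groups) - 1)) / (ss_within / (n - len(groups)))
        return f, ss_between / ss_total

    f_obs, r2 = stats(labels)
    rng = random.Random(seed)
    shuffled = list(labels)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)          # break the link between trips and clusters
        if stats(shuffled)[0] >= f_obs:
            hits += 1
    return f_obs, r2, (hits + 1) / (n_perm + 1)


# Invented toy data: two tight, well-separated clusters.
toy_points = [(0, 0), (0, 1), (1, 0), (1, 1), (10, 10), (10, 11), (11, 10), (11, 11)]
toy_labels = ["A"] * 4 + ["B"] * 4
f_obs, r2, p_value = permanova(toy_points, toy_labels)
print(f"pseudo-F = {f_obs:.1f}, R2 = {r2:.3f}, p = {p_value:.3f}")
```

R² here is the between-cluster sum of squares divided by the total, which is why values of 0.80 to 0.88 correspond to 80% to 88% of the variance being explained by the clusters.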

-
- +
+

Table 5.1: Results of PERMANOVA analysis assessing the homogeneity of nutrient profiles within fishing trip clusters. The analysis was conducted across four datasets: Atauro with all gears (atauro_AG), Atauro with gill nets (atauro_GN), Mainland with all gears (mainland_AG), and Mainland with gill nets (mainland_GN). For each dataset, the term ‘clusters’ represents the between-cluster sum of squares (SUMOFSQS), which measures the variance in nutritional profiles explained by the clusters, while ‘Residual’ represents the variance remaining within clusters. Degrees of Freedom (DF), R-squared values (R2), and associated statistics indicate the strength and significance of the clustering. The R2 value quantifies the proportion of variance explained by the clusters.

- +
+

Table 5.2: Performance Metrics for XGBoost Model Across Fishing Data Subsets. This table provides a comprehensive overview of the predictive performance of an XGBoost classification model for four distinct subsets of fishing data: Atauro with all gears (ATAURO AG), Atauro with gill nets (ATAURO GN), Mainland with all gears (MAINLAND AG), and Mainland with gill nets (MAINLAND GN). Key performance indicators include ROC-AUC (area under the receiver operating characteristic curve), accuracy, Kappa (kap), sensitivity (sens), specificity (spec), positive predictive value (ppv), negative predictive value (npv), Matthews correlation coefficient (mcc), Youden’s J index (j_index), balanced accuracy (bal_accuracy), detection prevalence, precision, recall, and F measure (f_meas). The metrics collectively reflect the model’s ability to discriminate between nutritional profiles, its overall accuracy, and the balance between the sensitivity and specificity for each subset.


The analysis of SHAP values (see ML model explanation) from gill net models (Figure 5.4), which provide insights into how different factors influence predictions in an XGBoost model, shows how mesh size and habitat together predict nutrient profiles in the Atauro region. Smaller mesh sizes, specifically below 40 mm, are closely linked with a higher likelihood of predicting nutrient profile NP3 across habitats such as reefs and beaches. These smaller sizes are also associated, to a lesser extent, with NP4, particularly when fishing occurs in deeper waters. In contrast, mesh sizes around 50 mm are predominantly associated with nutrient profile NP2 in similar environments, with mangroves also playing a role.

As we look at larger mesh sizes, those ranging between 60 and 70 mm, there’s a notable association with nutrient profile NP5 across most habitats, including beaches, mangroves, and seagrass areas. There’s a smaller yet significant link to NP1, especially notable when fishing in reef areas. For meshes larger than 70 mm, the data suggests a shift, with nutrient profile NP4 emerging as the most likely prediction among various profiles, particularly within the Atauro subset.

When examining SHAP values derived from mainland data, a more varied pattern emerges. Smaller mesh sizes, less than 35 mm and used in deep water, reef, and FAD environments, are associated with nutrient profiles NP2 and NP4. The latter also shows a connection to beach habitats. Meshes in the 35 to 40 mm range are strong predictors for nutrient profile NP2 across a variety of environments, including FAD, deep, reef, and beach. As mesh sizes increase to between 40 and 70 mm, the predicted nutrient profiles become more dependent on the specific fishing ground. For example, while reefs are most likely to yield NP1 and to a lesser extent NP3, beaches or deep environments are typically associated with NP2. At the larger end of the spectrum, above 70 mm, NP5 becomes the probable prediction when fishing in deeper habitats, although NP2 remains a likely outcome if fishing occurs near beaches.

-
-Differential influence of mesh size on nutritional profile predictions across habitats. The figure compiles subplots for five distinct nutrient profiles (NP1-NP5) as predicted by gill net XGBoost models, with each subplot showing the distribution of SHAP values across varying mesh sizes. Each data point is colored to represent different habitats: Beach, Deep, FAD, Mangrove, Reef, Seagrass and Traditional FAD, providing insight into the habitat-specific impact of mesh size on the predictive accuracy of the model. The x-axis delineates the mesh size range, while the y-axis quantifies the magnitude of the SHAP value, with positive values denoting a heightened probability of a nutrient profile's occurrence and negative values indicating a reduced probability. +
+Differential influence of mesh size on nutritional profile predictions across habitats. The figure compiles subplots for five distinct nutrient profiles (NP1-NP5) as predicted by gill net XGBoost models, with each subplot showing the distribution of SHAP values across varying mesh sizes. Each data point is colored to represent different habitats: Beach, Deep, FAD, Mangrove, Reef, Seagrass and Traditional FAD, providing insight into the habitat-specific impact of mesh size on the predictive accuracy of the model. The x-axis delineates the mesh size range, while the y-axis quantifies the magnitude of the SHAP value, with positive values denoting a heightened probability of a nutrient profile's occurrence and negative values indicating a reduced probability.

Figure 5.4: Differential influence of mesh size on nutritional profile predictions across habitats. The figure compiles subplots for five distinct nutrient profiles (NP1-NP5) as predicted by gill net XGBoost models, with each subplot showing the distribution of SHAP values across varying mesh sizes. Each data point is colored to represent different habitats: Beach, Deep, FAD, Mangrove, Reef, Seagrass and Traditional FAD, providing insight into the habitat-specific impact of mesh size on the predictive accuracy of the model. The x-axis delineates the mesh size range, while the y-axis quantifies the magnitude of the SHAP value, with positive values denoting a heightened probability of a nutrient profile’s occurrence and negative values indicating a reduced probability.

SHAP results of all gears models …

-
-Lore ipsum +
+Lore ipsum

Figure 5.5: Lore ipsum

-
-Lore ipsum2 +
+Lore ipsum2

Figure 5.6: Lore ipsum2

diff --git a/docs/reference-keys.txt b/docs/reference-keys.txt index a89d8d0..4624a2f 100644 --- a/docs/reference-keys.txt +++ b/docs/reference-keys.txt @@ -3,10 +3,11 @@ fig:unnamed-chunk-2 fig:unnamed-chunk-3 fig:unnamed-chunk-4 fig:unnamed-chunk-5 +fig:unnamed-chunk-6 fig:model-settings -fig:unnamed-chunk-8 fig:unnamed-chunk-9 fig:unnamed-chunk-10 +fig:unnamed-chunk-11 content data catch-weight-and-nutrional-content diff --git a/docs/search_index.json b/docs/search_index.json index 7e23f1c..87c41cd 100644 --- a/docs/search_index.json +++ b/docs/search_index.json @@ -1 +1 @@ -[["index.html", "Modelling scenarios for nutrient-sensitive fisheries management 1 Content", " Modelling scenarios for nutrient-sensitive fisheries management Lorenzo Longobardi Last update: 2023-12-23 1 Content This book contains analyses and reports of the paper ‘Modelling scenarios for nutrient-sensitive fisheries management’. All data and code to generate the analyses are organised in https://github.com/WorldFishCenter/timor.nutrients. "],["data.html", "2 Data 2.1 Catch weight and nutritional content 2.2 Checks and limitations", " 2 Data The research presented in this book relies on two primary sources of data: Recorded Catch (RC): This dataset comprises detailed records of fishing trips that were documented by data collectors in the coastal municipalities of East Timor starting from January 2018. Estimated Catch (EC): This dataset provides a broader view of catch data on a regional level. It is created by combining RC with additional information, including the frequency of fishing trips made by each fishing boat and the total number of boats surveyed (censused) in each municipality. This combination extrapolates the recorded catch data to a larger scale. 2.1 Catch weight and nutritional content The total estimated catch weight is determined by the number of individuals and the length range of each catch. 
Specifically, during the initial phase of the Peskas project (July 2017 - April 2019), the standard length measurement used was the fork length (FL), which later changed to the total length (TL) in the subsequent and current version of the project. We utilized the API service offered by the FishBase database to incorporate length-to-length and length-to-weight conversion tables, using information from survey landings to calculate the weight in grams based on the following formula: W = a × L^b Here, W represents the weight in grams, L is the total length (TL) in centimeters, and a and b are the conversion parameters obtained from FishBase for each fish species. The FishBase database provides length-to-length and length-to-weight relationships for over 5,000 fish species. Typically, there are multiple records for the parameters a and b for each species. Since the length measurements in Peskas’ first version pertained to FL, we initially standardized all length measurements to TL using the FishBase length-to-length conversion tables. Subsequently, we applied the TL-to-weight conversion tables to estimate the weights. The FishBase length-to-weight conversion tables offer species-level taxonomic resolution. To derive a singular length-to-weight relationship for each fish group, we calculated the median values of parameters a and b for all species within a particular fish group. To ensure relevance to the region of interest, we refined the species list using FAO country codes (https://www.fao.org/countryprofiles/iso3list/en/) pertinent to Timor-Leste and Indonesia (country codes 626 and 360, respectively). For instance, to ascertain the weight of a catch categorized under the fish group labeled ECN (representing the Echeneidae family), we first identified the species within ECN documented in Timor-Leste and Indonesia. 
After this, we computed the average values of the parameters a and b for the identified species, which in this case were Echeneis naucrates and Remora remora (as illustrated in the figure below). Measured nutrient values for fish are scarce and typically limited to a few species and countries. To overcome this data limitation, MacNeil et al. developed a Bayesian hierarchical model that leverages both phylogenetic information and trait-based information to predict concentrations of seven essential nutrients: calcium, iron, omega-3 fatty acids, protein, selenium, vitamin A, and zinc for both marine and inland fish species globally (see Hicks et al. 2019). For each catch, the nutritional yield was calculated by combining the validated weight estimates for each fish group with the modelled nutrient concentrations. Specifically, we used the highest posterior predictive density values for each of the seven nutrients, which can be found in the repository (https://github.com/mamacneil/NutrientFishbase). For non-fish groups (octopuses, squids, cockles, shrimps, crabs, and lobsters), nutritional yield information was not available in the NutrientFishbase repository models. We retrieved the necessary data for these groups from the Global food composition database, using the same methodological approach as for the fish groups to estimate their nutritional content. To represent the nutrient concentration associated with each fish group, we used the median value as a summarizing metric. Figure 2.1: Distribution of nutrients’ concentration for each fish group. Dots represent the median, bars represent the 95% confidence interval. 2.2 Checks and limitations Check groups with higher dispersion… Do we need to narrow species grouping? 
"],["highlight.html", "3 Highlight statistics 3.1 Timor-Est SSF nutritional scenario", " 3 Highlight statistics 3.1 Timor-Est SSF nutritional scenario The table uses the EC dataset and summarizes the main statistics on nutrient supply for each region. Below is a description of each table column: MUNICIPALITY (POPULATION): Municipality and number of people > 5 years old in 2022. NUTRIENT: Nutrient of reference. ANNUAL SUPPLY: Aggregated annual value in kg. These values represent municipal-level estimates based on the number of fishing boats recorded in the 2021 Timor-Leste boat census, average number of fishing trips per boat and average landing weight values for each fish group. N. PEOPLE SUPPLIED DAILY: The number of people meeting the nutrient’s RNI in each municipality. The RNI values used (in grams) are the following: Selenium 0.000026, Zinc 0.0049, Protein 46, Total omega-3 PUFA 2.939, Calcium 1, Iron 0.0294, Vitamin-A 0.0005. 20% of each RNI value was taken as the reference, on the assumption that an ‘adequate diet’ comprises 5 food groups. RNIs were then converted from grams to kg (dividing by 1000) and the number of people supplied daily was calculated as: \\(\\frac{Annual\\ supply\\ (kg)}{(RNI\\times 0.20) \\ / 1000} /365\\) POPULATION MEETING RNI REQUIREMENTS: Percentage of the population meeting the RNI requirements in each municipality: \\(\\frac{Number\\ of\\ people\\ supplied\\ daily}{Municipality\\ population} \\times 100\\) "],["distribution.html", "4 Nutrients distribution 4.1 Fish groups 4.2 Habitat and gear type", " 4 Nutrients distribution This section presents analyses that illustrate the distribution of nutrients within various components of small-scale fisheries in East Timor. 4.1 Fish groups Figure 4.1: The bar chart illustrates the cumulative contribution of various marine food sources to the Recommended Nutrient Intake (RNI) for six essential nutrients, based on a 100g portion size. 
The x-axis is scaled in percentage terms, with the 100% mark indicating the complete RNI for a reproductive-age woman. Each horizontal bar is a stacked representation, segmented by color to denote the specific nutrient contributions from marine food sources. The marine food sources are labeled on the y-axis, which allows for a comparative visualization of their nutrient profiles, highlighting the diversity in nutrient density and emphasizing their potential significance in dietary nutrition. 4.2 Habitat and gear type Figure 4.2: Sankey diagram showing the relative distribution of key nutrients across various marine habitats and the corresponding extraction by different fishing gear types used in Timor-Est small-scale fisheries. "],["profiles.html", "5 Timor SSF nutrient profiles 5.1 Methods 5.2 Results 5.3 Preliminary considerations", " 5 Timor SSF nutrient profiles 5.1 Methods In this section, we identified recurrent nutritional profiles based on RC data, then, we predicted and explained the nutritional profiles on the basis of the fishing strategy and environmental factors. 5.1.1 Data analysis design and subset division As a first step we addressed the inherent imbalance in the RC data, a critical aspect for ensuring accurate and unbiased analysis. Notably, a substantial portion of the data, exceeding 40%, is from Atauro, with gill net being the most frequently reported gear type across all the municipalities. To mitigate the skew caused by this overrepresentation, we strategically divided the dataset into four distinct subsets: Atauro GN: Focused on data from Atauro using gill nets. Atauro AG: Included data from Atauro using fishing methods other than gill nets. Mainland GN: Comprised of gill net data from all municipalities excluding Atauro. Mainland AG: Encompassed data from all other municipalities using non-gill net fishing methods. This subdivision of the dataset was intended to reduce biases and enhance analytical precision. 
Furthermore, by isolating gill net data, we were able to specifically examine the impact of mesh size on the prediction of nutrient profiles in gill net catches, providing a more focused and detailed analysis of this gear type’s influence on nutritional outcomes. 5.1.2 Clustering and Classification After data partition, we identified recurrent nutritional profiles for each dataset. We assessed the total within-cluster sum of squares (WSS) of six nutrient concentrations (selenium excluded) to identify the optimal number of clusters (distinctive nutritional profiles). Once the optimal number of clusters had been established for each dataset, we applied the K-means clustering method to organize the data into distinct groups based on similarities in nutrient concentrations. Each trip was grouped based on its nutrient concentration profile, thereby enabling us to discern patterns and categorize trips according to their nutritional profile. The K-means algorithm assigns each data point to the nearest cluster, based on the mean value of the points in the cluster, and then recomputes the cluster means. This iterative process continues until the assignment of points to clusters no longer changes, at which point the algorithm has converged. The result is a set of clusters that represent unique nutritional profiles, each characterized by a specific combination of nutrient concentrations. Subsequent to the clustering, we conducted Permutational Multivariate Analysis of Variance (PERMANOVA) to validate the clustering methodology across four distinct datasets: Atauro AG, Atauro GN, Mainland AG, and Mainland GN. PERMANOVA is a robust non-parametric statistical test that evaluates whether there are significant differences between groups. Unlike traditional ANOVA, PERMANOVA does not rely on assumptions of normality and is therefore suitable for ecological data, which often do not follow normal distributions.
Our PERMANOVA analysis was conducted for each of the four subsets on a distance matrix representing pairwise dissimilarities in nutrient concentrations across all fishing trips. This approach allowed us to test the hypothesis that the nutrient profiles of fishing trips within the same cluster are more similar to each other than to trips in different clusters. Finally, we fitted an XGBoost model to each data subset to predict the nutritional profiles based on the fishing strategy, habitat and season. We employed the XGBoost algorithm due to its effectiveness in limiting overfitting and its ability to highlight key predictors. We used mesh size, habitat, quarter of the year, and vessel type as predictors for gill net subsets. For other gear types, the models used the habitat × gear interaction, habitat, gear type, quarter of the year, and vessel type as predictors. Model tuning adjusted several hyperparameters, including the number of trees, tree depth, loss reduction, sample size, and early stopping. Each of the four data subsets was split into training (80%) and testing (20%) sets, with 10-fold cross-validation applied to the training set for enhanced accuracy and generalizability. The models’ performance was assessed using accuracy, ROC AUC, sensitivity, and specificity, providing a comprehensive understanding of their ability to accurately distinguish between different nutritional profiles. The ROC curves and AUC values offered an additional layer of model effectiveness evaluation. We employed SHapley Additive exPlanations (SHAP) values to dissect and quantify the influence of various predictors on the nutritional profiles predicted by our XGBoost models. SHAP values, rooted in cooperative game theory, offer a nuanced approach to understanding machine learning model outputs.
They decompose a model’s prediction into contributions from each feature, illuminating not only the significance of these features but also the direction of their impact on the prediction. Specifically, for subsets involving gill net fishing methods (Atauro GN and Mainland GN), our focus was on understanding the impact of mesh size. In contrast, for the other subsets (Atauro AG and Mainland AG), which included different fishing methods, we concentrated on analyzing how the habitat and gear type interacted and influenced the nutritional profile predictions. 5.2 Results 5.2.1 Clusters The WSS analysis indicated that either 4 or 5 clusters was optimal for organizing each subset of our data. We decided to use 5 clusters for all subsets to maintain uniformity across our analyses and to better represent the varied patterns in nutrient profiles. The bar chart (Figure 5.1) displaying nutrient adequacy across nutrient profiles indicated the number of individuals meeting the Recommended Nutrient Intake (RNI) per 1 kg of catch for various nutrients. The profiles are the result of k-means clustering, reflecting distinct groupings based on the type and quantity of nutrients present in the catch. For the Atauro dataset using all gear types (Panel a), we observe diverse distributions of nutrient adequacy across the profiles. Specifically, clusters 1 and 2 exhibit a notably higher content of vitamin A relative to the other clusters, whereas calcium and protein appear more evenly distributed among all nutrient profiles. The distribution of zinc varies greatly, with cluster 5 showing the greatest concentration. Iron is most abundant in cluster 4, distinguishing it from the rest. For the subset of data from Atauro using only gill net gear (Panel b), the distribution is characterized by higher proportions of calcium in clusters 3 and 5.
Additionally, clusters 1 and 4 stand out due to their higher vitamin A content….etc…etc… Figure 5.1: Distribution of nutrient adequacy across k-means clusters. The bar chart delineates the number of individuals meeting the Recommended Nutrient Intake (RNI) per 1kg of catch within identified k-means clusters. Each bar is categorized into six segments corresponding to the evaluated nutrients. The clusters are enumerated on the y-axis, each representing a group with a distinct nutritional profile as determined by the cluster analysis. The x-axis quantifies the count of individuals within each cluster that meet the RNI for the respective nutrients, underlining the variability in nutrient adequacy across clusters. Panels (a) through (d) compare these distributions across different fishing practices and locations, namely Atauro and the Mainland, using all gear types or exclusively gill nets. The scatter plot from the k-means clustering (Figure 5.2) showed the distribution of nutrient profiles across different clusters in each data subset. The first two principal components explained a significant portion of the variance, indicating distinct groupings in nutrient profiles among the fishing trips. Figure 5.2: Nutritional profile clustering of fishing trips by region and gear type. Each plot presents a k-means clustering analysis of fishing trip observations, grouped by their nutritional contributions to the Recommended Nutrient Intake (RNI) for six nutrients. The four panels, labeled (a) through (d), display data subsets for Atauro and the Mainland, utilizing all gear types and gill nets specifically. The scatter plots within each panel are charted in a two-dimensional space defined by the first two principal components, with the axes denoting the percentage of explained variance. Points are color-coded to denote distinct nutritional profile clusters derived from the k-means algorithm. 
Convex hulls define the periphery of each cluster and aid in visualizing the separation of nutritional profile groupings across different fishing methods and geographic areas. The PERMANOVA analyses (Table 5.1) revealed statistically significant differences between clusters, suggesting robust groupings based on the nutrient profiles. The pseudo-F statistics were remarkably high in all cases, indicating strong differentiation between clusters. Specifically, the R² values were 0.87, 0.88, 0.84, and 0.80 for Atauro AG, Atauro GN, Mainland AG, and Mainland GN respectively, indicating that between 80% and 88% of the variance in nutrient concentrations was explained by the clusters. The high R² values underscore the distinctness of the clusters, reinforcing the validity of the K-means clustering. These findings were consistent across all the datasets, with p-values below 0.001, providing clear evidence to reject the null hypothesis of no difference between clusters. Hence, the PERMANOVA results robustly support the effectiveness of the K-means algorithm in capturing meaningful patterns in nutrient profiles. Table 5.1: Results of PERMANOVA analysis assessing the homogeneity of nutrient profiles within fishing trip clusters. The analysis was conducted across four datasets: Atauro with all gears (atauro_AG), Atauro with gill nets (atauro_GN), Mainland with all gears (mainland_AG), and Mainland with gill nets (mainland_GN). For each dataset, the term ‘clusters’ reports the sum of squares (SUMOFSQS) attributable to differences between the nutritional profiles, while ‘Residual’ reports the variation remaining within the nutritional profiles. Degrees of Freedom (DF), R-squared values (R2), and associated statistics indicate the strength and significance of the clustering. The R2 value quantifies the proportion of variance explained by the clusters.
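As a hedged illustration of how the R2 and pseudo-F values in a PERMANOVA table relate to the sums of squares, the Python sketch below partitions squared pairwise distances into between-cluster and within-cluster parts. It assumes Euclidean distances (the distance measure is not stated in this text), uses made-up points and labels, and omits the permutation step that produces the p-value; the function name permanova_stats is hypothetical and this is not the project's own code.

```python
def permanova_stats(points, labels):
    # Squared Euclidean distance between two observations.
    def sq_dist(p, q):
        return sum((x - y) ** 2 for x, y in zip(p, q))

    n = len(points)
    groups = sorted(set(labels))
    # Total sum of squares: squared pairwise distances summed, divided by N.
    ss_total = sum(sq_dist(points[i], points[j])
                   for i in range(n) for j in range(i + 1, n)) / n
    # Within-group sum of squares: same computation inside each group.
    ss_within = 0.0
    for g in groups:
        members = [p for p, lab in zip(points, labels) if lab == g]
        m = len(members)
        ss_within += sum(sq_dist(members[i], members[j])
                         for i in range(m) for j in range(i + 1, m)) / m
    # Between-group part is what the grouping term explains.
    ss_between = ss_total - ss_within
    r2 = ss_between / ss_total
    pseudo_f = (ss_between / (len(groups) - 1)) / (ss_within / (n - len(groups)))
    return r2, pseudo_f

# Made-up example: two tight groups that lie far apart.
points = [(0.0, 0.0), (0.0, 2.0), (10.0, 0.0), (10.0, 2.0)]
labels = [1, 1, 2, 2]
r2, pseudo_f = permanova_stats(points, labels)
```

In this toy case nearly all of the variation lies between the two groups, so R2 is close to 1 and the pseudo-F is large, which is the pattern the table reports for well-separated clusters.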
5.2.2 XGBoost model In the analysis of the XGBoost model’s predictive performance, both quantitative and visual assessments were conducted, detailed in Table 5.2 and Figure 5.3, respectively. The Receiver Operating Characteristic (ROC) curves (see ML model interpretation) presented in Figure 5.3 offer a graphical evaluation of the model’s sensitivity and specificity across four subsets of fishing data, categorized by region and gear type. These curves plot the true positive rate against the false positive rate for each nutritional profile group identified within the data. An examination of the ROC curves reveals variability in the model’s ability to distinguish between nutritional profile groups. The areas under the curves (AUC) provide a numerical measure of the model’s discriminative power, with a value of 1 representing perfect prediction and 0.5 indicating no discriminative power. While none of the profile groups reach perfection, several demonstrate substantial AUC values, indicating a robust ability to classify observations accurately. In comparing these visual findings with the statistical data from Table 5.2, it is observed that subsets from Atauro (both with all gears and gill nets) yield higher AUC, accuracy, and kappa statistics, suggesting a more consistent and accurate classification of nutritional profiles. These subsets also show higher sensitivity and specificity, indicating a balanced predictive capability for identifying true positives and true negatives. Conversely, the Mainland subsets exhibit lower performance metrics, indicating a more challenging classification scenario. This is reflected in the ROC curves where the lines for the Mainland subsets are farther from the top-left corner, suggesting a lower true positive rate relative to the false positive rate compared to the Atauro subsets. 
The positive predictive value (PPV) and negative predictive value (NPV), which provide insight into the model’s precision and reliability, also align with the ROC curve analysis, showing higher values for the Atauro subsets. This indicates that when the model predicts a particular nutritional profile for these subsets, it is more likely to be correct. The Matthew’s correlation coefficient (MCC) values, a balanced measure of quality for binary classifications, corroborate the ROC analysis by indicating that the Atauro subsets maintain a higher quality of prediction across classes. In summary, the integrated analysis of Table 5.2 and Figure 5.3 reveals a differentiated performance of the XGBoost model across various subsets of fishing data. The model showcases commendable predictive strength in the Atauro subsets, with high AUC, accuracy, and kappa metrics indicating a reliable classification of nutritional profiles. The ROC curve analysis further supports this, with curves for Atauro subsets nearer to the desired top-left corner, denoting higher sensitivity and specificity. In contrast, the Mainland subsets, despite achieving moderate success, suggest an area for improvement, as seen by their relative distance from the optimal point on the ROC curves and lower performance metrics. This suggests that while the model is effective in identifying nutritional profiles in certain contexts, its performance is not uniformly high across all subsets. Figure 5.3: Receiver Operating Characteristic (ROC) Curves for evaluating the performance of a cluster-based XGBoost classification model across four distinct fishing datasets: Atauro with all gears (a), Atauro with gill nets (b), Mainland with all gears (c), and Mainland with gill nets (d). Each curve represents one of the five clusters obtained from the classification, with different colors marking each cluster. 
Data points on the curves indicate the trade-off between sensitivity (true positive rate) and 1-specificity (false positive rate) for each cluster. The proximity of the curves to the top-left corner reflects the accuracy of the model in classifying the nutritional profiles into the correct clusters. Table 5.2: Performance Metrics for XGBoost Model Across Fishing Data Subsets. This table provides a comprehensive overview of the predictive performance of an XGBoost classification model for four distinct subsets of fishing data: Atauro with all gears (ATAURO AG), Atauro with gill nets (ATAURO GN), Mainland with all gears (MAINLAND AG), and Mainland with gill nets (MAINLAND GN). Key performance indicators include ROC-AUC (area under the receiver operating characteristic curve), accuracy, Kappa (kap), sensitivity (sens), specificity (spec), positive predictive value (ppv), negative predictive value (npv), Matthew’s correlation coefficient (mcc), Youden’s J index (j_index), balanced accuracy (bal_accuracy), detection prevalence, precision, recall, and F measure (f_meas). The metrics collectively reflect the model’s ability to discriminate between nutritional profiles, its overall accuracy, and the balance between the sensitivity and specificity for each subset. The analysis of SHAP values (see ML model explanation) from gill net models (Figure 5.4), which provide insights into how different factors influence predictions in an XGBoost model, shows how mesh size and habitat together predict nutrient profiles in the Atauro region. It’s found that smaller mesh sizes, specifically below 40 mm, are closely linked with a higher likelihood of predicting nutrient profile NP3 across various habitats like reefs and beaches. These smaller sizes also have a lesser association with NP4, particularly when fishing occurs in deeper waters. 
In contrast, mesh sizes around 50mm are predominantly associated with nutrient profile NP2 in similar environments, with mangroves also playing a role. As we look at larger mesh sizes, those ranging between 60 and 70 mm, there’s a notable association with nutrient profile NP5 across most habitats, including beaches, mangroves, and seagrass areas. There’s a smaller yet significant link to NP1, especially notable when fishing in reef areas. For meshes larger than 70 mm, the data suggests a shift, with nutrient profile NP4 emerging as the most likely prediction among various profiles, particularly within the Atauro subset. When examining SHAP values derived from mainland data, a more varied pattern emerges. Smaller mesh sizes, less than 35 mm and used in deep water, reef, and FAD environments, are associated with nutrient profiles NP2 and NP4. The latter also shows a connection to beach habitats. Meshes in the 35 to 40mm range are strong predictors for nutrient profile NP2 across a variety of environments, including FAD, deep, reef, and beach. As mesh sizes increase to between 40 and 70mm, the predicted nutrient profiles become more dependent on the specific fishing ground. For example, while reefs are most likely to yield NP1 and to a lesser extent NP3, beaches or deep environments are typically associated with NP2. At the larger end of the spectrum, above 70mm, NP5 becomes the probable prediction when fishing in deeper habitats, although NP2 remains a likely outcome if fishing occurs near beaches. Figure 5.4: Differential influence of mesh size on nutritional profile predictions across habitats. The figure compiles subplots for five distinct nutrient profiles (NP1-NP5) as predicted by gill net XGBoost models, with each subplot showing the distribution of SHAP values across varying mesh sizes. 
Each data point is colored to represent different habitats: Beach, Deep, FAD, Mangrove, Reef, Seagrass and Traditional FAD, providing insight into the habitat-specific impact of mesh size on the model’s predictions. The x-axis delineates the mesh size range, while the y-axis quantifies the magnitude of the SHAP value, with positive values denoting a heightened probability of a nutrient profile’s occurrence and negative values indicating a reduced probability. SHAP results of all gears models … Figure 5.5: Lore ipsum Figure 5.6: Lore ipsum2 5.3 Preliminary considerations A profiling approach can help reduce the risk of overfishing and habitat depletion. Indeed, instead of focusing on just one species, fishing effort can be spread across multiple fish groups when sourcing a particular nutrient. The results suggest that, in order to obtain a certain nutritional supply (for example iron-rich foods), we can leverage a diversified combination of gear types and habitats. From the results we can infer that gathering more information, particularly from less represented environments and fishing practices, can lead to new opportunities to improve the supply of foods targeting specific nutritional needs. "],["simple.html", "6 In simple terms 6.1 ML model interpretation 6.2 ML model explanation", " 6 In simple terms 6.1 ML model interpretation ROC Curve: The curve plots the true positive rate (sensitivity) against the false positive rate (1 - specificity) at various threshold settings. The true positive rate is on the y-axis, and the false positive rate is on the x-axis. Performance: A perfect classifier would have a point in the upper left corner of the graph, where the true positive rate is 1 (or 100%) and the false positive rate is 0. The closer the curve follows the left-hand border and then the top border of the ROC space, the more accurate the test. Diagonal Line: The dotted diagonal line represents a no-skill classifier (e.g., random guessing).
A good classifier stays as far away from this line as possible (toward the upper left corner). Area Under the Curve (AUC): The area under each ROC curve (AUC) is a measure of the classifier’s ability to discriminate between classes. An AUC of 0.5 suggests no discrimination (no better than random chance), while an AUC of 1.0 suggests perfect discrimination. 6.2 ML model explanation SHAP values: These help in understanding how each predictor in the dataset contributed to each particular prediction. A large positive SHAP value for a feature increases the probability of a certain prediction, while a large negative SHAP value decreases it. "],["references.html", "References", " References "],["404.html", "Page not found", " Page not found The page you requested cannot be found (perhaps it was moved or renamed). You may want to try searching to find the page's new location, or use the table of contents to find the page you are looking for. "]] +[["index.html", "Modelling scenarios for nutrient-sensitive fisheries management 1 Content", " Modelling scenarios for nutrient-sensitive fisheries management Lorenzo Longobardi Last update: 2023-12-27 1 Content This book contains analyses and reports for the paper ‘Modelling scenarios for nutrient-sensitive fisheries management’. All data and code to generate the analyses are organised in https://github.com/WorldFishCenter/timor.nutrients. "],["data.html", "2 Data 2.1 Catch weight and nutritional content 2.2 Checks and limitations", " 2 Data The research presented in this book relies on two primary sources of data: Recorded Catch (RC): This dataset comprises detailed records of fishing trips that were documented by data collectors in the coastal municipalities of East Timor starting from January 2018. Estimated Catch (EC): This dataset provides a broader view of catch data on a regional level.
It is created by combining RC with additional information, including the frequency of fishing trips made by each fishing boat and the total number of boats surveyed (censused) in each municipality. This combination extrapolates the recorded catch data to a larger scale. 2.1 Catch weight and nutritional content The total estimated catch weight is determined by the number of individuals and the length range of each catch. Specifically, during the initial phase of the Peskas project (July 2017 - April 2019), the standard length measurement used was the fork length (FL), which later changed to the total length (TL) in the subsequent and current version of the project. We utilized the API service offered by the FishBase database to incorporate length-to-length and length-to-weight conversion tables, using information from survey landings to calculate the weight in grams based on the following formula: \\(W = a \\times L^{b}\\) Here, W represents the weight in grams, L is the total length (TL) in centimeters, and a and b are the conversion parameters obtained from FishBase for each fish species. The FishBase database provides length-to-length and length-to-weight relationships for over 5,000 fish species. Typically, there are multiple records for the parameters a and b for each species. Since the length measurements in Peskas’ first version pertained to FL, we initially standardized all length measurements to TL using the FishBase length-to-length conversion tables. Subsequently, we applied the TL-to-weight conversion tables to estimate the weights. The FishBase length-to-weight conversion tables offer species-level taxonomic resolution. To derive a single length-to-weight relationship for each fish group, we calculated the median values of parameters a and b for all species within a particular fish group.
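As a minimal sketch of the conversion just described, the Python snippet below takes the median of species-level parameters for one fish group and applies the length-to-weight relationship. The (a, b) pairs are hypothetical placeholders rather than FishBase records, and the function names are illustrative assumptions, not part of the project's code.

```python
from statistics import median

# Hypothetical species-level (a, b) pairs for one fish group; illustrative only.
species_params = [(0.010, 3.00), (0.012, 3.10), (0.020, 2.90)]

def group_params(params):
    # Median of a and median of b across all species in the group.
    a = median(p[0] for p in params)
    b = median(p[1] for p in params)
    return a, b

def weight_grams(total_length_cm, a, b):
    # Length-to-weight relationship: W = a * L^b (W in grams, L in cm).
    return a * total_length_cm ** b

a, b = group_params(species_params)
w = weight_grams(20.0, a, b)
```

Taking the two medians separately follows the wording of the text; it yields one (a, b) pair per fish group that can then be applied to every recorded length in that group.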
To ensure relevance to the region of interest, we refined the species list using FAO country codes (https://www.fao.org/countryprofiles/iso3list/en/) pertinent to Timor-Leste and Indonesia (country codes 626 and 360, respectively). For instance, to ascertain the weight of a catch categorized under the fish group labeled ECN (representing the Echeneidae family), we first identified the species within ECN documented in Timor-Leste and Indonesia. After this, we computed the average values of the parameters a and b for the identified species, which in this case were Echeneis naucrates and Remora remora (as illustrated in the figure below). Measured nutrient values for fish are scarce and typically limited to a few species and countries. To overcome this data limitation, MacNeil et al. developed a Bayesian hierarchical model that leverages both phylogenetic information and trait-based information to predict concentrations of seven essential nutrients (calcium, iron, omega-3 fatty acids, protein, selenium, vitamin A, and zinc) for both marine and inland fish species globally (see Hicks et al. 2019). For each catch, the nutritional yield was calculated by combining the validated weight estimates for each fish group with the modelled nutrient concentrations. Specifically, we used the highest posterior predictive density values for each of the seven nutrients, which can be found in the repository (https://github.com/mamacneil/NutrientFishbase). For non-fish groups (including octopuses, squids, cockles, shrimps, crabs, and lobsters), nutritional yield information was not available in the NutrientFishbase repository models. We retrieved the necessary data for these groups from the Global food composition database, using the same methodological approach as for the fish groups to estimate their nutritional content. To represent the nutrient concentration associated with each fish group, we used the median value as a summarizing metric.
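The step of combining a catch weight with nutrient concentrations can be sketched as below. This is an illustration under stated assumptions: concentrations are taken to be expressed per 100 g of catch, and both the values and the names (concentration_per_100g, nutrient_yield) are hypothetical rather than taken from the NutrientFishbase models or the project's code.

```python
# Hypothetical per-100 g concentrations for one fish group (illustrative values and units).
concentration_per_100g = {'calcium_mg': 50.0, 'iron_mg': 1.5, 'protein_g': 19.0}

def nutrient_yield(catch_weight_g, concentrations):
    # Scale each per-100 g concentration by the catch weight.
    factor = catch_weight_g / 100.0
    return {nutrient: value * factor for nutrient, value in concentrations.items()}

# A 2.5 kg catch of this (hypothetical) fish group.
yields = nutrient_yield(2500.0, concentration_per_100g)
```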
Figure 2.1: Distribution of nutrients’ concentration for each fish group. Dots represent the median, bars represent the 95% confidence interval. 2.2 Checks and limitations Check groups with higher dispersion… Do we need to narrow species grouping? "],["highlight.html", "3 Highlight statistics 3.1 Timor-Est SSF nutritional scenario", " 3 Highlight statistics 3.1 Timor-Est SSF nutritional scenario The table uses the EC dataset and summarizes the main statistics on nutrient supply for each region. Below is a description of each table column: MUNICIPALITY (POPULATION): Municipality and number of people > 5 years old in 2022. NUTRIENT: Nutrient of reference. ANNUAL SUPPLY: Aggregated annual value in kg. These values represent municipal-level estimates based on the number of fishing boats recorded in the 2021 Timor-Leste boat census, average number of fishing trips per boat and average landing weight values for each fish group. N. PEOPLE SUPPLIED DAILY: The number of people meeting the nutrient’s RNI for each municipality. RNI values used are the following (in grams): Selenium 0.000026; Zinc 0.0049; Protein 46; Total omega-3 PUFA 2.939; Calcium 1; Iron 0.0294; Vitamin A 0.0005. 20% of each RNI value was taken as the reference, considering that an ‘adequate diet’ is expected to comprise 5 food groups. RNIs were then converted from grams to kg (dividing by 1000) and the number of people supplied daily was calculated as: \\(\\frac{Annual\\ supply\\ (kg)}{(RNI\\times 0.20) \\ / 1000} /365\\) POPULATION MEETING RNI REQUIREMENTS: Percentage of the population meeting the RNI requirements in each municipality: \\(\\frac{Number\\ of\\ people\\ supplied\\ daily}{Municipality\\ population} \\times 100\\) "],["distribution.html", "4 Nutrients distribution 4.1 Fish groups 4.2 Habitat and gear type", " 4 Nutrients distribution This section presents the analyses that illustrate the distribution of nutrients within various components of small-scale fisheries in East Timor.
4.1 Fish groups Figure 4.1: The bar chart illustrates the contribution of a variety of marine food sources to the Recommended Nutrient Intake (RNI) for six fundamental nutrients, based on a 100 g portion. The x-axis represents the proportion of RNI fulfilled, with the 100% benchmark signifying the complete RNI for an adult woman of reproductive age. Each bar is a color-segmented stacked visual, with distinct hues corresponding to individual nutrients, and white numbers within indicating the specific percentage contribution of each nutrient. The chart incorporates the total annual catch in metric tons for each marine species from 2018 to 2023, presented at the end of each bar, providing a view of both the nutritional value and the harvest volume of these essential food sources. The transparency of these values is adjusted to reflect each species’ relative contribution to the overall catch. Figure 4.2: Distribution of nutritional content among different fish groups. This series of bar graphs delineates the contribution of various fish groups to the total nutrient stock, highlighting the top ten fish groups for calcium, omega-3, iron, protein, vitamin A, and zinc. Each graph is ordered to reflect the descending contribution of each fish group relative to each nutrient. 4.2 Habitat and gear type Figure 4.3: Sankey diagram showing the relative distribution of key nutrients across various marine habitats and the corresponding extraction by different fishing gear types used in Timor-Leste small-scale fisheries. "],["profiles.html", "5 Timor SSF nutrient profiles 5.1 Methods 5.2 Results 5.3 Preliminary considerations", " 5 Timor SSF nutrient profiles 5.1 Methods In this section, we identified recurrent nutritional profiles based on RC data and then predicted and explained these profiles on the basis of the fishing strategy and environmental factors.
5.1.1 Data analysis design and subset division As a first step we addressed the inherent imbalance in the RC data, a critical aspect for ensuring accurate and unbiased analysis. Notably, a substantial portion of the data, exceeding 40%, is from Atauro, with gill net being the most frequently reported gear type across all the municipalities. To mitigate the skew caused by this overrepresentation, we divided the dataset into four distinct subsets: Atauro GN: Data from Atauro using gill nets. Atauro AG: Data from Atauro using fishing methods other than gill nets. Mainland GN: Gill net data from all municipalities excluding Atauro. Mainland AG: Data from all other municipalities using non-gill net fishing methods. This subdivision of the dataset was intended to reduce biases and enhance analytical precision. Furthermore, by isolating gill net data, we were able to specifically examine the impact of mesh size on the prediction of nutrient profiles in gill net catches, providing a more focused and detailed analysis of this gear type’s influence on nutritional outcomes. 5.1.2 Clustering and Classification After data partition, we identified recurrent nutritional profiles for each dataset. We assessed the total within-cluster sum of squares (WSS) of six nutrient concentrations (selenium excluded) to identify the optimal number of clusters (distinctive nutritional profiles). Once the optimal number of clusters had been established for each dataset, we applied the K-means clustering method to organize the data into distinct groups based on similarities in nutrient concentrations. Each trip was grouped based on its nutrient concentration profile, thereby enabling us to discern patterns and categorize trips according to their nutritional profile. The K-means algorithm assigns each data point to the nearest cluster, based on the mean value of the points in the cluster, and then recomputes the cluster means.
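A minimal pure-Python sketch of K-means may help make the assignment step and the WSS criterion concrete. It is illustrative only: the starting centroids are supplied by the caller, the toy points are made up, and the function name kmeans is an assumption; the implementation actually used for the analysis is not shown in this text.

```python
def kmeans(points, initial_centroids, max_iter=100):
    # Squared Euclidean distance between two observations.
    def sq_dist(p, q):
        return sum((x - y) ** 2 for x, y in zip(p, q))

    centroids = list(initial_centroids)
    assignment = None
    for _ in range(max_iter):
        # Assignment step: each point goes to its nearest centroid.
        new_assignment = [
            min(range(len(centroids)), key=lambda k: sq_dist(p, centroids[k]))
            for p in points
        ]
        if new_assignment == assignment:
            break  # Assignments stopped changing, so the algorithm has converged.
        assignment = new_assignment
        # Update step: each centroid becomes the mean of its assigned points.
        for k in range(len(centroids)):
            members = [p for p, a in zip(points, assignment) if a == k]
            if members:
                centroids[k] = tuple(sum(c) / len(members) for c in zip(*members))
    # Total within-cluster sum of squares (WSS) for the final clustering.
    wss = sum(sq_dist(p, centroids[a]) for p, a in zip(points, assignment))
    return assignment, centroids, wss

# Toy data: two obvious groups.
points = [(0.0, 0.0), (0.0, 1.0), (10.0, 0.0), (10.0, 1.0)]
assignment, centroids, wss = kmeans(points, [(0.0, 0.0), (10.0, 0.0)])
```

Running the function for several candidate numbers of clusters and comparing the returned WSS values is one way to look for the point where adding clusters stops reducing WSS appreciably.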
This iterative process continues until the assignment of points to clusters no longer changes, at which point the algorithm has converged. The result is a set of clusters that represent unique nutritional profiles, each characterized by a specific combination of nutrient concentrations. Subsequent to the clustering, we conducted Permutational Multivariate Analysis of Variance (PERMANOVA) to validate the clustering methodology across four distinct datasets: Atauro AG, Atauro GN, Mainland AG, and Mainland GN. PERMANOVA is a robust non-parametric statistical test that evaluates whether there are significant differences between groups. Unlike traditional ANOVA, PERMANOVA does not rely on assumptions of normality and is therefore suitable for ecological data, which often do not follow normal distributions. Our PERMANOVA analysis was conducted for each of the four subsets on a distance matrix representing pairwise dissimilarities in nutrient concentrations across all fishing trips. This approach allowed us to test the hypothesis that the nutrient profiles of fishing trips within the same cluster are more similar to each other than to trips in different clusters. Finally, we fitted an XGBoost model to each data subset to predict the nutritional profiles based on the fishing strategy, habitat and season. We employed the XGBoost algorithm due to its effectiveness in limiting overfitting and its ability to highlight key predictors. We used mesh size, habitat, quarter of the year, and vessel type as predictors for gill net subsets. For other gear types, the models used the habitat × gear interaction, habitat, gear type, quarter of the year, and vessel type as predictors. Model tuning adjusted several hyperparameters, including the number of trees, tree depth, loss reduction, sample size, and early stopping.
The four data subsets were each split into training (80%) and testing (20%) sets, with 10-fold cross-validation applied to the training set to improve the reliability of the performance estimates. The models’ performance was assessed using accuracy, ROC AUC, sensitivity, and specificity, providing a comprehensive understanding of their ability to accurately distinguish between different nutritional profiles. The ROC curves and AUC values offered an additional layer of model effectiveness evaluation. We employed SHapley Additive exPlanations (SHAP) values to quantify the influence of the predictors on the nutritional profiles predicted by our XGBoost models. SHAP values, rooted in cooperative game theory, decompose a model’s prediction into contributions from each feature, showing not only the importance of these features but also the direction of their impact on the prediction. Specifically, for subsets involving gill net fishing methods (Atauro GN and Mainland GN), our focus was on understanding the impact of mesh size. In contrast, for the other subsets (Atauro AG and Mainland AG), which included different fishing methods, we concentrated on analyzing how the habitat and gear type interacted and influenced the nutritional profile predictions. 5.2 Results 5.2.1 Clusters The WSS analysis indicated that either 4 or 5 clusters were optimal for each subset of our data. We decided to use 5 clusters for all subsets to maintain uniformity across our analyses and to better represent the varied patterns in nutrient profiles. The bar chart (Figure 5.1) displays nutrient adequacy across nutrient profiles as the number of individuals meeting the Recommended Nutrient Intake (RNI) per 1 kg of catch for various nutrients. The profiles are the result of k-means clustering, reflecting distinct groupings based on the type and quantity of nutrients present in the catch.
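The WSS comparison and the K-means step described above can be sketched in base R. This is a minimal illustration on synthetic data, not the report's code: the `nutrients` matrix, the scaling step, the candidate range of k, the seed, and the `nstart` and `iter.max` values are all assumptions.

```r
# Hedged sketch: synthetic "trips x nutrients" data, hypothetical object names.
set.seed(42)

nutrients <- matrix(
  rnorm(300 * 6),
  ncol = 6,
  dimnames = list(NULL, c("calcium", "iron", "omega3", "protein", "vitaminA", "zinc"))
)

# Assumption: standardize so no single nutrient dominates Euclidean distances
nutrients_scaled <- scale(nutrients)

# Total within-cluster sum of squares (WSS) for each candidate k
k_values <- 1:10
wss <- vapply(
  k_values,
  function(k) {
    kmeans(nutrients_scaled, centers = k, nstart = 25, iter.max = 50)$tot.withinss
  },
  numeric(1)
)

# The "elbow" is where adding clusters stops reducing WSS substantially
plot(k_values, wss, type = "b",
     xlab = "Number of clusters (k)", ylab = "Total within-cluster SS")

# Fit the final model with the chosen number of clusters (5 in the text)
fit <- kmeans(nutrients_scaled, centers = 5, nstart = 25, iter.max = 50)
table(fit$cluster)
```

With purely random input no clear elbow is expected; on real data the curve typically flattens after the point where extra clusters stop capturing distinct profiles.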
For the Atauro dataset using all gear types (Panel a), we observe diverse distributions of nutrient adequacy across the profiles. Specifically, clusters 1 and 2 exhibit a notably higher content of vitamin A relative to the other clusters, whereas calcium and protein appear more evenly distributed among all nutrient profiles. The distribution of zinc varies greatly, with cluster 5 showing the greatest concentration. Iron is most abundant in cluster 4, distinguishing it from the rest. For the subset of data from Atauro using only gill net gear (Panel b), the distribution is characterized by higher proportions of calcium in clusters 3 and 5. Additionally, clusters 1 and 4 stand out due to their higher vitamin A content….etc…etc… Figure 5.1: Distribution of nutrient adequacy across k-means clusters. The bar chart shows the number of individuals meeting the Recommended Nutrient Intake (RNI) per 1 kg of catch within the identified k-means clusters. Each bar is divided into six segments corresponding to the evaluated nutrients. The clusters are listed on the y-axis, each representing a group with a distinct nutritional profile as determined by the cluster analysis. The x-axis shows the number of individuals within each cluster that meet the RNI for the respective nutrients, underlining the variability in nutrient adequacy across clusters. Panels (a) through (d) compare these distributions across different fishing practices and locations, namely Atauro and the Mainland, using all gear types or exclusively gill nets. The scatter plot from the k-means clustering (Figure 5.2) showed the distribution of nutrient profiles across different clusters in each data subset. The first two principal components explained a significant portion of the variance, indicating distinct groupings in nutrient profiles among the fishing trips. Figure 5.2: Nutritional profile clustering of fishing trips by region and gear type.
Each plot presents a k-means clustering analysis of fishing trip observations, grouped by their nutritional contributions to the Recommended Nutrient Intake (RNI) for six nutrients. The four panels, labeled (a) through (d), display data subsets for Atauro and the Mainland, using all gear types and gill nets specifically. The scatter plots within each panel are drawn in a two-dimensional space defined by the first two principal components, with the axes denoting the percentage of explained variance. Points are color-coded to denote distinct nutritional profile clusters derived from the k-means algorithm. Convex hulls mark the periphery of each cluster, helping to visualize the density, separation, and delineation of nutritional profile groupings across different fishing methods and geographic areas. The PERMANOVA analyses (Table 5.1) revealed statistically significant differences between clusters, suggesting robust groupings based on the nutrient profiles. The pseudo-F statistics were high in all cases, indicating strong differentiation between clusters. Specifically, the R² values were 0.87, 0.88, 0.84, and 0.80 for Atauro AG, Atauro GN, Mainland AG, and Mainland GN respectively, indicating that between 80% and 88% of the variance in nutrient concentrations was explained by the clusters. The high R² values underscore the distinctness of the clusters, reinforcing the validity of the K-means clustering. These findings were consistent across all the datasets, with p-values below 0.001, providing clear evidence to reject the null hypothesis of no difference between clusters. Hence, the PERMANOVA results support the effectiveness of the K-means algorithm in capturing meaningful patterns in nutrient profiles. Table 5.1: Results of PERMANOVA analysis assessing the homogeneity of nutrient profiles within fishing trip clusters.
The analysis was conducted across four datasets: Atauro with all gears (atauro_AG), Atauro with gill nets (atauro_GN), Mainland with all gears (mainland_AG), and Mainland with gill nets (mainland_GN). For each dataset, the term ‘clusters’ reports the between-group sum of squares (SUMOFSQS), that is, the variance in nutritional profiles explained by cluster membership, while ‘Residual’ reports the remaining within-group variance. Degrees of Freedom (DF), R-squared values (R2), and associated statistics indicate the strength and significance of the clustering. The R2 value quantifies the proportion of variance explained by the clusters. 5.2.2 XGBoost model In the analysis of the XGBoost model’s predictive performance, both quantitative and visual assessments were conducted, detailed in Table 5.2 and Figure 5.3, respectively. The Receiver Operating Characteristic (ROC) curves (see ML model interpretation) presented in Figure 5.3 offer a graphical evaluation of the model’s sensitivity and specificity across four subsets of fishing data, categorized by region and gear type. These curves plot the true positive rate against the false positive rate for each nutritional profile group identified within the data. An examination of the ROC curves reveals variability in the model’s ability to distinguish between nutritional profile groups. The areas under the curves (AUC) provide a numerical measure of the model’s discriminative power, with a value of 1 representing perfect prediction and 0.5 indicating no discriminative power. While none of the profile groups reach perfection, several demonstrate substantial AUC values, indicating a robust ability to classify observations accurately. In comparing these visual findings with the statistical data from Table 5.2, it is observed that subsets from Atauro (both with all gears and gill nets) yield higher AUC, accuracy, and kappa statistics, suggesting a more consistent and accurate classification of nutritional profiles.
These subsets also show higher sensitivity and specificity, indicating a balanced predictive capability for identifying true positives and true negatives. Conversely, the Mainland subsets exhibit lower performance metrics, indicating a more challenging classification scenario. This is reflected in the ROC curves, where the lines for the Mainland subsets are farther from the top-left corner, suggesting a lower true positive rate relative to the false positive rate compared to the Atauro subsets. The positive predictive value (PPV) and negative predictive value (NPV), which provide insight into the model’s precision and reliability, also align with the ROC curve analysis, showing higher values for the Atauro subsets. This indicates that when the model predicts a particular nutritional profile for these subsets, it is more likely to be correct. The Matthews correlation coefficient (MCC) values, a balanced measure of classification quality, corroborate the ROC analysis by indicating that the Atauro subsets maintain a higher quality of prediction across classes. In summary, the integrated analysis of Table 5.2 and Figure 5.3 reveals a differentiated performance of the XGBoost model across the subsets of fishing data. The model shows good predictive performance in the Atauro subsets, with high AUC, accuracy, and kappa metrics indicating a reliable classification of nutritional profiles. The ROC curve analysis further supports this, with curves for Atauro subsets nearer to the desired top-left corner, denoting higher sensitivity and specificity. In contrast, the Mainland subsets achieve only moderate success and leave room for improvement, as seen by their relative distance from the optimal point on the ROC curves and their lower performance metrics. This suggests that while the model is effective in identifying nutritional profiles in certain contexts, its performance is not uniformly high across all subsets.
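To make the AUC interpretation above concrete, the following base R sketch computes a one-vs-rest AUC from predicted scores using the rank (Mann-Whitney) formulation. The `auc_rank` helper, the labels, and the toy scores are hypothetical and are not part of the report's pipeline; the report's own multiclass ROC AUC may have been computed with a different averaging scheme.

```r
# Hedged sketch: hypothetical helper and toy data, base R only.
# AUC equals the probability that a randomly chosen positive case receives
# a higher score than a randomly chosen negative case (ties count as 0.5).
auc_rank <- function(scores, is_positive) {
  n_pos <- sum(is_positive)
  n_neg <- sum(!is_positive)
  r <- rank(scores) # average ranks, so tied scores are handled
  (sum(r[is_positive]) - n_pos * (n_pos + 1) / 2) / (n_pos * n_neg)
}

# One-vs-rest setup: is a trip in profile NP1 or not?
truth <- c("NP1", "NP1", "NP2", "NP3", "NP2", "NP1")
is_np1 <- truth == "NP1"

# Every NP1 trip scores above every other trip: perfect discrimination
auc_perfect <- auc_rank(c(0.9, 0.8, 0.3, 0.2, 0.6, 0.7), is_np1)

# Identical scores for all trips: no discrimination
auc_random <- auc_rank(rep(0.5, 6), is_np1)

# One NP1 trip (0.4) scores below a non-NP1 trip (0.6): 8 of 9 pairs ordered correctly
auc_partial <- auc_rank(c(0.9, 0.4, 0.3, 0.2, 0.6, 0.7), is_np1)

print(c(perfect = auc_perfect, random = auc_random, partial = auc_partial))
```

Repeating the calculation once per profile, each time treating that profile as the positive class, gives one AUC per curve of the kind drawn in a per-cluster ROC plot.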
Figure 5.3: Receiver Operating Characteristic (ROC) curves for evaluating the performance of a cluster-based XGBoost classification model across four distinct fishing datasets: Atauro with all gears (a), Atauro with gill nets (b), Mainland with all gears (c), and Mainland with gill nets (d). Each curve represents one of the five clusters obtained from the classification, with different colors marking each cluster. Data points on the curves indicate the trade-off between sensitivity (true positive rate) and 1-specificity (false positive rate) for each cluster. The proximity of the curves to the top-left corner reflects the accuracy of the model in classifying the nutritional profiles into the correct clusters. Table 5.2: Performance metrics for the XGBoost model across fishing data subsets. This table provides an overview of the predictive performance of an XGBoost classification model for four distinct subsets of fishing data: Atauro with all gears (ATAURO AG), Atauro with gill nets (ATAURO GN), Mainland with all gears (MAINLAND AG), and Mainland with gill nets (MAINLAND GN). Key performance indicators include ROC-AUC (area under the receiver operating characteristic curve), accuracy, Kappa (kap), sensitivity (sens), specificity (spec), positive predictive value (ppv), negative predictive value (npv), Matthews correlation coefficient (mcc), Youden’s J index (j_index), balanced accuracy (bal_accuracy), detection prevalence, precision, recall, and F measure (f_meas). The metrics collectively reflect the model’s ability to discriminate between nutritional profiles, its overall accuracy, and the balance between the sensitivity and specificity for each subset. The analysis of SHAP values (see ML model explanation) from gill net models (Figure 5.4), which provide insights into how different factors influence predictions in an XGBoost model, shows how mesh size and habitat together predict nutrient profiles in the Atauro region.
Smaller mesh sizes, specifically below 40 mm, are closely linked with a higher likelihood of predicting nutrient profile NP3 across various habitats such as reefs and beaches. These smaller sizes also have a weaker association with NP4, particularly when fishing occurs in deeper waters. In contrast, mesh sizes around 50 mm are predominantly associated with nutrient profile NP2 in similar environments, with mangroves also playing a role. Larger mesh sizes, ranging between 60 and 70 mm, show a notable association with nutrient profile NP5 across most habitats, including beaches, mangroves, and seagrass areas. There is a smaller yet significant link to NP1, especially when fishing in reef areas. For meshes larger than 70 mm, the data suggest a shift, with nutrient profile NP4 emerging as the most likely prediction among the profiles within the Atauro subset. When examining SHAP values derived from mainland data, a more varied pattern emerges. Smaller mesh sizes, less than 35 mm and used in deep water, reef, and FAD environments, are associated with nutrient profiles NP2 and NP4. The latter also shows a connection to beach habitats. Meshes in the 35 to 40 mm range are strong predictors for nutrient profile NP2 across a variety of environments, including FAD, deep, reef, and beach. As mesh sizes increase to between 40 and 70 mm, the predicted nutrient profiles become more dependent on the specific fishing ground. For example, while reefs are most likely to yield NP1 and to a lesser extent NP3, beaches or deep environments are typically associated with NP2. At the larger end of the spectrum, above 70 mm, NP5 becomes the probable prediction when fishing in deeper habitats, although NP2 remains a likely outcome if fishing occurs near beaches. Figure 5.4: Differential influence of mesh size on nutritional profile predictions across habitats.
The figure compiles subplots for five distinct nutrient profiles (NP1-NP5) as predicted by gill net XGBoost models, with each subplot showing the distribution of SHAP values across varying mesh sizes. Each data point is colored to represent different habitats: Beach, Deep, FAD, Mangrove, Reef, Seagrass and Traditional FAD, providing insight into the habitat-specific impact of mesh size on the model’s predictions. The x-axis shows the mesh size range, while the y-axis shows the magnitude of the SHAP value, with positive values denoting a heightened probability of a nutrient profile’s occurrence and negative values indicating a reduced probability. SHAP results of all gears models … Figure 5.5: Lore ipsum Figure 5.6: Lore ipsum2 5.3 Preliminary considerations A profiling approach can help avoid overfishing and habitat depletion. Instead of focusing on just one species, fishing effort is spread across multiple fish groups when sourcing a particular nutrient. The results suggest that, in order to obtain a certain nutritional supply (for example iron-rich foods), we can leverage a diversified combination of gear types and habitats. From the results we can infer that gathering more information, particularly from less represented environments and fishing practices, can lead to new opportunities to improve the supply of foods targeting specific nutritional needs. "],["simple.html", "6 In simple terms 6.1 ML model interpretation 6.2 ML model explanation", " 6 In simple terms 6.1 ML model interpretation ROC Curve: The curve plots the true positive rate (sensitivity) against the false positive rate (1 - specificity) at various threshold settings. The true positive rate is on the y-axis, and the false positive rate is on the x-axis. Performance: A perfect classifier would have a point in the upper left corner of the graph, where the true positive rate is 1 (or 100%) and the false positive rate is 0.
The closer the curve follows the left-hand border and then the top border of the ROC space, the more accurate the test. Diagonal Line: The dotted diagonal line represents a no-skill classifier (e.g., random guessing). A good classifier stays as far away from this line as possible (toward the upper left corner). Area Under the Curve (AUC): The area under each ROC curve (AUC) is a measure of the test’s accuracy. An AUC of 0.5 suggests no discrimination (no better than random chance), while an AUC of 1.0 suggests perfect discrimination. 6.2 ML model explanation SHAP values: help in understanding how each predictor in the dataset contributed to each particular prediction. A high positive SHAP value for a feature increases the probability of a certain prediction, while a high negative SHAP value decreases it. "],["references.html", "References", " References "],["404.html", "Page not found", " Page not found The page you requested cannot be found (perhaps it was moved or renamed). You may want to try searching to find the page's new location, or use the table of contents to find the page you are looking for. "]] diff --git a/docs_book/03-nutrients_distribution.Rmd b/docs_book/03-nutrients_distribution.Rmd index 22c5e5f..9564476 100644 --- a/docs_book/03-nutrients_distribution.Rmd +++ b/docs_book/03-nutrients_distribution.Rmd @@ -4,7 +4,8 @@ This section presents the analyses that illustrates the distribution of nutrient ## Fish groups -```{r echo=FALSE, fig.height=8, fig.width=7, message=FALSE, warning=FALSE, out.width='80%', fig.cap="The bar chart illustrates the cumulative contribution of various marine food sources to the Recommended Nutrient Intake (RNI) for six essential nutrients, based on a 100g portion size. The x-axis is scaled in percentage terms, with the 100% mark indicating the complete RNI for a reproductive-age woman. Each horizontal bar is a stacked representation, segmented by color to denote the specific nutrient contributions from marine food sources. 
The marine food sources are labeled on the y-axis, which allows for a comparative visualization of their nutrient profiles, highlighting the diversity in nutrient density and emphasizing their potential significance in dietary nutrition."} +```{r echo=FALSE, fig.height=8, fig.width=7, message=FALSE, warning=FALSE, out.width='80%', fig.cap="The bar chart illustrates the contribution of a variety of marine food sources to the Recommended Nutrient Intake (RNI) for six fundamental nutrients, based on a 100g portion. The x-axis represents the proportion of RNI fulfilled, with the 100% benchmark signifying the complete RNI for an adult woman of reproductive age. Each bar is a color-segmented stacked visual, with distinct hues corresponding to individual nutrients, and white numbers within indicating the specific percentage contribution of each nutrient. The chart incorporates the total annual catch in metric tons for each marine species from 2018 to 2023, presented at the end of each bar, providing a view of both the nutritional value and the harvest volume of these essential food sources. 
The transparency of these values is adjusted to reflect each species' relative contribution to the overall catch"} + library(ggplot2) library(ggforce) @@ -15,7 +16,17 @@ catch_groups_name <- catch_name ) -timor.nutrients::nutrients_table %>% +tot_catch <- + timor.nutrients::region_stats %>% + dplyr::group_by(grouped_taxa) %>% + dplyr::summarise(catch = sum(catch)) %>% + dplyr::left_join(catch_groups_name) %>% + dplyr::select(-grouped_taxa) %>% + dplyr::select(catch_name, catch) %>% + dplyr::mutate(catch = catch / 1000) + +base_plot <- + timor.nutrients::nutrients_table %>% dplyr::left_join(catch_groups_name) %>% dplyr::select(catch_name, Selenium_mu:Vitamin_A_mu) %>% rename_nutrients_mu(hyphen = FALSE) %>% @@ -32,17 +43,86 @@ timor.nutrients::nutrients_table %>% ) %>% dplyr::mutate(rdi = (concentration * 100) / conv_factor) %>% dplyr::group_by(catch_name) %>% - dplyr::mutate(tot = sum(rdi)) %>% - ggplot2::ggplot(ggplot2::aes(rdi, reorder(catch_name, rdi), fill = nutrient)) + + dplyr::mutate(tot = sum(rdi, na.rm = T)) %>% + dplyr::arrange(-tot) %>% + dplyr::left_join(tot_catch) %>% + tidyr::replace_na(list(catch = 0)) + +tot_catch_plot <- + base_plot %>% + dplyr::group_by(catch_name) %>% # Added .drop false to ensure that all factor levels remain in plot + dplyr::summarise( + catch = dplyr::first(catch), + tot = dplyr::first(tot) + ) + + +ggplot2::ggplot() + ggplot2::theme_minimal() + - ggplot2::geom_col(alpha = 0.85) + + ggplot2::geom_col(base_plot, mapping = ggplot2::aes(rdi, reorder(catch_name, tot), fill = nutrient), alpha = 0.85) + + geom_text(base_plot, + mapping = aes(rdi, reorder(catch_name, tot), label = round(rdi, 2) * 100), + position = position_stack(0.5), + color = "white", + size = 3, + inherit.aes = FALSE + ) + + geom_text(tot_catch_plot, + mapping = aes(tot, reorder(catch_name, tot), + label = scales::comma(round(catch, 0), suffix = " t"), + alpha = catch + ), + size = 4, + nudge_x = 0.1 + ) + ggplot2::scale_fill_manual(values = 
timor.nutrients::palettes$nutrients_palette) + + ggplot2::scale_color_viridis_c() + ggplot2::scale_x_continuous(labels = scales::percent, n.breaks = 10) + ggplot2::labs(y = "", x = "Matched RNI from 100g portion", fill = "") + - ggplot2::theme(legend.position = "bottom") + ggplot2::theme(legend.position = "bottom") + + coord_cartesian(expand = FALSE, xlim = c(0, 1.55)) + + guides(alpha = "none") + ``` +```{r echo=FALSE, fig.height=8, fig.width=7, message=FALSE, warning=FALSE, out.width='80%', fig.cap="Distribution of nutritional content among different fish groups. This series of bar graphs delineates the contribution of various fish groups to the total nutrient stock, highlighting the top ten fish groups for calcium, omega-3, iron, protein, vitamin A, and zinc. Each graph is ordered to reflect the descending contribution of each fish group relative to each nutrient."} + +timor.nutrients::region_stats %>% + dplyr::group_by(grouped_taxa) %>% + dplyr::summarise(dplyr::across(dplyr::where(is.numeric), ~ sum(.x, na.rm = T))) %>% + tidyr::pivot_longer(-c(grouped_taxa, catch), names_to = "nutrient") %>% + dplyr::group_by(nutrient, grouped_taxa) %>% + dplyr::summarise(value = sum(value)) %>% + dplyr::arrange(-value, .by_group = TRUE) %>% + dplyr::slice_head(n = 10) %>% + dplyr::left_join(catch_groups_name) %>% + dplyr::select(-grouped_taxa) %>% + dplyr::select(catch_name, nutrient, value) %>% + dplyr::ungroup() %>% + dplyr::mutate( + nutrient = as.factor(nutrient), + catch_name = tidytext::reorder_within(catch_name, value, nutrient) + ) %>% + dplyr::filter(!nutrient == "selenium") %>% + dplyr::mutate( + nutrient = stringr::str_to_title(nutrient), + nutrient = dplyr::case_when( + nutrient == "Omega3" ~ "Omega-3", + nutrient == "Vitamina" ~ "Vitamin-A", + TRUE ~ nutrient + ) + ) %>% + ggplot(aes(value / 1000, catch_name, fill = nutrient)) + + theme_minimal() + + geom_col(alpha = 0.85) + + facet_wrap(. 
~ nutrient, ncol = 2, scales = "free") + + tidytext::scale_y_reordered() + + labs(x = "tons", y = "") + + theme(legend.position = "") + + scale_fill_manual(values = timor.nutrients::palettes$nutrients_palette) +``` + ## Habitat and gear type ```{r echo=FALSE, fig.height=8, fig.width=7, message=FALSE, warning=FALSE, fig.cap="Sankey diagram showing the relative distribution of key nutrients across various marine habitats and the corresponding extraction by different fishing gear types used in Timor-Est small-scale fisheries."} @@ -102,7 +182,7 @@ parallel_plot$y <- factor(parallel_plot$y, levels = c( )) parallel_plot %>% - dplyr::filter(!Nutrient == "Selenium") %>% + dplyr::filter(!Nutrient == "Selenium") %>% na.omit() %>% ggplot(aes(x, id = id, split = y, value = concentration_g)) + ggforce::geom_parallel_sets(aes(fill = Nutrient), alpha = 0.7, axis.width = 0.1) + @@ -122,4 +202,3 @@ parallel_plot %>% ) + labs(fill = "") ``` - diff --git a/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-10-1.png b/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-10-1.png index 5e0d20b..4be0f42 100644 Binary files a/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-10-1.png and b/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-10-1.png differ diff --git a/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-11-1.png b/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-11-1.png index 0e69bc5..5e0d20b 100644 Binary files a/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-11-1.png and b/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-11-1.png differ diff --git a/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-12-1.png 
b/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-12-1.png index a5570f2..5e0d20b 100644 Binary files a/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-12-1.png and b/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-12-1.png differ diff --git a/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-13-1.png b/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-13-1.png index fb0cd29..5e0d20b 100644 Binary files a/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-13-1.png and b/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-13-1.png differ diff --git a/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-16-1.png b/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-16-1.png index 496142f..1af8966 100644 Binary files a/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-16-1.png and b/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-16-1.png differ diff --git a/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-17-1.png b/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-17-1.png index 6b3bef7..5f14c72 100644 Binary files a/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-17-1.png and b/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-17-1.png differ diff --git a/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-18-1.png b/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-18-1.png new file mode 100644 index 0000000..d830721 Binary files 
/dev/null and b/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-18-1.png differ diff --git a/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-19-1.png b/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-19-1.png new file mode 100644 index 0000000..5e6cd20 Binary files /dev/null and b/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-19-1.png differ diff --git a/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-2-1.png b/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-2-1.png index 4c8b3d4..7f12c91 100644 Binary files a/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-2-1.png and b/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-2-1.png differ diff --git a/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-20-1.png b/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-20-1.png new file mode 100644 index 0000000..b23f703 Binary files /dev/null and b/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-20-1.png differ diff --git a/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-23-1.png b/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-23-1.png index 4438b5c..d3b55b7 100644 Binary files a/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-23-1.png and b/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-23-1.png differ diff --git a/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-24-1.png 
b/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-24-1.png index ce76702..4be0f42 100644 Binary files a/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-24-1.png and b/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-24-1.png differ diff --git a/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-25-1.png b/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-25-1.png index 149134c..5e0d20b 100644 Binary files a/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-25-1.png and b/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-25-1.png differ diff --git a/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-3-1.png b/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-3-1.png index d830721..5f14c72 100644 Binary files a/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-3-1.png and b/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-3-1.png differ diff --git a/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-4-1.png b/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-4-1.png index 5e6cd20..d830721 100644 Binary files a/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-4-1.png and b/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-4-1.png differ diff --git a/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-5-1.png b/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-5-1.png index b23f703..5e6cd20 100644 Binary files 
a/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-5-1.png and b/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-5-1.png differ diff --git a/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-6-1.png b/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-6-1.png index d830721..b23f703 100644 Binary files a/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-6-1.png and b/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-6-1.png differ diff --git a/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-7-1.png b/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-7-1.png index 5e6cd20..b23f703 100644 Binary files a/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-7-1.png and b/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-7-1.png differ diff --git a/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-8-1.png b/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-8-1.png index d3b55b7..b23f703 100644 Binary files a/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-8-1.png and b/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-8-1.png differ diff --git a/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-9-1.png b/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-9-1.png index 4be0f42..d3b55b7 100644 Binary files a/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-9-1.png and 
b/docs_book/Timor-nutrient-sensitive-fisheries-management_files/figure-html/unnamed-chunk-9-1.png differ