My professional interests cover multiple areas of predictive analytics and modeling. A substantial portion of my work is related to supervised machine learning application, including Bayesian statistics. Generally, I focus on:
My dissertation research describes the development of statistical and mechanistic models in order to improve our ability of forecasting and eventually preventing eutrophication-driven water quality problems in estuarine and coastal systems. Eutrophication is an enrichment of natural waters with nutrients, as a result of human activities like excessive use of agricultural fertilizer or wastewater effluent. Eutrophication promotes harmful algal blooms and hypoxia (dissolved oxygen concentration < 2 mg/L) which damage fisheries, ecosystems and water supplies. According to NOAA, 65% of the estuaries and coastal waters in the US are moderately to severely degraded by the excessive nutrient inputs. The conservative estimated cost of damage from eutrophication only in the US freshwaters is approximately $2.2 billion annually.
I developed mechanistic and empirical water quality models for the Neuse River Estuary, located in southeastern North Carolina, USA. The models represent the dynamics of near-bottom dissolved oxygen and water column phytoplankton (chlorophyll a, a proxy for algal blooms), which are commonly used as indicators of water quality. The estuary has been experiencing hypoxia, algal blooms, and fish kills for over two decades. These problems has been attributed to an increased amount of nutrients appearing in the river due to rapid growth around the watershed.
First, I formulated process-based biophysical model to simulate the temporal and spatial variability of near-bottom dissolved oxygen. The complex interactions of hydrologic, meteorologic, and anthropogenic influences, affecting the oxygen levels, were described via differential equations. Model was calibrated within Bayesian framework, which enables probabilistic estimation of uncertain rate parameters based on prior scientific knowledge and comparison to historical data.
Model results reveal the influence of hydrometeorological factors on oxygen levels across daily-to-yearly time scales, demonstrating a strong linkage between the riverine inputs and year-to-year oxygen variability. Higher winter river discharge intensifies longitudinal velocities, leading to increased flushing and reduced time to produce and settle organic matter in the estuary, which elevates oxygen levels in summer. Results further indicate that while sediments are the dominant oxygen sink in the estuary, hypoxia can still be mitigated through seasonal nutrient loading reductions.
The probabilistic predictions of the model allow for finding the probabilities of achieving the increases in oxygen levels under multiple nutrient regulations. For example, given a 30% nutrient loading reduction, if samples are only collected twice per month (as is currently the case), it will take about 10 years to be 90% confident of producing an increase in mean May–October near-bottom oxygen.
The model naturally lends itself to be applied for forecasting hypoxia, given riverine flows and nutrient loads, which can be obtained through watershed modeling. I have been producing the annual summer Neuse hypoxia forecast since 2018. All available forecasts and their evaluation can be found below:
This study was published in 2019: Katin, A., Del Giudice, D., & Obenour, D. R. (2019). Modeling biophysical controls on hypoxia in a shallow estuary using a Bayesian mechanistic approach. Environmental Modelling & Software, 120, 104491.
Second, I built and compared hierarchical piecewise regression and mechanistic predictive models for testing hypotheses about biophysical factors driving the temporal and spatial variability of phytoplankton (as chlorophyll a). Parameters for empirical and mechanistic models were calibrated using Bayesian framework, where the inference was numerically implemented using Markov Chain Monte Carlo sampling, performed via Hamiltonian Monte Carlo (in Stan) and an adaptive Metropolis algorithms, respectively.
Hierarchical piecewise regression model with varying slopes and intercepts suggested that river discharge had a strong effect on chlorophyll a, with a change point depending on the magnitude of the flow. However, for nutrient management scenarios process-based models are preferable as they explicitly describe the biochemical mechanisms.
Mechanistic model results indicate that nitrogen is the major nutrient, limiting phytoplankton growth even under phosphorus loading reductions. Results also suggested that nutrient limitation was temporarily and spatially variable, being more pronounced during dry conditions. What is more, results indicate that nutrient concentration reductions, rather than total loading reductions, are the key to controlling estuary chlorophyll a levels. This process-based model also predicted that a 10% change in nitrogen loading (flow held constant) would produce an approximate 4.3% change in estuary chlorophyll a concentration, while hierarchical piecewise regression implied a larger (and unlikely realistic 10%) effect.
This study was published in 2021: Katin, A., Del Giudice, D., Hall, N. S., Paerl, H. W., & Obenour, D. R. (2021). Simulating algal dynamics within a Bayesian framework to evaluate controls on estuary productivity. Ecological Modelling, 447, 109497.
The important finding of this research is that implementing a 30% nutrient loading reduction for the Neuse River Estuary would help to reduce chlorophyll a concentration by 14% and decrease the amount of hypoxic days by 25%. These suggested nutrient loading reductions will help to meet compliance with state water quality standards and will decrease the ecosystem stress, potentially reducing fish mortality.
In this project I focused on two major tasks: 1) leveraging the existing Bayesian mechanistic model for forecasting dissolved oxygen levels in advance of hypoxic season for Northern Gulf of Mexico 2) using a Bayesian delta-lognormal approach to model distributions of Brown and White Shrimp for Louisiana shelf of the Northern Gulf.
A combination of anthropogenic and environmental factors controls the size the of the low dissolved oxygen (hypoxic) zone, occurring each summer in the Northern Gulf of Mexico. Hypoxia influences the distribution of commercially important finfish and shellfish, with implications for fisheries management. Most existing hypoxia forecasts for the Gulf are based only on spring nitrogen load and report only late-July hypoxic conditions.
However, temporally-resolved forecasts are critical for informing fisheries that are active across the summer season. I present a hypoxia forecasting system, which provides dissolved oxygen concentrations and hypoxic area size at daily resolution with up to four months of lead time. The underlying Bayesian mechanistic model allows for testing various representations of riverine and meteorological model inputs, explicitly distinguishing between different sources of uncertainty.
Results show that constraining summer riverine inputs based on spring conditions, including precipitation over the Mississippi River basin, can be used to improve hypoxia forecasting skill. Results also suggest that inclusion of spring wind data further enhances hypoxia forecasts. Overall, retrospective forecasts explain about 50% of the temporal variability in hypoxic area for two different sections of the Louisiana–Texas shelf of the Gulf (1985−2016). In addition, the forecasting approach explicitly quantifies the uncertainty associated with model parameters, variability in data inputs, and residual error, allowing for risk-based ecosystem management.
This manuscript was published in 2022: Katin, A., Del Giudice, D., Obenour, D.R. (2022). Temporally resolved coastal hypoxia forecasting and uncertainty assessment via Bayesian mechanistic modeling, Hydrology and Earth System Sciences, Volume 26.
Delta-lognormal approach, combining logistic (presence/absence) and multiple linear (non-zero) regressions, are developed for two shrimp species for the Louisiana–Texas shelf of the Northern Gulf of Mexico, USA. The models represent the distribution of Brown (Penaeus aztecus) and White Shrimp (Penaeus setiferus) and identify the influence of multiple environmental, water quality (including dissolved oxygen), and anthropogenic predictor variables. The survey data of both shrimp species are analyzed using Bayesian statistical models, which are built on findings from generalized additive models (GAMs). The manuscript is in preparation.
A list of all above-mentioned peer-reviewed publications are available on Google scholar. If you need a full text of any of my manuscripts, feel free to contact me, I will be happy to share.