Skip to article frontmatterSkip to article content

Exploring Salinity Patterns in South Florida Coastal Waters

This notebook introduces a user-facing science workflow that explores salinity patterns in South Florida using curated buoy data published through the EnviStor smart data pipeline and made accessible via PelicanFS as part of the Open Science Data Federation (OSDF).

EnviStor is an AI-assisted, modular data pipeline developed at Florida International University (FIU) to process and publish environmental datasets. It automates tasks like file type detection, metadata generation, geospatial transformations, and dataset publication. For this use case, EnviStor ingested buoy data, processed and cleaned it, and published it through the OSDF federation, making it available to science users via PelicanFS.

EnviStor logo

Overview

This notebook introduces a real-world scientific workflow built around the question: “What are the salinity patterns in South Florida?” Using curated data from FIU’s buoy network—processed by the EnviStor smart pipeline and accessed through OSDF using PelicanFS, we explore how environmental datasets can be made reproducible, discoverable, and usable by domain scientists.

Below is a breakdown of what this notebook covers:

1. What is Salinity? A brief introduction to salinity, its units, and its role in coastal science.

2. Why South Florida? Context around why this region is important for salinity monitoring, including environmental and societal implications.

3. About the Dataset Description of the FIU CREST buoy network, the variables collected, and how the data is prepared through EnviStor.

4. Data Access with PelicanFS Explanation of how the dataset is accessed directly from OSDF using PelicanFS, and how that supports open science.

5. Research Question A clear framing of the central question that will be explored in the analysis notebook.

6. What’s Next? A preview of what readers will do in the second notebook: loading, analyzing, and visualizing the data.

By the end of this notebook, the reader should understand the scientific motivation, the data context, and how curated environmental data made available via OSDF can be leveraged in reproducible Jupyter-based workflows.

Prerequisites

To fully understand the context and content of this notebook, readers should be familiar with a few foundational concepts and tools. While this notebook itself is explanatory and doesn’t run any code, it introduces ideas and data structures that are implemented in the accompanying technical notebook.

Below is a table of key concepts and how important they are for this material:

ConceptsImportanceNotes
What is Salinity?NecessaryCore environmental concept used in the notebook
Intro to Buoy-based monitoringHelpfulUnderstanding how buoy data is collected
PelicanFS OverviewNecessaryExplains how the curated data is accessed from OSDF
What is OSDFHelpfulHigh-level context for how EnviStor data is shared
  • Time to learn: 20-30 minutes. (~5–10 minutes per external concept; extra time optional for those new to Pelican/OSDF)

What is Salinity?

Salinity measures how much salt is dissolved in water, typically in Practical Salinity Units (PSU).

Salinity influences ocean circulation, marine life, and freshwater availability. In coastal regions like South Florida, it’s a key indicator of environmental change, particularly for detecting saltwater intrusion into freshwater systems.

Why South Florida?

South Florida is particularly sensitive to salinity changes due to its mix of freshwater inflows, tidal dynamics, sea level rise, and storm surge events. Monitoring salinity helps scientists detect:

  • Saltwater intrusion into aquifers
  • Seasonal or storm-related shifts in water quality
  • Long-term climate-driven changes

About the Dataset

This dataset comes from FIU’s CREST buoy network, which continuously measures water quality parameters including salinity, temperature, turbidity, and more.

For this analysis, we selected three stations:

  • Buoy 2: NW Biscayne Bay
  • Buoy 3: Haulover Inlet
  • Buoy 3-2: Little River

These datasets were processed through the EnviStor smart pipeline, which cleaned, standardized, and published them into the Open Science Data Federation (OSDF). The data is now directly accessible using PelicanFS, a high-performance data access layer.

Data Access with PelicanFS

The curated data is available in the /envistor namespace on OSDF. We will access it directly in the next notebook using PelicanFS, a high-performance file system interface developed by the OSG and Pathfinders community.

No pre-downloaded files are needed — all data will be loaded live from OSDF via PelicanFS.

Research Question

We aim to explore:
“How does water salinity vary across time and space in South Florida coastal waters?”

This question is central to understanding environmental patterns related to:

  • Seasonality (e.g., wet vs. dry season)
  • Freshwater discharge (e.g., from canals or rivers)
  • Saltwater intrusion (e.g., due to sea level rise or storm surge)

To answer it, we will compare salinity data collected from three distinct coastal monitoring stations (NW Biscayne Bay, Haulover Inlet, and Little River) over a multi-month period. These stations span both urban and natural areas, helping us examine spatial variation.

By plotting and analyzing this data over time, we can begin to identify how salinity responds to both natural processes and human-driven impacts — providing insights useful for coastal management and long-term monitoring.


Summary

In this notebook, we explored the motivation, context, and scientific importance of analyzing salinity patterns in South Florida. We introduced the curated buoy datasets processed through the EnviStor smart data pipeline and made accessible via PelicanFS, part of the Open Science Data Federation. The reader now understands why salinity matters, how the data was collected and prepared, and what question the upcoming technical notebook will answer. This context sets the stage for a reproducible, user-facing environmental analysis powered by FAIR data infrastructure.

What’s Next?

In the next notebook, we will:

  • Load salinity datasets for three buoy stations (NW Biscayne Bay, Haulover Inlet, and Little River) directly from OSDF via PelicanFS.
  • Combine and clean the data: We’ll merge the datasets into a single time series, convert timestamps, label each station, and handle any missing or invalid salinity readings.
  • Resample to daily averages to reduce noise and highlight broader trends.
  • Visualize the results using line plots to show salinity over time for each location, making it easy to compare patterns across stations.
  • Analyze temporal and spatial patterns, such as seasonal variations, sudden shifts, or location-specific behaviors that may suggest environmental changes like saltwater intrusion or storm impact.

This notebook will give users a complete, reproducible workflow that demonstrates how curated environmental data can be analyzed using standard Python tools — all made possible by the EnviStor pipeline and OSDF infrastructure.