Machine-learning framework to identify and validate biochemical regime clusters in the global blue carbon ecosystem

dc.contributor.author: Singh, Bhan
dc.contributor.supervisor: Popli, Navneet
dc.contributor.supervisor: Sima, Mihai
dc.date.accessioned: 2026-01-07T21:16:16Z
dc.date.available: 2026-01-07T21:16:16Z
dc.date.issued: 2025
dc.degree.department: Department of Electrical and Computer Engineering
dc.degree.level: Master of Applied Science MASc
dc.description.abstract: The Earth’s climate system is undergoing profound transformation, driven by changes in natural and anthropogenic stressors that disrupt environmental balance across land, air, and sea. Among these domains, the ocean stands as both a stabilizer and a sentinel, absorbing excess heat and carbon while revealing the earliest signs of ecological stress. Yet, the ocean itself is changing, shaped by interacting forces such as temperature, salinity, oxygen depletion, depth stratification, and biological productivity. Understanding how these stressors combine to reshape marine ecosystems requires not just observation but intelligent pattern recognition. This thesis approaches the problem as one of learning structure within complexity. Rather than relying on political boundaries or fixed geographic regions, it asks: can we allow the data itself to define the ocean’s natural divisions? Using in-situ observations from the World Ocean Database (WOD), a machine-learning framework was developed to uncover underlying biogeochemical regimes, clusters of ocean states defined by their physical and chemical signatures. Through careful preprocessing and hierarchical spatiotemporal imputation, the dataset was refined to reflect true environmental variability rather than sampling noise. The analysis employed multiple clustering algorithms to let ocean data “self-organize,” followed by classification models that validated and explained the separability of the discovered regimes. This hybrid approach revealed five coherent and interpretable patterns corresponding to familiar yet dynamically interconnected oceanic systems: productive coastal upwellings, oligotrophic gyres, polar waters, oxygen-minimum zones, and transitional open-ocean regimes. Together, these patterns tell a story of a living ocean, one organized not by political maps, but by the natural language of its own chemistry and biology.
By combining unsupervised discovery with supervised validation, the research demonstrates how global ocean observations can be transformed into quantitative, interpretable indicators of ocean health. The resulting framework contributes to the emerging vision of a digital twin ocean, a system where data, models, and machine learning work together to monitor, predict, and ultimately safeguard the resilience of the planet’s largest ecosystem.
dc.description.scholarlevel: Graduate
dc.identifier.uri: https://hdl.handle.net/1828/23057
dc.language: English (eng)
dc.language.iso: en
dc.rights: Available to the World Wide Web
dc.subject: Machine-learning
dc.subject: Ocean health
dc.subject: Clustering
dc.subject: Classification
dc.title: Machine-learning framework to identify and validate biochemical regime clusters in the global blue carbon ecosystem
dc.type: Thesis

Files

Original bundle
Name: Singh_Bhan_MASc_2025.pdf
Size: 4.69 MB
Format: Adobe Portable Document Format

License bundle
Name: license.txt
Size: 1.62 KB
Format: Item-specific license agreed upon to submission