the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Synthesis of data products for ocean carbonate chemistry
Abstract. As the largest active carbon reservoir on Earth, the ocean is a cornerstone of the global carbon cycle, playing a pivotal role in modulating ocean health and regulating climate. Understanding these crucial roles requires access to a broad array of data products documenting the changing chemistry of the global ocean as a vast and interconnected system. This review article provides a comprehensive overview of 60 existing ocean carbonate chemistry data products, encompassing compilations of cruise datasets, derived gap-filled data products, model simulations, and compilations thereof. It is intended to help researchers identify and access data products that best align with their research objectives, thereby advancing our understanding of the ocean's evolving carbonate chemistry.
Competing interests: One of the co-authors, Anton Velo (Instituto de Investigacions Mariñas, IIM – CSIC, Vigo, Spain), is a member of the editorial board of Earth System Science Data.
Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this preprint. The responsibility to include appropriate place names lies with the authors.- Preprint
(935 KB) - Metadata XML
- BibTeX
- EndNote
Status: open (until 09 Jul 2025)
-
CC1: 'Comment on essd-2025-255', Kunal Chakraborty, 21 May 2025
reply
The review article ‘Synthesis of Data Products for Ocean Carbonate Chemistry’ offers a thorough and valuable summary of existing ocean carbonate data products, which will greatly benefit the scientific community. However, it overlooks two machine learning-based products that focus on improving surface pCO2 estimates in the Indian Ocean region.
The first is an ML-based climatological pCO2 data product recently developed and published in the journal Scientific Data for the Bay of Bengal region (https://www.nature.com/articles/s41597-024-03236-w) (Joshi et al., 2024). This data product integrates publicly available open-ocean observations with data from the Indian Exclusive Economic Zone. Given that the Bay of Bengal is a unique basin with very limited publicly accessible pCO2 observations, this high-resolution (~0.083°) climatological pCO2 data product represents a significant advancement in our understanding of pCO2 dynamics in the region. Therefore, it may be appropriate to include this product in Section 3.1.3 (i.e., Gridded and derived data products) of the manuscript to enhance its visibility and encourage its use within the scientific community.
The second is a hybrid data product that corrects long-term (1980–2019), high-resolution (~0.083° or 1/12°) modeled surface pCO2 for the Indian Ocean region (as a part of RECCAPv2) using cruise-based observations and an XGB algorithm. This product, available at https://www.nature.com/articles/s41597-025-04914-z (Ghoshal et al., 2025), falls under Section 3.1.6 (i.e., Model-based and hybrid data products and analysis) of this manuscript.
In this study, a machine learning (ML) approach is employed to correct biases in surface pCO2 simulations generated by the INCOIS-BIO-ROMS model (pCO2model) over the period 1980–2019. The ML model is trained using the differences between observed (pCO2obs) and modeled pCO2 to estimate the spatio-temporal deviations (pCO2obs − pCO2model). These interannually and climatologically varying deviations are then added back to the original model output, resulting in two improved data products: pCIBR_Int and pCIBR_Clim.
Evaluation against independent datasets, including moored observations (BOBOA), the gridded SOCAT product, and other ML-based pCO2 products (such as CMEMS-LSCE-FFNN and OceanSODA), demonstrates a significant improvement of approximately 40% ± 3.31% in RMSE compared to the original model. These corrected pCO2 products are expected to improve the accuracy of air–sea CO₂ flux estimates across the Indian Ocean from 1980 to 2019, helping to better identify key source and sink regions and enhancing our understanding of the Indian Ocean’s contribution to the global carbon budget.
Further, in Section 3.1.6 (i.e., Model-based and hybrid data products and analysis), you may also consider including the model-based dataset and analysis of ocean acidification in the Indian Ocean from 1980 to 2019, as presented by Chakraborty et al. (2024). The paper is available at: https://agupubs.onlinelibrary.wiley.com/doi/pdf/10.1029/2024GB008139. This study provides a comprehensive assessment of ocean acidification trends across the Indian Ocean and its sub-regions, utilizing outputs from a numerical model, an offline biogeochemical (BGC) model, and two machine learning-based products. Overall, the research consolidates the current state of knowledge on Indian Ocean acidification by integrating available field observations, reconstructed datasets, and model simulations.
References:
Joshi, A. P., Ghoshal, P. K., Chakraborty, K., & Sarma, V. V. S. S. (2024). Sea-surface p CO2 maps for the Bay of Bengal based on advanced machine learning algorithms. Scientific Data, 11(1), 384.
Ghoshal, P. K., Joshi, A. P., & Chakraborty, K. (2025). An improved long-term high-resolution surface p CO2 data product for the Indian Ocean using machine learning. Scientific Data, 12(1), 577.
Chakraborty, K., Joshi, A. P., Ghoshal, P. K., Baduru, B., Valsala, V., Sarma, V. V. S. S., Metzl, N., Gehlen, M., Chevallier, F., & Lo Monaco, C. (2024). Indian Ocean acidification and its driving mechanisms over the last four decades (1980–2019). Global Biogeochemical Cycles, 38(9), e2024GB008139. https://doi.org/10.1029/2024GB008139.
Citation: https://doi.org/10.5194/essd-2025-255-CC1 -
AC1: 'Reply on CC1', L.-Q. Jiang, 27 May 2025
reply
Citation: https://doi.org/
10.5194/essd-2025-255-AC1
-
AC1: 'Reply on CC1', L.-Q. Jiang, 27 May 2025
reply
Data sets
Surface Ocean CO2 Atlas Database Version 2024 (SOCATv2024) (NCEI Accession 0293257) Dorothee C. E. Bakker et al. https://doi.org/10.25921/9wpn-th28
Global Ocean Data Analysis Project version 2.2023 (GLODAPv2.2023) (NCEI Accession 0283442) Siv K. Lauvset et al. https://doi.org/10.25921/zyrq-ht66
Viewed
HTML | XML | Total | BibTeX | EndNote | |
---|---|---|---|---|---|
359 | 78 | 8 | 445 | 7 | 7 |
- HTML: 359
- PDF: 78
- XML: 8
- Total: 445
- BibTeX: 7
- EndNote: 7
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1