Published April 28, 2023 | Version v1
Dataset

Data for A method for assessment of the general circulation model quality using k-means clustering algorithm

  • 1. Tallinn University of Technology

Description

The dataset consists of simulated and observed salinity/temperature data which were used in the manuscript "A method for assessment of the general circulation model quality using k-means clustering algorithm" submitted to Geoscientific Model Development. 
The model simulation dataset is from long-term 3D circulation model simulation (Maljutenko and Raudsepp 2014, 2019).  The observations are from the "Baltic Sea - Eutrophication and Acidity aggregated datasets 1902/2017 v2018"  SMHI (2018). 
 
The files are in simple comma separated table format without headers. 
The Dout-t_z_lat_lon_Smod_Sobs_Tmod_Tobs.csv   file contains columns with following variables [units]: 
Time [matlab datenum units], Vertical coordinate [m], latitude [oN], longitude [oE], model salinity [g/kg], observed salinity [g/kg], model temperature [oC], observed temperature [oC].

The Dout-t_z_lat_lon_dS_dT_K1_K2_K3_K4_K5_K6_K7_K8_K9.csv  file contains columns with following variables [units]: 
4 first columns are the same as in the previous file,  salinity error [g/kg], temperature error [oC],  columns 7-8 are integers showing the cluster to which the error pair is designated. 

do_clust_valid_DataFig.m  is a Matlab script which reads the two csv files (and optionally mask file Model_mask.mat), performs the clustering analysis and creates plots which are used in Manuscript.  The script is organized into %% blocks which can be executed separately (default: ctrl+enter).

k-means function is used from the Matlab Statistics and Machine Learning Toolbox.

Additional software used in the do_clust_valid_DataFig.m:

Author's auxiliary formatting scripts script/
datetick_cst.m  
do_fitfig.m  
do_skipticks.m  
do_skipticks_y.m

Colormaps are generated using cbrewer.m (Charles, 2021).
Moving average smoothing is performed using nanmoving_average.m (Aguilera, 2021).

Other (English)

Refferences:

Aguilera, C. A.  V., 2021. moving_average v3.1 (Mar 2008) (https://www.mathworks.com/matlabcentral/fileexchange/12276-moving_average-v3-1-mar-2008), MATLAB Central File Exchange. Retrieved March 2, 2021.

Charles, 2021. cbrewer : colorbrewer schemes for Matlab (https://www.mathworks.com/matlabcentral/fileexchange/34087-cbrewer-colorbrewer-schemes-for-matlab), MATLAB Central File Exchange. Retrieved March 2, 2021.

Maljutenko, I., Raudsepp, U., 2019. Long-term mean, interannual and seasonal circulation in the Gulf of Finland—the wide salt wedge estuary or gulf type ROFI. Journal of Marine Systems, 195, pp.1-19. doi:10.1016/j.jmarsys.2019.03.004

Maljutenko, I., Raudsepp, U., 2014. Validation of GETM model simulated long-term salinity fields in the pathway of saltwater transport in response to the Major Baltic Inflows in the Baltic Sea. Measuring and Modeling of Multi-Scale Interactions in the Marine Environment - IEEE/OES Baltic International Symposium 2014, BALTIC 2014, 6887830. doi:10.1109/BALTIC.2014.6887830

SMHI 2018, Swedish Meteorological and Hydrological Institute (SMHI) (2018). Baltic Sea - Eutrophication and Acidity aggregated datasets 1902/2017 v2018. Aggregated datasets were generated in the framework of EMODnet Chemistry III, under the support of DG MARE Call for Tender EASME/EMFF/2016/006 - lot4. doi:10.6092/595D233C-3F8C-4497-8BD2-52725CEFF96B

Additional details

Related works

Is supplement to
Publication: 10.1016/j.jmarsys.2019.03.004 (DOI)
Publication: 10.1109/BALTIC.2014.6887830 (DOI)
Publication: 10.6092/595D233C-3F8C-4497-8BD2-52725CEFF96B (DOI)

Dates

Available
2021