Journal topic
Ocean Sci., 15, 1307–1326, 2019
https://doi.org/10.5194/os-15-1307-2019
Ocean Sci., 15, 1307–1326, 2019
https://doi.org/10.5194/os-15-1307-2019

Research article 07 Oct 2019

Research article | 07 Oct 2019

# Assessment of ocean analysis and forecast from an atmosphere–ocean coupled data assimilation operational system

Assessment of ocean analysis and forecast from an atmosphere–ocean coupled data assimilation operational system
Catherine Guiavarc'h, Jonah Roberts-Jones, Chris Harris, Daniel J. Lea, Andrew Ryan, and Isabella Ascione Catherine Guiavarc'h et al.
• Met Office, FitzRoy Road, EX1 3PB, Exeter, UK

Correspondence: Chris Harris (christopher.harris@metoffice.gov.uk)

Abstract

The development of coupled atmosphere–ocean prediction systems with utility on short-range numerical weather prediction (NWP) and ocean forecasting timescales has accelerated over the last decade. This builds on a body of evidence showing the benefit, particularly for weather forecasting, of more correctly representing the feedbacks between the surface ocean and atmosphere. It prepares the way for more unified prediction systems with the capability of providing consistent surface meteorology, wave and surface ocean products to users for whom this is important. Here we describe a coupled ocean–atmosphere system, with weakly coupled data assimilation, which was operationalised at the Met Office as part of the Copernicus Marine Environment Service (CMEMS). We compare the ocean performance to that of an equivalent ocean-only system run at the Met Office and other CMEMS products. Sea surface temperatures in particular are shown to verify better than in the ocean-only systems, although other aspects including temperature profiles and surface currents are slightly degraded. We then discuss the plans to improve the current system in future as part of the development of a “coupled NWP” system at the Met Office.

1 Introduction

Coupled systems are used in a wide range of applications (short- and medium-range forecasts, seasonal forecasts, climate prediction, and future scenario projections) and improving the initialisation of these systems can play a significant part to reduce the development of errors. Using separate atmosphere and ocean analyses to initialise a coupled system can result in an imbalanced system. The imbalance can cause an initialisation shock, which potentially increases the development of errors during the forecast. Using a coupled data assimilation (DA) approach has been shown to reduce this initialisation shock .

For many years, the Met Office has been running short-range prediction systems separately for the atmosphere and the ocean. Although seasonal forecasts use coupled models, they are initialised from separate uncoupled analyses. Since October 2016, the Met Office coupled atmosphere–land–ocean–ice data assimilation (CPLDA) system has been running operationally, and its ocean forecast and analysis have been delivered daily to the Copernicus Marine Environment Monitoring Service (CMEMS) since July 2017. The CPLDA system is the first operational Met Office system providing a seamless coupled analysis and forecast. It follows work done in the Met Office in recent years to highlight the impact of coupling for the short- to medium-range forecast and to develop a coupled DA system that could in the future replace uncoupled atmosphere and ocean short-range prediction systems. In , a large set of global coupled atmosphere–ocean–sea ice 15 d hindcasts were initialised from separate atmosphere and ocean–ice analyses. Compared to uncoupled forecasts, the coupled 15 d hindcasts show improved forecasting skill, especially in the tropics, a strongly coupled region where small changes in sea surface temperature (SST) can exert a major influence on patterns of convection and remote responses to diabatic heating. Both atmosphere and ocean biases benchmarked against control simulations suggest a more vigorous water cycle, improved large-scale circulation, enhanced convection, stronger teleconnections and somewhat improved representation of the Madden–Julian oscillations. They conclude that significant benefits should arise from the full integration of a coupled NWP system.

In recent years, several centres have developed operational coupled data assimilation systems. provide an overview of many of these efforts, showing the diversity of approaches available. For example, the European Centre for Medium-Range Weather Forecasts (ECMWF) has developed a weakly coupled ocean–atmosphere data assimilation system which is now operational. This system is shown to reduce forecast errors compared with forecasts initialised from uncoupled analyses. The National Centers for Environmental Prediction (NCEP) is using a coupled data assimilation system for seasonal and sub-seasonal predictions . At JAMSTEC, a low-resolution strongly coupled system was developed to be used for experimental seasonal and decadal prediction, while the US Naval Research Laboratory coupled model is initialised from separate analyses but uses a high-resolution ocean component ($\mathrm{1}/\mathrm{25}{}^{\circ }$). Similarly, the Canadian Centre for Meteorological and Environmental Prediction operational global coupled system uses uncoupled analyses but taking great care to maintain the consistency of surface boundary conditions between components in order to avoid initialisation shocks. show good forecast performance compared to the atmosphere-only counterpart of this system, with the most significant impact associated with decreased tropical cyclone intensification.

The approach taken to develop the CPLDA system is described in detail in . It is based on the existing coupled model developed for seasonal forecasting and climate prediction and on existing data assimilation systems for the ocean, sea ice, land and atmosphere. The data assimilation follows a “weakly coupled” approach. This means the coupled model provides background information for separate analyses in each subcomponent with the increments being added back into the coupled model. assessed the performance of the system against uncoupled control experiments for two short trials (December 2011 and June 2012). In this paper, we present results from the first long simulation run with the CPLDA system. We run the system for a year (February 2015 to January 2016) with 6 h analysis and daily 7 d forecast.

The Met Office Ocean Forecasting R&D group (OFRD) has been providing ocean analyses and forecasts from the Forecast Ocean Assimilation Model (FOAM) for many years. FOAM is an accurate system that verifies well against similar international systems , making it a suitable benchmark against which to assess the ocean component of the CPLDA system. In this study, we investigate the added value of an ocean analysis and forecast provided by a coupled system and try to understand the causes of the differences between the coupled and uncoupled systems. As the primary focus of this study is to assess the quality of the product delivered to CMEMS, our work focussed mainly on the assessment of the ocean component of the CPLDA system. To avoid duplication of previous work not every aspect has been investigated. For instance, we did not investigate the impact on the diurnal cycle already covered in . They investigated the impact of the initialisation on the diurnal cycle of SST by comparing to geostationary satellite data from SEVIRI. They looked at the diurnal SST range and showed that the initialisation of the coupled forecast has very little impact on this. showed that in the South Pacific the coupled model diurnal cycle errors against observations are somewhat amplified compared to the ocean-only control. However, the diurnal cycle is perhaps underestimated in the central North Atlantic .

A description of the CPLDA system and of the experimental set-up used to assess the system is presented in Sect. 2, together with a description of the differences with the FOAM configuration. The results are presented in Sect. 3. We focus on the assessment of the ocean analysis and forecast from CPLDA as well as comparison with the FOAM analysis and forecast and with the Mercator $\mathrm{1}/\mathrm{12}{}^{\circ }$ analysis (PSY4). Assessments of the differences at the air–sea interface between the coupled system and the uncoupled systems are also presented. Section 4 summarises the main results and provides discussions and plans for future work.

2 Description of the coupled data assimilation system and experiments

The weakly coupled atmosphere–land–ocean–ice data assimilation system (CPLDA) is built on the coupled system developed by . Details of the scientific and technical implementation of the system are described in their paper. The CPLDA system has been running operationally since October 2016 and is the first Met Office operational coupled forecasting system with complete consistency between the analyses and the forecast. The coupled forecast is initialised by coupled analysis for both ocean and atmosphere components. This continuity means that the atmosphere and ocean components are identical in the analysis and in the forecast. The previous coupled forecasting system was using uncoupled analyses from FOAM and NWP (numerical weather prediction) to initialise the GloSea coupled forecast . In that case both the atmosphere and ocean components differed between the analysis and the forecast. The different components of the CPLDA system are described below, and the differences with the FOAM system used for comparison in the next sections are highlighted.

## 2.1 Atmosphere component and surface forcing

In the CPLDA system, analysis and forecast fluxes are calculated using bulk formulae based on COARE3.0 within the Unified Model (MetUM) atmosphere at 40 km resolution and interpolated and passed to the NEMO ocean component by the OASIS3 coupler (Valcke2006). The FOAM system used interpolated atmospheric fields from the operational Met Office global NWP configuration of the MetUM (at 17 km resolution in 2015) with CORE bulk formulae to calculate the turbulent fluxes. The CPLDA atmospheric component is the GA6.0 atmospheric science configuration (also used by GloSea) for both the analysis and forecast, while the NWP configuration used to force FOAM was GA6.1 (note that this distinction mainly relates to aspects of the land surface treatment and is largely irrelevant from the ocean point of view). The CPLDA atmosphere data assimilation system is described in detail in . It uses an incremental strong constraint 4D-Var system similar to . One addition to the system described in is that the CPLDA atmosphere data assimilation now uses a variational bias correction (VarBC) to continuously update the bias correction applied to observations. VarBC (Lorenc2013) was not implemented operationally in the NWP system until March 2016; however, all the CPLDA experiments in this paper use VarBC.

Table 1The FOAM , CPLDA and GloSea ocean configurations.

## 2.2 Ocean component

The ocean configuration is described in detail below, and the CPLDA and FOAM ocean components are summarised in Table 1. Both the CPLDA and FOAM systems use the global ocean configuration GO5 . GO5 uses version 3.4 of the NEMO modelling system with the ORCA025 tripolar horizontal grid (which has a $\mathrm{1}/\mathrm{4}{}^{\circ }$ or 28 km horizontal grid spacing at the Equator, reducing to 7 km at high southern latitudes and 10 km in the Arctic Ocean) and is based on the configuration developed by Mercator Ocean. The vertical coordinate system is based on geopotential levels using the DRAKKAR 75-level set, which provides an increased near-surface resolution (including 1 m surface layers to help resolve shallow mixed layers and potentially capture diurnal variability) without compromising resolution at depth. The model bathymetry is DRAKKAR v3.3, which is based on the ETOPO1 dataset with additional data in coastal regions from GEBCO (General Bathymetric Chart of the Oceans; ). Partial cell thicknesses at the ocean floor allow for a better representation of ocean topography and, in combination with an energy- and enstrophy-conserving momentum advection scheme and a free-slip lateral momentum boundary condition, improve the mesoscale circulation, in particular the simulation of western boundary currents. The tracers are advected using a total-variation-diminishing (TVD) scheme (Zalesak1979) and a linear filtered free surface is used to remove high-frequency gravity waves. Tracer diffusion is Laplacian along isopycnals, and horizontal momentum diffusion is performed using a bi-Laplacian operator along geopotential levels. The vertical diffusion implemented in the CPLDA system is the turbulent kinetic energy (TKE) scheme of updated to ensure dynamical consistency in the space–time discretisations (Burchard2002). The background diffusivity and viscosity include a double diffusive mixing parameterisation and a parameterisation to attempt to model the mixing effect of Langmuir circulation. In CPLDA, both analysis and the forecast use a Haney retroaction to control sea surface salinities, a 3-D Newtonian damping towards a World Ocean Atlas 2001 climatology (to prevent long term drift of subsurface tracer fields) and a pressure gradient bias correction in the tropics to ensure temperature and salinity increments are retained by the model. The Haney flux correction (Haney1971) is applied to the sea surface salinity (SSS) based on the difference between the model and climatology; the magnitude of the restoring on SSS is −33.33 mm day−1 psu−1. A summary of the settings for the ocean component in FOAM and CPLDA is shown in Table 1. Most settings are identical except for necessary differences in the surface forcing and a shorter assimilation time window for CPLDA. The ocean data assimilation is described in more detail in Sect. 2.5.

## 2.3 Sea ice component

The sea ice model (CICE version 4.1; Hunke and Lipscomb2010) runs on the same ORCA025 grid as NEMO and with five ice thickness categories. The CICE model determines the spatial and temporal evolution of the ice thickness distribution (ITD) due to advection, thermodynamic growth and melt, and mechanical redistribution and ridging . The CPLDA system documented predates the development work described in and . Therefore, when running coupled to the Unified Model (MetUM) atmosphere, the CPLDA system uses the zero-layer thermodynamic model of , with a single layer of both ice and snow in CICE. The FOAM system, on the other hand, uses a five-layer thermodynamic model (four ice layers and one snow layer). Ice dynamics are calculated using the elastic–viscous–plastic (EVP) scheme of . In CPLDA, for both the analysis and forecast, the ice top and bottom conductive heat fluxes are calculated within the atmosphere model, interpolated by OASIS and then passed to NEMO from which they can be accessed by CICE. In GloSea, the heat fluxes calculated by the atmosphere model were also used, but in the FOAM analysis CICE used its own bulk formulation to specify surface boundary conditions.

In FOAM the freezing temperature is dependent on salinity to provide a more realistic representation of ice melting and freezing mechanisms and to give better consistency when assimilating both sea surface temperature and sea ice concentration. For technical reasons the initial coupled DA system assessed here used a fixed freezing temperature of $-\mathrm{1.8}{}^{\circ }$C, as did the GloSea system. However, salinity dependence was later introduced in CPLDA in September 2018. The GSI6 global sea ice configuration used in the CPLDA system is detailed in .

## 2.4 Ocean observations

The ocean observations assimilated into the CPLDA system are the same as in FOAM. These are as follows.

• Satellite SSTs are from subsampled level 2 data supplied by the Global High-Resolution Sea Surface Temperature (GHRSST) project comprising Advanced Very High Resolution Radiometer data (NOAA & MetOp), microwave AMSR-2 data and Visible Infrared Imaging Radiometer Suite (VIIRS) data. Note that daytime satellite SST data with a strong diurnal signal wherein the wind speed is less than 6 m s−1 are not used.

• In situ SSTs are from moored buoys, drifting buoys and ships (the in situ observations are considered unbiased and used as a reference for satellite SST bias correction).

• Sea level anomaly (SLA) observations are from the Jason-2, Jason-3, CryoSat-2, SARAL–AltiKa and Sentinel-3a platforms.

• Subsurface temperature and salinity profiles are from Argo profiling floats, underwater gliders, moored buoys, marine mammals and manual profiling methods.

• Sea ice concentration is from Special Sensor Microwave Imager/Sounder (SSMIS) data provided by the EUMETSAT Ocean Sea Ice Satellite Application Facility (OSI-SAF) as a daily gridded product on a 10 km polar stereographic projection.

Model fields are mapped into observation space using the NEMO observation operator to create nearest-time-step model counterparts at the observation location using bilinear interpolation in the horizontal and cubic splines in the vertical directions.

## 2.5 Ocean data assimilation

The CPLDA system is based on a weakly coupled data assimilation approach. The coupled model is used to provide background information for separate ocean, sea ice, atmosphere and land analyses. The increments generated from these separate analyses are added back into the coupled model .

For the ocean and sea ice DA both FOAM and CPLDA use NEMOVAR – an incremental 3D-Var, first guess at appropriate time (FGAT) assimilation scheme designed specifically for use with NEMO and further tuned at the Met Office for the $\mathrm{1}/\mathrm{4}{}^{\circ }$ global model . The state vector in NEMOVAR consists of temperature, salinity, surface elevation, sea ice concentration and horizontal velocities. Key features of NEMOVAR are the multivariate relationships, which are specified through a linearised balance operator , and the use of an implicit diffusion operator to model background error correlations . As detailed in , the NEMOVAR system includes bias correction schemes for both sea surface temperature (SST) and altimeter data (using the CNES-CLS09 mean dynamic topography (MDT) of , as a reference). The temperature and unbalanced salinity are assimilated using two horizontal correlation length scales following the method described in . FOAM and CPLDA run one outer loop and 40 iterations of the inner loop minimisation of the cost function. Note that the sea ice DA is run in a separate assimilation step to the ocean DA.

Analysis updates are made to the state variables in the NEMO model, with the exception of sea ice concentration updates, which are made in the CICE model. Updates increasing ice concentration are always made to the thinnest category ice, whilst updates decreasing ice concentration are made to the thinnest ice thickness category available in that grid cell . The snow thickness on ice is preserved where there is existing ice but initialised to zero where new ice is added to a previously ice-free grid cell .

A few technical changes to the FOAM ocean data assimilation system are needed to fit into the coupled framework. The first technical change made was to match the atmosphere cycling. In the FOAM system, the observation operator and incremental analysis update (IAU) are performed in separate NEMO runs. In CPLDA, this has been altered such that the IAU is followed by observation operator in one combined model run. The second major change from FOAM to CPLDA was to reduce the assimilation time window of the ocean from 24 to 6 h to match the time window of the atmosphere (see Lea et al.2015). The third change was to reduce the period over which the increment is added to the ocean model in the IAU from 24 to 3 h. This choice is in part to ensure that on the 00Z cycle (with window from 21Z to 03Z) the full increments have been added to the ocean state by 00Z. Hence these ocean restarts can subsequently be used, in exactly the same way as 00Z ocean restarts from FOAM, by groups running initialised coupled forecasts. It is also important to note that differences in observation cutoff due to differences in operational scheduling impact the number of observations assimilated by each system.

No specific tuning has been made for the coupled system. The error covariances used in CPLDA were calculated in the context of the uncoupled FOAM system with a 24 h cycle. It is expected that we would improve the DA results by recalculating the error covariances for a coupled model and for a 6 h window (this is planned for future work).

## 2.6 Experiments

In order to assess the CPLDA system, we ran a 13-month calibration period from 1 January 2015 to 31 January 2016 with 6 h analysis and daily 7 d forecast. To allow parallel running, the 13 months were realised in two sections, each starting from “uncoupled” initial conditions from the FOAM and NWP operational analyses. The first section was from 1 December 2014 to 30 June 2015, whilst the second section was from 1 June 2015 to 31 January 2016. For both sections, the first month is taken as a spin-up period and discarded. No discontinuity has been observed between June 2015 from the first section and July 2015 from the second section, so all assessments undertaken consider the whole of 2015. Every 6 h, a delayed “best analysis” runs 24 h behind real time with a catch-up to real time only required to launch the 7 d forecasts on the 00Z cycle. For running delayed time trials, like that for 2015 described in this paper, the operational availability of observations is replicated by using receipt time information when they are extracted from the Met Office observations database (MetDB). In the following sections, we assess the CPLDA system against the ocean-only FOAM system for the 2015 period. FOAM has been used as the Met Office operational ocean forecasting system for many years. It uses atmospheric forcing from the Met Office NWP operational system and is described in detail in . Differences between the CPLDA and FOAM ocean configurations are shown in Table 1. There are a few differences between the CPLDA system assessed in the majority of this paper and the final operational implementation. As the systems were running for different periods, observations not available or not yet operationally implemented in 2015 (Jason-3 and Sentinel-3 sea level anomaly data and VIIRS sea surface temperatures) are only assimilated in the final operational version. More significantly, the scheduling of the operational system has been modified to allow additional observations to be assimilated. The result of these changes is that the best analysis runs further behind real time on the 06Z, 12Z and 18Z cycles (42, 36 and 30 h, respectively, compared to 24 h previously). As for the 2015 trial the catch-up to real time is only required to launch the forecast (operationally this is now 10 d) once per day on the 00Z cycle. Where any results are shown from the CPLDA system running with this modified scheduling (as opposed to that used in the 2015 trial), this is clearly indicated.

Additional experiments have been undertaken using the ocean-only FOAM system to look at the impact of running the data assimilation with a shorter assimilation window. The FOAM system was run in hindcast mode (forecasts were not generated) for 6 months from 1 March to 31 August 2018 with the analysis length and assimilation window set to 6 and 24 h. These experiments should decompose the effect of the shortened assimilation window from the effect of the coupling and aid in the interpretation of the comparisons to the FOAM system described herein.

3 Results

In this section, we present the results from a 1-year assessment of the CPLDA system from February 2015 to January 2016. A 1-year experiment is required to obtain representative results, and longer experiments are complex to run because of the computational cost and the constant evolution of the observation network, particularly for the atmosphere component. We assess the CPLDA ocean component against observations and other benchmark operational ocean analyses and forecasts (Met Office FOAM and Mercator $\mathrm{1}/\mathrm{12}{}^{\circ }$ PSY4 ). “Class 4” metrics are widely used to assess the accuracy of ocean forecasting systems. They are statistics of the differences between oceanic observations (in situ or satellite) and their model (forecast or analysis) equivalent at the time and location of the observation .

In the following subsections, we present in detail the results from the assessment of the sea surface height, sea ice, sea surface temperature, three-dimensional temperature, mixed layer depth (MLD) and the currents at 15 m. Although this paper largely presents results from the ocean component of the coupled system, we also discuss the impact of differences in surface forcing between CPLDA and FOAM and how these relate to the atmospheric configurations used in the two systems.

## 3.1 Sea surface height

CPLDA 2015 and FOAM sea surface height are assessed against observations using class 4 statistics . The observations used are provided by CMEMS and include data from the AltiKa, CryoSat-2 and Jason-2 satellites. Altimeter bias correction is applied to the observations. For each model comparison against the satellite observations, the model's own altimeter bias is used. This is important because the altimeter bias contains information from the model mesoscale, so correcting observations using the altimeter bias from one model to assess a second model penalises this second model. Figure 1 shows a time series of the sea level anomaly (SLA) difference statistics assessed against CMEMS observations. In the 2015 experiment with the original scheduling, the CPLDA SLA root mean square error (RMSE) is significantly larger than the FOAM SLA RMSE (Fig. 1a). The larger RMSE in CPLDA can be attributed to the difference in the number of SLA observations assimilated. Figure 2 shows the number of observations assimilated daily by both systems in 2015; the number assimilated by CPLDA is significantly smaller than the number assimilated by FOAM. Differences in scheduling can explain this; the CPLDA 2015 experiment best analysis runs earlier in the day than the comparable FOAM analysis, so fewer observations are available. Following these results, the scheduling of the CPLDA operational system was updated in April 2018. The best analyses at 06Z, 12Z and 18Z are now delayed allowing more observations to be assimilated, as already described in Sect. 2. This change, along with a change in the Met Office database (MetDB), allowing a more frequent ingestion of SLA observations, has resulted in a significant reduction of the CPLDA RMSE (see Fig. 1b). The experiments using the FOAM system to look at the effect of the shorter assimilation window in CPLDA showed no significant difference in SLA statistics between the 6 and 24 h systems.

Figure 1SLA class 4 statistics with respect to the CMEMS satellite product (Jason2, CryoSat-2 and AltiKa) for CPLDA and FOAM. (a) For the long CPLDA experiment in 2015 with the old scheduling. (b) From the operational systems for 2018; the CPLDA scheduling was updated to allow more observations to be assimilated in April 2018.

Figure 2Daily number of SLA observations assimilated in the FOAM operational system for 2015 (black line, foam_opfc) and in the CPLDA 2015 experiment (blue line, cplda).

## 3.2 Sea ice

Figure 3 shows the sea ice extent and volume, respectively, for CPLDA and FOAM best analyses. It shows that the sea ice extent is similar between CPLDA and FOAM and is also comparable to OSTIA. Figure 3 shows that the sea ice volumes are similar in the Arctic, but in the Antarctic CPLDA has a reduced volume compared to FOAM in the Southern Hemisphere winter and spring. Differences in volume can be attributed to differences in sea ice thickness probably caused by differences in freezing temperature treatment. Figure 4 shows the sea ice extent forecast for the melting season in the Arctic (August) and Antarctic (February) for CPLDA and FOAM. Both models have the tendency to melt too much ice during the forecast, especially in the Arctic; in the Antarctic the exaggerated melting in CPLDA is slightly reduced compared to that observed in FOAM. The differences in freezing temperature treatment referred to in Sect. 2.3 may have impacted the sea ice simulation.

Figure 3Time series of sea ice extent and sea ice volume for CPLDA (2015 experiment) and FOAM.

Figure 4Sea ice extent forecast for CPLDA (2015 experiment) and FOAM during the melting season (February in the Antarctic and August in the Arctic).

## 3.3 Sea surface temperature

We present class 4 statistics against in situ drifting buoy observations provided by US GODAE. The model SST is defined to be the temperature of the top ocean model grid box, which is at a depth of 0.5 m. The RMSE and mean bias statistics for the CPLDA analysis and forecast are shown in Fig. 5 and compared to FOAM, PSY4 and GloSea statistics. A small cold bias (−0.02 K) is present in CPLDA in the analysis but does not increase during the forecast. In comparison, the coupled forecast from the GloSea system initialised from the uncoupled system exhibits a warm bias significantly increasing during the forecast (+0.09 K after 132 h; Fig. 5a). CPLDA is the only system showing a cold bias; the two uncoupled systems FOAM and PSY4 have a small warm bias. Experiments investigating reducing the assimilation window from 24 to 6 h in the FOAM system show that the difference observed in bias is unlikely to be due to the shorter assimilation window. The cold bias is therefore more likely related to the differences in heat fluxes between the systems described in Sect. 3.5.

The SST RMSE is reduced in the CPLDA analyses compared to the FOAM analyses (this is also observed for the 12 h forecast). Between the 12 and 36 h forecasts the rate of increase of RMSE is greater in CPLDA than in FOAM so that at a forecast length of 36 h and beyond the RMSE is similar in both systems (Fig. 5a). The shorter data assimilation window in CPLDA (6 h) compared to FOAM (24 h) explains the improved SST analysis RMSE in CPLDA. Experiments investigating reducing the assimilation window in the FOAM system improved SST analysis statistics (both RMSE and bias). This supports the results of , who suggested that with a shorter cycle the SST model errors have less time to grow. The increase in RMSE between the 12 and 36 h forecasts in CPLDA could be due to the analysis overfitting the observations. The analysis fitting the observations too closely improves the statistics for the analysis, but as a consequence there is a quicker degradation during the forecast. Future work is required to assess the magnitude of this overfitting by withholding a subset of observations from the data assimilation (these observations can then be used as an independent validation dataset). The RMSE for analysis SSTs in PSY4 is significantly larger than CPLDA, but unlike CPLDA and FOAM, PSY4 does not assimilate SSTs from drifting buoys directly; instead it assimilates the near-real-time Operational Sea surface Temperature and sea Ice Analysis (OSTIA) , in which the drifting buoys have been assimilated. Furthermore, when using grid point verification PSY4 statistics may suffer from a “double penalty” whereby higher-resolution models are often penalised compared to coarser-resolution models for missed or misplaced events or false alarms.

The SST RMSE from all models exhibits a seasonal cycle (Fig. 5b) with an increased RMSE during the Northern Hemisphere summer. This increase can be attributed to the Northern Hemisphere bias of the observing network in terms of the number of observations. During the Northern Hemisphere summer the mixed layer depth is shallow, which leads to a more responsive mixed layer; thus, SST variability is increased, which is reflected in the increase in the global RMSE during these months.

The CPLDA SST analysis is also assessed via comparison with OSTIA (Fig. 6a). The global average CPLDA SST is warmer than OSTIA, with the difference being 0.06 K. This difference corresponds to a known cold bias in the OSTIA analysis relative to independent top-level Argo observations , so it could illustrate a potential bias in OSTIA. However, CPLDA SSTs (from the top 1 m thick model layer) are also expected to differ from OSTIA because they will capture some of the diurnal cycle, while OSTIA is a foundation SST free of diurnal warming. Figure 6b shows the difference between CPLDA and FOAM SST analysis annual means. CPLDA SST is generally colder than FOAM, which corresponds to the biases observed in Fig. 5. The differences between the two systems are much smaller than that observed between OSTIA and CPLDA as expected due to similarity in the ocean models and data. Bimodal differences are observed in the areas with a high SST gradient, such as the western boundary currents, Antarctic Circumpolar Current and in the Zapiola Rise region. These differences are due to differences in the position of SST fronts rather than due to the ability of the two analyses to resolve mesoscale features. To investigate mesoscale feature resolution we calculated the SST spectral power using the method described in . The SST spatial variability for each of the experiments was determined by wavenumber auto-spectra analysis. All spectra results were calculated on the native grid of the analysis (ORCA025). For each region of interest, the spectra were calculated along the horizontal coordinates closest to a given latitude. The power spectral density of each latitude was calculated using the Welch's average periodogram method, with 28 degrees of freedom and a Hanning window. The overlapping length for each band averaging was half the band length (Fiedler2018). The results for CPLDA and FOAM are very similar; in both systems the effective SST resolution is around 50 km.

On average, the CPLDA temperature increment at the surface is negative (Fig. 6c), meaning that the data assimilation is cooling the model at the surface; this is also an issue in FOAM to a lesser degree and may be due to the underestimation of the wind stress described in Sect. 3.5. The largest increments (both positive and negative) are observed in regions of enhanced SST variability such as the Gulf Stream, Kuroshio current and the Antarctic Circumpolar Current. Large negative increments are also applied in the tropics in regions of tropical instability waves. In the tropical Pacific, the imprint from the Tropical Atmosphere Ocean (TAO) mooring array is clearly visible. Figure 6d shows the difference in temperature increment at the surface between CPLDA and FOAM. Bimodal differences are observed in the regions of enhanced SST variability and at the location of the TAO moorings. The degree to which the increments (and therefore the data assimilation) are responsible for the differences in SST between the two systems shown in Figs. 5 and 6b varies regionally, but large-scale similarities are seen in the pattern of the differences.

Figure 5SST (model minus observation) class 4 statistics with respect to drifting buoys: (a) RMSE and mean biases at various lead times; (b) time series of RMSE and mean biases in the “best analysis” from each system.

Figure 6CPLDA SSTs in 2015 compared to (a) OSTIA (0.06 K) and (b) FOAM (−0.03 K); (c) mean SST increments in 2015 for CPLDA (−0.04 K); (d) mean SST increments in 2015 for CPLDA compared to FOAM (−0.01 K). Values in parentheses are averages for the global ocean.

## 3.4 Temperature and mixed layer depth

The temperature of CPLDA is assessed against Argo profile observations provided by CMEMS . The class 4 global temperature statistics for the best analysis are presented in Fig. 7a. The results for the forecast (not shown) are similar to the best analysis, with the mean bias staying unchanged and the RMSE slightly increasing from 0.67 K at 12 h to 0.73 K at 108 h. CPLDA has a cold bias in the subsurface which is maximum (around 0.1 K) at 10 m and present in the thermocline down to approximately 50 m. A smaller warm bias is present at around 100 m, but this is over a greater depth range so it represents a large amount of heat. A subsurface cold bias is present in the ocean-only FOAM system but with a smaller amplitude. For the CPLDA results shown, the number of profile observations being assimilated was smaller than in FOAM due to differences in the scheduling of assimilation cycles. Even in tests with the operational scheduling (referred to in Sect. 2.6) the subsurface bias in CPLDA persists despite the increased number of observations being assimilated. In both CPLDA and FOAM the temperature RMSE is largest in the thermocline, which highlights its variability; the RMSE is larger in CPLDA than in FOAM.

The temperature profile can be affected by the vertical propagation of surface temperature increments through the water column. show that the succession of positive and negative temperature increments has an asymmetric effect on the vertical temperature structure due to the way the temperature increment at the surface is propagated to the bottom of the mixed layer. A negative surface increment weakens the stratification and so deepens the mixed layer; this means that a subsequent positive surface increment is projected deeper. Experiments looking at the assimilation time window in an ocean-only system show that the shorter assimilation cycle in CPLDA (6 h) relative to FOAM (24 h) leads to increased temporal noise in the increments and a deepening of the mixed layer.

In CPLDA the large negative increment applied at the surface (Fig. 7b) is propagated down to approximately 50 m; below this a small warm increment is applied down to approximately 150 m. The fact that the negative increment is projected deeper in CPLDA than in FOAM and the dipolar structure in the vertical is consistent with idealised experiments into the vertical propagation of temperature increments (Robert R. King, personal communication, 2018). CPLDA has a larger negative increment at the surface than FOAM. Below approximately 200 m the magnitude of the average temperature increment is small (Fig. 7b) and the increments applied by CPLDA and FOAM are similar. The differences observed in global average temperature increments are reproduced in the ocean-only assimilation time-window experiments comparing a 6 h to a 24 h window. It is therefore likely that the differences observed in the global temperature increment are due to the shorter assimilation window in CPLDA.

Figure 7(a) CPLDA and FOAM global average mean bias and RMSE temperatures compared to Argo observations using class 4 methodology; (b) global average mean temperature increments applied in CPLDA and FOAM.

Mixed layer depth statistics from the analysis confirm that CPLDA has a deeper MLD than the assimilated profile observations. The MLD statistics presented here use the Kara mixed layer depth with density-based criteria. CPLDA mean error against the assimilated observations is 5.2 m too deep, while the RMSE is 34.7 m. Similarly, the MLD in FOAM is deeper than the observations but both the mean error (2.1 m too deep) and the RMSE (32.6 m) are reduced. These statistics confirm that the CPLDA MLD is deeper than FOAM (Fig. 8). During the forecast MLD RMSE and bias persist for both systems (not shown). Experiments running FOAM with a 6 h assimilation time window show that the shorter window results in an over-deepening of the MLD by 2.9 m relative to that using a 24 h window; this is consistent with the 3.1 m difference observed between CPLDA and FOAM. The change in MLD due to the shorter assimilation window may be caused by the asymmetric effect of the surface temperature increment on vertical temperature structure. The over-deepening of the mixed layer will lead to a misplacement of the thermocline, which will contribute to the larger RMSE in the thermocline in CPLDA than in FOAM observed in Fig. 7a. It is worth noting that differences in wind stress between CPLDA and FOAM will also affect the MLD.

Figure 8Annual mean Kara mixed layer depth in metres (, using a temperature criterion of 0.8 C): (a) CPLDA and (b) the difference between CPLDA and FOAM. Positive value means CPLDA MLD is deeper than FOAM MLD.

Experiments support the shorter assimilation window in CPLDA causing the differences observed in the global increments (Fig. 7b) and in MLD (Fig. 8) between CPLDA and FOAM. However, these experiments were not able to reproduce the subsurface bias observed in CPLDA relative to FOAM (Fig. 7a). We are currently unable to attribute this bias to the shorter assimilation window. This may be due to the experiments not being long enough (6 months) or more likely that the bias is a result of the differences in forcing between CPLDA and FOAM described in the next section. Future work is planned to look in depth at the vertical propagation of surface increments using a 1-D model and may aid in the attribution of the observed subsurface bias.

## 3.5 Atmosphere and surface fluxes

In this section, we assess the atmosphere component of CPLDA, focussing on the interface with the ocean, in particular the surface fluxes. We compare CPLDA surface fluxes to FOAM over the ocean; we have not yet investigated the fluxes over the ice. The differences in net total heat flux are shown in Fig. 9. On average, CPLDA receives less heat than FOAM (−3.9 W m−2). CPLDA receives less (or loses more) heat in the tropics than FOAM except in the Gulf of Guinea and in the East Pacific along the Peruvian and Chilean coast. At higher latitude, CPLDA loses less heat than FOAM, especially in the regions of large latent heat loss (Kuroshio and Gulf Stream). The reduced heat gain compared to FOAM between 30 S and 30 N contributes to the colder SST observed in CPLDA. The differences in net heat flux compared to FOAM are mainly due to differences in shortwave radiation and particularly in latent heat (Fig. 9). The small-scale pattern differences are due to differences in the representation of the mesoscale between the two models (eddies are not completely constrained by the observations and so are not located in exactly the same positions).

Figure 9Annual mean net downward heat flux differences (W m−2): (a) total heat flux difference between CPLDA and FOAM (−3.9 W m−2), (b) latent heat flux difference caused by using COARE3.0 bulk formulae in CPLDA rather than CORE as in FOAM (+9.61 W m−2), (c) shortwave radiation difference between CPLDA and FOAM (+1.05 W m−2), and (d) latent heat difference between CPLDA and the atmosphere-only NWP system, both of which are using COARE3.0 bulk formulae (−14.5 W m−2). In brackets are the average values for the global ocean.

Figure 10Annual mean wind stress magnitude (N m−2): (a) CPLDA, (b) FOAM, (c) wind stress difference between CPLDA and FOAM, and (d) wind stress difference caused by using COARE3.0 bulk formulae in CPLDA rather than CORE as in FOAM.

The differences in latent heat can be decomposed into two components. First, the atmosphere component in CPLDA differs from the atmosphere model (NWP) used to force FOAM. Figure 9d shows the difference between CPLDA and NWP latent heat. The NWP latent heat is calculated using the same bulk formulae as CPLDA but is using OSTIA SSTs as a surface boundary condition. This shows that even with the same bulk formulae the evaporation is significantly higher in CPLDA than in NWP. In CPLDA, the air at 10 m is drier than in NWP, causing increased evaporation and increased latent heat loss. The signal is stronger in the tropics and contributes to the differences in total heat flux seen between CPLDA and FOAM, with FOAM losing less heat than CPLDA in the tropics.

Secondly, there are differences in latent heat due to the use of different bulk formulae in CPLDA and FOAM. To investigate the difference caused by the bulk formulae, we recalculated CPLDA fluxes using CORE bulk formulae as used in FOAM. Figure 9b shows the differences in latent heat when using CORE formulae. At high latitude the latent heat loss is increased, with the largest differences observed in the regions with large latent heat loss. On the other hand, in the equatorial band, the latent heat loss is reduced with CORE formulae. The impact of the bulk formulae on the latent heat calculation is significant and contributes to the differences in total heat flux. An impact is also observed on the sensible heat but with a much smaller magnitude (not shown). The addition of the differences in latent heat caused by having a different atmospheric state (−14.5 W m−2 Fig. 9d) and those caused by using different bulk formulae (+9.61 W m−2 Fig. 9b) can explain much of the difference seen in the net heat flux into the ocean between CPLDA and FOAM (Fig. 9a).

The sum of the latent heat differences and shortwave differences equals −3.84 W m−2 , close to the difference in net total heat flux (−3.9 W m−2).

Air–sea momentum flux is a crucial forcing for the ocean as it is through the wind stress that the atmosphere drives the ocean. The CPLDA wind stress magnitude annual mean is shown in Fig. 10a. The wind stress in CPLDA is stronger than in FOAM almost everywhere. The differences in wind stress between CPLDA and FOAM are mainly caused by the different bulk formulae. Indeed, when recalculating CPLDA wind stress using the same (CORE) bulk formulae as FOAM, the differences between CPLDA and the new calculated wind stress are similar to the differences between CPLDA and FOAM (Fig. 10c and d). The new wind stresses are calculated with CORE bulk formulae but using CPLDA wind speed and surface currents. This highlights the importance in the choice of bulk formulae for the wind stress calculation. However, we note that it is not equivalent to running CPLDA using CORE bulk formulae, as reduced wind stress would imply feedback to the atmosphere causing an increase in wind speed and therefore limiting the reduction in wind stress.

Because of the large differences caused by the bulk formulae, comparing the differences in wind stress is equivalent to comparing the different drag coefficient used. highlighted disagreement in the drag coefficient between different bulk formulae, with the COARE 3.0 algorithm producing higher wind stress. They mention that the latest improvements in the COARE algorithm (version 3.5) suggest that the drag coefficient of COARE3.0 is likely too small in strong wind conditions. This suggests that the wind stresses calculated in CPLDA may be underestimated at times, although it is improved compared to those used by FOAM.

## 3.6 Velocities

Predictions of ocean currents are important for marine activities. They are used for a number of practical applications such as ship routing, marine search and rescue, pollution monitoring, the offshore oil and gas industry, and marine renewable energy. To assess the velocities at 15 m, we compare CPLDA to measurements from drifters. The observations are independent as no ocean current observations are assimilated by the system. We use an in situ delayed-mode product from CMEMS (Etienne2017); this is designed for reanalysis purposes with the best available version of in situ data for ocean surface currents. The data are collected from the Surface Drifter Data Assembly Centre (SD-DAC at NOAA AOML). All surface drifter data are processed to check for drogue loss, and a wind slippage correction is applied to undrogued buoys. The wind slip correction is computed following Rio (2012). We compared 15 m model velocities against 15 m CMEMS observations corrected with wind slippage for the year 2015. Despite a limited number of observations compared to SST observations, the observation coverage from drifters is generally good except in the tropical Atlantic.

The class 4 statistics for the 15 m velocities for the global ocean are presented in Fig. 11. On average the CPLDA analysis has a negative bias (−0.07 m s−1) and an RMSE of 0.19 m s−1. In all regions except the tropical Pacific, both the bias and the RMSE are stable during the forecast (not shown). The bias in the uncoupled system analyses (FOAM and PSY4) is the same as CPLDA and the RMSE is close (0.18 for FOAM; 0.17 for PSY4). The larger RMSE in the CPLDA analysis is mainly due to periods of large RMSE in April and November 2015 in the tropical Pacific (see Fig. 11). In regions other than the tropical Pacific no significant differences are observed, with CPDLA and FOAM statistics similar. The increased RMSE in the analysis is caused by spurious currents in the western tropical Pacific (Fig. 12). During the forecast, the currents weaken and the RMSE decreases, suggesting they are caused by the data assimilation. The shorter data assimilation window in CPLDA limits the number of observations and causes noisier increments; in addition, a reduced total number of SLA observations is assimilated in CPLDA compared to FOAM. We re-ran the CPLDA system for a month from 15 October 2015 with the updated scheduling, allowing more observations to be assimilated. In this experiment, the RMSE in the analysis is significantly reduced and now similar to FOAM and PSY4 (see shaded area in Fig. 11). The spurious currents north of Indonesia are suppressed (Fig. 12). However, in the current CPLDA operational system (running with the updated scheduling, allowing more observations to be assimilated), there are still periods with unrealistic currents developing in the western tropical Pacific (not shown). These unrealistic currents caused by SLA assimilation are not present in the FOAM system, which uses the same assimilation scheme and assimilates the same observations as CPLDA. This strongly suggests that they are caused by the shorter assimilation window in CPLDA.

Tuning the error covariances for the shorter time window may reduce or eliminate the above problem. Smaller estimated background errors would give the observation less weight in the analysis and lead to smaller, less noisy increments. Future work investigating re-estimating the background error covariances is planned. The updated CNES-CLS13 mean dynamic topography of is significantly improved, particularly around the Maritime Continent, compared to the CNES-CLS09 version used in these experiments. Hence, using a newer MDT in the CPLDA system may also reduce this issue.

Figure 11The 15 m velocity using class 4 statistics with respect to velocities derived from drifting buoys for 2015. RMSE and mean bias are shown for CPLDA and FOAM, as well as PSY4 and, for a short test period, CPLDA using the operational scheduling, allowing for the assimilation of additional SLA observations.

Figure 1215 m velocity average (m s−1) in the western tropical Pacific for the week from 5 to 11 November 2015 for (a) CPLDA, (b) FOAM and (c) CPLDA with the current operational scheduling, allowing for the assimilation of additional SLA observations.

Table 2Velocity class 4 statistics against drifting buoys for 2015 (m s−1).

Despite the mean bias and RMSE values, CPLDA velocities are only moderately correlated with observations (0.50). This correlation is weaker than in FOAM (0.55) and PSY4 (0.62). Only PSY4 is skilful using the definition that a correlation greater than 0.6 is an indication of a skilful forecast . Correlation varies substantially from region to region (Table 2). In some regions, CPLDA correlation is near or above 0.6, as in the North Pacific, while in the North Atlantic the correlation is poor (0.10). All the models have skilful correlation in the North Pacific but poor correlation in the North Atlantic. This was previously observed by in an earlier version of the FOAM system. They explained the lack of skill in the North Atlantic by the domination of the mesoscale, which is more difficult to predict, while in more benign regions the model performed better. It is in the Southern Ocean, a region largely dominated by the mesoscale, that the $\mathrm{1}/\mathrm{12}{}^{\circ }$ model (PSY4) most clearly outperforms the $\mathrm{1}/\mathrm{4}{}^{\circ }$ models (CPLDA, FOAM). Here PSY4 has a correlation of 0.62, while CPLDA and FOAM have correlations of 0.45 and 0.50, respectively.

Eddy kinetic energy (EKE) was also compared in CPLDA, FOAM and PSY4. In an ocean-only model, using the ocean velocity to calculate the wind stress has a damping effect on the eddies . Despite the expected reduced damping effect in a coupled system, we did not observe a higher EKE in CPLDA than in FOAM. However, at $\mathrm{1}/\mathrm{4}{}^{\circ }$ resolution, the mesoscale is poorly represented and most of the EKE is injected into the model by the SLA assimilation. Hence differences in EKE between CPLDA and FOAM are mainly caused by differences in SLA observations assimilated, or impacts of the shorter assimilation window, rather than due to a reduced eddy damping by the wind stress.

4 Conclusions

Since July 2017, the CPLDA system has been delivering an ocean analysis and forecast to CMEMS operationally. This was a significant upgrade as prior to this ocean products were delivered to CMEMS from the coupled GloSea system, which was initialised from uncoupled ocean (FOAM) and atmosphere (Met Office NWP) analyses. Here we have assessed the first long (1-year) experiment using the CPLDA system in a pseudo-operational mode. Previous studies with an earlier similar system presented the results only from short (1-month) trials. This longer trial has allowed for an in-depth assessment of the CPLDA system against observations and a comparison of its ocean analysis and forecast to current benchmark products like the Met Office $\mathrm{1}/\mathrm{4}{}^{\circ }$ model FOAM and the Mercator $\mathrm{1}/\mathrm{12}{}^{\circ }$ model PSY4.

Overall, the CPLDA system performs well compared to the FOAM system. After applying an update to the scheduling to allow more observations to be available at the runtime, the SLA statistics are now similar to FOAM statistics both for the bias and RMSE. The SST statistics are improved, with an RMSE significantly smaller than in FOAM, and the warm bias that was developing in the coupled GloSea forecasts is not present in CPLDA. However, despite improving the SST, the vertical propagation of the SST increments may have contributed to an increased subsurface cold bias in CPLDA. The cold bias is present in FOAM but is worsened in CPLDA. The increased cold bias is also associated with a deepening of the MLD in CPLDA. The statistics for the 15 m current are similar in CPLDA and FOAM except for two periods when CPLDA exhibits large errors in the western tropical Pacific caused by the SLA assimilation. Again, these are largely addressed by the scheduling modification to allow more observations to be assimilated. The 15 m currents from both models have a poor correlation with observations.

Compared to other systems used to produce ocean analyses, the short data assimilation window distinguishes the CPLDA system. The impact of the data assimilation time window was investigated by (1-month experiment) and further investigated in experiments described herein (6-month experiments). The FOAM system was run with a 6 h cycle and assimilation window, and the results were compared with the standard FOAM 24 h cycle. found a small impact on the temperature and salinity profiles, as well as on the SLA statistics, but saw an improvement in the SST statistics explained because with a shorter cycle the model errors have less time to grow. These results were supported by the new longer experiments detailed in this paper. The shorter window in CPLDA compared to FOAM has a positive impact on the SST analysis, but the RMSE increase during the forecast is enhanced in CPLDA, suggesting that the system may be overfitting the observations. The shorter assimilation window in CPLDA also leads to the surface temperature increments being propagated deeper and to an over-deepening of the mixed layer depth. This can be partly explained by the asymmetric effect of noisy surface temperature increments on the vertical temperature structure. In the assimilation scheme the depth to which the SST increment is propagated is determined by the mixed layer depth of the background field. In the current system, the background field used is that at the first time step of the observation operator. To reduce the noise, a future improvement could be to use a daily mean field instead of the instantaneous field. This method has been tested with a UK shelf model but has not yet been tested for the CPLDA system.

The impact of the short assimilation is also seen in the surface currents near the Equator where the SLA assimilation is responsible for the large erroneous currents in CPLDA in the western tropical Pacific. The problem is also present in the FOAM system with a 24 h window; indeed, both systems have excessive eddy kinetic energy along the Equator. However, it is likely that having a 6 h cycle makes the problem worse as it limits the numbers of observations assimilated, creating large increments which cause the spurious currents. It highlights the need to improve the SLA assimilation, in particular using background error covariances appropriate to a 6 h assimilation window. The error covariances currently used are the same as those used the FOAM system, but estimates specific to the CPLDA system would take into account the 6 h assimilation window as well as the different model error characteristics of the coupled model relative to the ocean-only model. The 1-year run carried out gives us the data to allow for the estimation of error covariances for the CPLDA system; making these improvements to the ocean DA should help to resolve the problems seen in equatorial currents and hopefully improve the correlation of the model velocity to drifter velocities.

For many years, ocean analyses and forecasts have been produced from forced ocean models. Using a coupled system brings some new challenges. When assessing the CPLDA system it is important to understand which changes are genuine impacts of coupled processes and feedbacks, as opposed to unavoidable changes in the system set-up (like, for example, the use of a 6 h assimilation window). The comparison between CPLDA and FOAM surface fluxes highlighted some significant differences in the CPLDA and NWP atmospheric fields. Firstly, there is a significant difference in 10 m air temperature, with the NWP fields (used to force FOAM) being warmer than CPLDA; this is also associated with an increased specific humidity at 10 m. Differences in both air temperature and humidity are large scale and especially significant between 30 S and 30 N. With colder and drier air at 10 m in CPLDA, the evaporation is larger, causing more heat loss by latent heat. This extra heat loss in CPLDA compared to FOAM contributes to the differences in SST observed, with CPLDA being colder than FOAM. Given that we have been unable to attribute the subsurface bias observed in CPLDA to the shorter assimilation window, it is likely this is also related to the surface flux differences between CPLDA and FOAM.

The differences in atmospheric configurations between the NWP model used to force FOAM in 2015 and that in CPLDA mean it is difficult to separate the impact of the coupling. A more in-depth investigation with a recently developed NWP model using the same configuration as CPLDA is required to understand to what extent these differences are caused by the coupling. In addition, further work is needed to understand the relationship between the differences in shortwave radiation and those in surface air temperature and specific humidity.

The assessment of the surface fluxes also emphasised the importance of the bulk formulae. In CPLDA, the fluxes are calculated by the atmosphere component using the UM bulk formulae (based on COARE3.0), while in FOAM the fluxes are calculated by NEMO using CORE bulk formulae. One of the main impacts of the different bulk formulae is the differences in wind stress. Despite having similar winds, the magnitude of the CPLDA wind stress is significantly larger than that of FOAM, particularly in the Southern Ocean. This is partly responsible for the increased, and too deep, mixed layer depth in CPLDA.

The 40 km atmospheric resolution of CPLDA is not high enough to make sensible comparisons of NWP performance with that of the Met Office operational NWP system (which at the time had a resolution of 17 km), but a basket of metrics used for assessing model performance suggests that CPLDA atmospheric performance is good and at least comparable to atmosphere-only systems at equivalent resolution. In fact, because the 2015 CPLDA trial included VarBC , which was not included in the NWP system until 2016, some aspects of the CPLDA system outperformed the much higher-resolution NWP system as operational at the time.

A “coupled NWP” system is now being developed based upon the operational CMEMS system described here. This will have a much higher-resolution (10 km) atmosphere with the aim of delivering both weather and ocean forecast products from a single system by 2021. On this timescale it is hoped that it will be possible to address some of the issues already discussed for which the CPLDA performance is slightly degraded compared to FOAM. However, the very good performance for analysis SSTs in CPLDA compared to both FOAM and OSTIA, as well as the ability to evolve these through the forecast, suggests that such a system will be well-placed to improve upon the performance of the existing NWP system. A subsequent upgrade of this system would be to increase the ocean resolution to $\mathrm{1}/\mathrm{12}{}^{\circ }$ as well as incorporating a wave model and using ensemble information to improve the ocean data assimilation.

Code availability
Code availability.

The Met Office Unified Model (MetUM) is available for use under licence. A number of research organisations and national meteorological services use the UM in collaboration with the Met Office to undertake basic atmospheric process research, produce forecasts, develop the UM code, and build and evaluate Earth system models. For further information on how to apply for a licence, see http://www.metoffice.gov.uk/research/modelling-systems/unified-model ().

JULES is available under licence free of charge. Further information on how to gain permission to use JULES for research purposes can be found at https://jules-lsm.github.io/ ().

The model code for NEMO v3.6 is available from the NEMO website (http://www.nemo-ocean.eu, ). On registering, individuals can access the code using the open-source subversion software (http://subversion.apache.org/, ).

The model code for CICE is available from the Met Office code repository https://code.metoffice.gov.uk/trac/cice/browser (). In order to implement the scientific configuration of GC2 and to allow the components to work together, a number of branches (code changes) are applied to the above codes. Please contact the authors for more information on these branches and how to obtain them.

Author contributions
Author contributions.

CG, JRJ, CH and DL contributed to the technical development of the configuration and to the scientific evaluation and analysis of the results. CG and JRJ performed the simulations and carried out the assessment of the model. AR developed the tools used to calculate the statistics and processed the observations used for validation. IA prepared tools and data for the velocity assessments.

Competing interests
Competing interests.

The authors declare that they have no conflict of interest.

Special issue statement
Special issue statement.

Acknowledgements
Acknowledgements.

This work has been carried out as part of the Copernicus Marine Environment Monitoring Service (CMEMS). CMEMS is implemented by Mercator Ocean International in the framework of a delegation agreement with the European Union.

Financial support
Financial support.

This research has been supported by the Copernicus Marine Environment Monitoring Service (CMEMS) (grant no. 2015/S 056-097650).

Review statement
Review statement.

This paper was edited by Pierre-Yves Le Traon and reviewed by two anonymous referees.

References

Amante, C. and Eakins, B. W.: ETOPO1 1 Arc-Minute Global Relief Model: Procedures, Data Sources and Analysis, NOAA Technical Memorandum NESDIS NGDC-24, 2009. a

Apache Software foundation: Apache Subversion, available at: http://subversion.apache.org/, last access: 27 September 2019. a

Bell, M., Martin, M., and Nichols, N.: Assimilation of data into an ocean model with systematic errors near the equator, Q. J. Roy. Meteor. Soc., 130, 873–893, 2004. a, b, c

Blockley, E. W., Martin, M. J., and Hyder, P.: Validation of FOAM near-surface ocean current forecasts using Lagrangian drifting buoys, Ocean Sci., 8, 551–565, https://doi.org/10.5194/os-8-551-2012, 2012. a

Blockley, E. W., Martin, M. J., McLaren, A. J., Ryan, A. G., Waters, J., Lea, D. J., Mirouze, I., Peterson, K. A., Sellar, A., and Storkey, D.: Recent development of the Met Office operational ocean forecasting system: an overview and assessment of the new Global FOAM forecasts, Geosci. Model Dev., 7, 2613–2638, https://doi.org/10.5194/gmd-7-2613-2014, 2014. a, b, c, d

Bourdallé-Badie, R. and Treguier, A.: A climatology of runoff for the global ocean-ice model, ORCA025, Mercator-Ocean reference: MOO-RP-425-365-MER august 2006, 2006. a, b

Brodeau, L., Barnier, B., Gulev, S. K., and Woods, C.: Climatologically significant effects of some approximations in the bulk parameterizations of turbulent air-sea fluxes, J. Phys. Oceanogr., 47, 5–28, https://doi.org/10.1175/JPO-D-16-0169.1, 2017. a

Browne, P. A., de Rosnay, P., Zuo, H., Bennett, A., and Dawson, A.: Weakly Coupled Ocean Atmosphere Data Assimilation in the ECMWF NWP System, Remote Sens., 11, 234, https://doi.org/10.3390/rs11030234, 2019. a

Burchard, H.: Energy-conserving discretisation of turbulent shear and buoyancy production, Ocean Model., 4, 347–361, 2002. a

Cameron, J. and Bell, W.: The testing and planned implementation of variational bias correction (VarBC) at the Met Office, 20th International TOVS study conference, Madison, WI, available at: https://cimss.ssec.wisc.edu/itwg/itsc/itsc20/papers/11_01_cameron_paper.pdf (last access: 27 September 2019), 2016. a

Dawe, J. T. and Thompson, L. A.: Effect of ocean surface currents on wind stress, heat flux, and wind power input to the ocean, Geophys. Res. Lett., 33, 1–5, 2006. a

Donlon, C. J., Martin, M., Stark, J., Roberts-Jones, J., Fiedler, E., and Wimmer, W.: The Operational Sea Surface Temperature and Sea Ice Analysis (OSTIA) system, Remote Sens. Environ., 116, 140–158, https://doi.org/10.1016/j.rse.2010.10.017, 2012. a

Duhaut, T. H. A. and Straub, D. N.: Wind stress dependence on ocean surface velocity: implications for mechanical energy input to ocean circulation, J. Phys. Oceanogr., 36, 202–211, 2006. a

Etienne, H.: Quality Information Document for Global Ocean Delayed Mode in-situ Observations of Ocean Surface Currents and Temperature from Drifters. INSITU_GLO_UV_L2_REP_OBSERVATIONS_013_044, CMEMS Quality Information Document, available at: http://marine.copernicus.eu/documents/QUID/CMEMS-INS-QUID-013-044.pdf (last access: 27 September 2019), 2017. a

Fairall, C. W., Bradley, E. F., Hare, J. E., Grachev, A. A., and Edson, J. B.: Bulk Parameterization of Air–Sea Fluxes: Updates and Verification for the COARE Algorithm, J. Climate, 16, 571–591, https://doi.org/10.1175/1520-0442(2003)016<0571:BPOASF>2.0.CO;2, 2003. a

Fiedler, E.: Improvements to feature resolution in the OSTIA sea surface temperature analysis using the NEMOVAR assimilation scheme, Q. J. Roy. Meteorol. Soc., online first, https://doi.org/10.1002/qj.3644, 2018. a, b

Gaspar, P., Gregoris, Y., and Lefevre, J. M.: A simple eddy kinetic energy model for simulations of the ocean vertical mixing: tests at station Papa and long-term upper ocean study site, J. Geophys. Res., 95, 16179–16193, https://doi.org/10.1029/JC095iC09p16179, 1990. a

Haney, R. L.: Surface Thermal Boundary Condition for Ocean Circulation Models, J. Phys. Oceanogr., 1, 241–248, https://doi.org/10.1175/1520-0485(1971)001<0241:STBCFO>2.0.CO;2, 1971. a

Hollingsworth, A., Arpe, K., Tiedtke, M., Capaldo, M., and Savijrvi, H.: The Performance of a Medium-Range Forecast Model in Winter: Impact of Physical Parameterizations, Mon. Weather Rev., 108, 1736–1773, https://doi.org/10.1175/1520-0493(1980)108<1736:TPOAMR>2.0.CO;2, 1980. a

Hunke, E. C. and Dukowicz, J. K.: The elastic-viscous-plastic sea ice dynamics model in general orthogonal curvilinear coordinates on a sphere – incorporation of metric terms, Mon. Weather Rev., 130, 1848–1865, 2002. a

Hunke, E. C. and Lipscomb, W.: CICE: the Los Alamos sea ice model documentation and software user's manual. Version 4.1, LA-CC-06-012, Technical report, Los Alamos National Laboratory, N.M., 2010. a

IOC, IHO and BODC: Edition of the GEBCO Digital Atlas, published on CD-ROM on behalf of the Intergovernmental Oceanographic Commission and the International Hydrographic Organization as part of the General Bathymetric Chart of the Oceans, British Oceanographic Data Centre, Liverpool, UK, 2003. a

Johns, T., Shelly, A., Rodriguez, J., Copsey, D., Guiavarc'h, C., and Sykes, P.: Report on extensive coupled ocean-atmosphere trials on NWP (1–15 day) timescales, Met Office PMS key deliverable report, 2012. a, b

Kara, A. B., Rochford, P. A., and Hurlburt, H. E.: An optimal definition for ocean mixed layer depth, J. Geophys. Res.-Oceans, 105, 16803–16821, https://doi.org/10.1029/2000JC900072, 2000. a

King, R. R., While, J., Martin, M. J., Lea, D. J., Lemieux-Dudon, B., Waters, J., and O'Dea, E.: Improving the initialisation of the Met Office operational shelf-seas model, Ocean Modell., 130, 1–14, https://doi.org/10.1016/j.ocemod.2018.07.004, 2018. a, b

Large, W. G. and Yeager, S. G.: Diurnal to decadal global forcing for ocean and sea ice models: the data sets and flux climatologies, Technical Report TN-460+STR, NCAR, 105 pp., 2004. a, b

Lea, D. J., Mirouze, I., Martin, M. J., Shelly, A., Hines, A., and Sykes, P.: Assessment of a weakly coupled atmosphere-land-ocean-ice data assimilation system, Met Office Deliverable report, September, 2013. a, b, c, d, e

Lea, D. J., Mirouze, I., Martin, M. J., King, R. R., Hines, A., Walters, D., and Thurlow, M.: Assessing a new data assimilation system based on the Met Office coupled atmosphere-land-ocean-sea ice model, Mon. Weather Rev., 143, 4678–4694, 2015. a, b, c, d, e, f, g, h, i, j, k, l

Lellouche, J.-M., Greiner, E., Le Galloudec, O., Garric, G., Regnier, C., Drevillon, M., Benkiran, M., Testut, C.-E., Bourdalle-Badie, R., Gasparin, F., Hernandez, O., Levier, B., Drillet, Y., Remy, E., and Le Traon, P.-Y.: Recent updates to the Copernicus Marine Service global ocean monitoring and forecasting real-time 1/12 high-resolution system, Ocean Sci., 14, 1093–1126, https://doi.org/10.5194/os-14-1093-2018, 2018. a

Lengaigne, M., Menkes, C., Aumont, O., Gorgues, T., Bopp, L., André, J.-M., and Madec, G.: Influence of the oceanic biology on the tropical Pacific climate in a coupled general circulation model, Clim. Dynam., 5, 503–516, https://doi.org/10.1007/s00382-006-0200-2, 2007. a

Lorenc, A.: VSDP32. VAR documentation, Met Office report, 2013. a

MacLachlan, C., Arribas, A., Peterson, A., Fereday, D., Scaife, A. A., Gordon, M., Vellinga, M., A., W., Comer, R., Camp, J., and Xavier, P.: Description of GloSea5: the Met Office high resolution seasonal forecast system, Q. J. Roy. Meteor. Soc., 141, 1072–1084, https://doi.org/10.1002/qj.2396, 2014. a

Madec, G. and the NEMO team: NEMO ocean engine, Note du Pôle de modélisation, Institut Pierre-Simon Laplace (IPSL), No. 27, ISSN 1288-1619, 2008. a

Martin, M. J., Hines, A., and Bell, M. J.: Data assimilation in the FOAM operational short-range ocean forecasting system: a description of the scheme and its impact, Q. J. Roy. Meteor. Soc., 133, 981–995, 2007. a

Megann, A., Storkey, D., Aksenov, Y., Alderson, S., Calvert, D., Graham, T., Hyder, P., Siddorn, J., and Sinha, B.: GO5.0: the joint NERC–Met Office NEMO global ocean model for use in coupled and forced applications, Geosci. Model Dev., 7, 1069–1092, https://doi.org/10.5194/gmd-7-1069-2014, 2014. a

Met Office: Unified Model Met Office, available at: http://www.metoffice.gov.uk/research/modelling-systems/unified-model, last access: 27 September 2019a. a

Met Office: Joint UK Land Environment Simulator (JULES) Documentation, available at: https://jules-lsm.github.io/, last access: 27 September 2019b. a

Met Office: CICE code, https://code.metoffice.gov.uk/trac/cice/browser, last access: 27 September 2019c. a

Mirouze, I. and Weaver, A.: Representation of correlation functions in variational assimilation using an implicit diffusion operator, Q. J. Roy. Meteor. Soc., 136, 1421–1443, 2010. a

Mirouze, I., Blockley, E. W., Lea, D. J., Martin, M. J., and Bell, M. J.: A multiple length scale correlation operator for ocean data assimilation, Tellus A, 68, 29744, https://doi.org/10.3402/tellusa.v68.29744, 2016. a

Mogensen, K., Balmaseda, M. A., and Weaver, A.: The NEMOVAR ocean data assimilation system as implemented in the ECMWF ocean analysis for System 4, European Centre for Medium-Range Weather Forecasts, 2012. a

Mulholland, D. P., Laloyaux, P., Haines, K., and Balmaseda, M. A.: Origin and Impact of Initialization Shocks in Coupled Atmosphere–Ocean Forecasts, Mon. Weather Rev., 143, 4631–4644, https://doi.org/10.1175/MWR-D-15-0076.1, 2015. a

Murphy, A. H. and Epstein, E. S.: Skill Scores and Correlation Coefficients in Model Verification, Mon. Weather Rev., 117, 572–582, https://doi.org/10.1175/1520-0493(1989)117<0572:SSACCI>2.0.CO;2, 1989. a

NEMO consortium: NEMO Community ocean model, available at: http://www.nemo-ocean.eu, last access: 27 September 2019. a

Paulson, C. A. and Simpson, J. J.: Irradiance measurements in the upper ocean, J. Phys. Oceanogr., 7, 952–956, 1977. a, b

Penny, S. G. and Hamill, T. M.: Coupled Data Assimilation for Integrated Earth System Analysis and Prediction, B. Am. Meteorol. Soc., 98, ES169–ES172, https://doi.org/10.1175/BAMS-D-17-0036.1, 2017. a

Rae, J. G. L., Hewitt, H. T., Keen, A. B., Ridley, J. K., West, A. E., Harris, C. M., Hunke, E. C., and Walters, D. N.: Development of the Global Sea Ice 6.0 CICE configuration for the Met Office Global Coupled model, Geosci. Model Dev., 8, 2221–2230, https://doi.org/10.5194/gmd-8-2221-2015, 2015. a

Rawlins, F., Ballard, S. P., Bovis, K. J., Clayton, A. M., Li, D., Inverarity, G. W., Lorenc, A. C., and Payne, T. J.: The Met Office global four‐dimensional variational data assimilation scheme, Q. J. Roy. Meteor. Soc. 133, 347–362, 2007. a

Renault, L., Molemaker, M. J., McWilliams, J. C., Shchepetkin, A. F., Lemarié, F., Chelton, D., Illig, S., and Hall, A.: Modulation of Wind Work by Oceanic Current Interaction with the Atmosphere, J. Phys. Oceanogr., 46, 1685–1704, https://doi.org/10.1175/JPO-D-15-0232.1, 2016. a

Ridley, J. K., Blockley, E. W., Keen, A. B., Rae, J. G. L., West, A. E., and Schroeder, D.: The sea ice model component of HadGEM3-GC3.1, Geosci. Model Dev., 11, 713–723, https://doi.org/10.5194/gmd-11-713-2018, 2018. a

Rio, M.-H.: Use of Altimeter and Wind Data to Detect the Anomalous Loss of SVP-Type Drifters Drogue, J. Atmos. Ocean. Tech., 29, 1663–1674, https://doi.org/10.1175/JTECH-D-12-00008.1, 2012. a

Rio, M.-H., Guinehut, S., , and Larnicol, G.: New CNES-CLS09 global mean dynamic topography computed from the combination of GRACE data, altimetry, and in situ measurements, J. Geophys. Res., 116, C07018, https://doi.org/10.1029/2010JC006505, 2011. a

Rio, M.-H., Mulet, S., and Picot, N.: Beyond GOCE for the ocean circulation estimate: Synergetic use of altimetry, gravimetry, and in situ data provides new insight into geostrophic and Ekman currents, Geophys. Res. Lett., 41, 8918–8925, https://doi.org/10.1002/2014GL061773, 2014. a

Roberts-Jones, J., Fiedler, E. K., and Martin, M. J.: Daily, Global, High-Resolution SST and Sea Ice Reanalysis for 1985-2007 Using the OSTIA System, J. Climate, 25, 6215–6232, https://doi.org/10.1175/JCLI-D-11-00648.1, 2012. a

Ryan, A., Regnier, C., Divakaran, P., Spindler, T., Mehra, A., Smith, G., Davidson, F., Hernandez, F., Maksymczuk, J., and Liu, Y.: GODAE OceanView Class 4 forecast verification framework: global ocean inter-comparison, J. Oper. Oceanogr., 8, 98–111, 2015. a, b, c

Saha, S., Moorthi, S., Wu, X., Wang, J., Nadiga, S., Tripp, P., Behringer, D., Hou, Y.-T., Chuang, H.-y., Iredell, M., Ek, M., Meng, J., Yang, R., Mendez, M. P., van den Dool, H., Zhang, Q., Wang, W., Chen, M., and Becker, E.: The NCEP Climate Forecast System Version 2, J. Climate, 27, 2185–2208, https://doi.org/10.1175/JCLI-D-12-00823.1, 2014. a

Semtner, A. J.: A model for the thermodynamic growth of sea ice in numerical investigations of climate, J. Phys. Oceanogr., 6, 379–389, 1976. a

Smith, G. C., Belanger, J.-M., Roy, F., Pellerin, P., Ritchie, H., Onu, K., Roch, M., Zadra, A., Colan, D. S., Winter, B., Fontecilla, J.-S., and Deacu, D.: Impact of Coupling with an Ice-Ocean Model on Global Medium-Range NWP Forecast Skill, Mon.Weather Rev., 146, 1157–1180, https://doi.org/10.1175/MWR-D-17-0157.1, 2018. a

Thorndike, A., Rothrock, D., Maykut, G., and Colony, R.: The thickness distribution of sea ice, J. Geophys. Res., 80, 4501–4513, 1975. a

Valcke, S.: OASIS3 User Guide (prism 2-5), PRISM Support Initiative No 3, 2006. a

Walters, D., Boutle, I., Brooks, M., Melvin, T., Stratton, R., Vosper, S., Wells, H., Williams, K., Wood, N., Allen, T., Bushell, A., Copsey, D., Earnshaw, P., Edwards, J., Gross, M., Hardiman, S., Harris, C., Heming, J., Klingaman, N., Levine, R., Manners, J., Martin, G., Milton, S., Mittermaier, M., Morcrette, C., Riddick, T., Roberts, M., Sanchez, C., Selwood, P., Stirling, A., Smith, C., Suri, D., Tennant, W., Vidale, P. L., Wilkinson, J., Willett, M., Woolnough, S., and Xavier, P.: The Met Office Unified Model Global Atmosphere 6.0/6.1 and JULES Global Land 6.0/6.1 configurations, Geosci. Model Dev., 10, 1487–1520, https://doi.org/10.5194/gmd-10-1487-2017, 2017. a

Waters, J., Lea, D. J., Martin, M. J., Storkey, D., and While, J.: Describing the development of the new FOAM-NEMOVAR system in the global 1/4 degree configuration, Met Office Technical Report 578, 2013. a, b

Waters, J., Lea, D. J., Martin, M. J., Mirouze, I., Weaver, A., and While, J.: Implementing a variational data assimilation system in an operational 1/4 degree global ocean model, Q. J. Roy. Meteor. Soc., 141, 333–349, https://doi.org/10.1002/qj.2388, 2015. a

Weaver, A. T., Deltel, C., Machu, E., Ricci, S., and Daget, N.: A multivariate balance operator for variational ocean data assimilation, Q. J. Roy. Meteor. Soc., 131, 3605–3625, https://doi.org/10.1256/qj.05.119, 2006. a

Wehde, H., V. Schuckmann, K., Pouliquen, S., Grouazel, A., Bartolome, T., Tintore, J., De Alfonso Alonso Munoyerro, M., and the INS-TAC team: Quality Information Document For Near Real Time IN SITU products INSITU_GLO_NRT_OBSERVATIONS_013_030, CMEMS Quality Information Document, available at: http://marine.copernicus.eu/documents/QUID/CMEMS-INS-QUID-013-030-036.pdf (last access: 27 September 2019), 2016. a

West, A. E., McLaren, A. J., Hewitt, H. T., and Best, M. J.: The location of the thermodynamic atmosphere–ice interface in fully coupled models – a case study using JULES and CICE, Geosci. Model Dev., 9, 1125–1141, https://doi.org/10.5194/gmd-9-1125-2016, 2016.  a

Zalesak, S. T.: Fully multidimensional flux-corrected transport algorithms for fluids, J. Computat. Phys., 31, 335–362, https://doi.org/10.1016/0021-9991(79)90051-2, 1979. a