
291 points | meetpateltech | 5 comments
binsquare
I find it interesting that they quantify the improvement in speed and in the number of forecasted scenarios, but give no detail on how this results in improved forecast accuracy, per:

``` WeatherNext 2 can generate forecasts 8x faster and with resolution up to 1-hour. This breakthrough is enabled by a new model that can provide hundreds of possible scenarios. ```

As an end user, all I care about is that there's one accurate forecasted scenario.

1. meandthewallaby
This is really important: you're not the end user of this product. These types of models are not built for laypeople to access. You're an end user of a product that may consume and process this data, but the CRPS scorecard, for example, should mean nothing to you. This is specifically addressing an under-dispersion problem in traditional ensemble models, caused by a limited number of members (~50) and a limited set of perturbed initial conditions (and the fact that those perturbations do very poorly at capturing true uncertainty).

Again, you, as an end user, don't need to know any of that. The CRPS scorecard is a very specific measure of error. I don't expect them to reveal the technical details of the model, but an industry expert instantly knows what WeatherBench[1] is, the code it runs, the data it uses, and how that CRPS scorecard was generated.
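
As a rough illustration of what CRPS measures (this is not the WeatherBench implementation, just the textbook ensemble estimator CRPS = E|X - y| - 0.5 E|X - X'|):

```python
import numpy as np

def crps_ensemble(members, obs):
    """Empirical CRPS for one scalar observation and an ensemble.

    CRPS = E|X - y| - 0.5 * E|X - X'|, estimated from the members.
    Lower is better; the score rewards accuracy *and* calibrated spread.
    """
    members = np.asarray(members, dtype=float)
    term1 = np.mean(np.abs(members - obs))
    term2 = 0.5 * np.mean(np.abs(members[:, None] - members[None, :]))
    return term1 - term2

# Toy example: an under-dispersed (overconfident) ensemble scores worse
# than a well-dispersed one when the truth falls outside its spread.
rng = np.random.default_rng(0)
truth = 2.0
tight = rng.normal(0.0, 0.3, size=50)   # confident but biased
spread = rng.normal(0.0, 1.5, size=50)  # wider, covers the truth
print(crps_ensemble(tight, truth), crps_ensemble(spread, truth))
```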

By having better-dispersed ensemble forecasts, we can more quickly address observation gaps that may be needed to better solidify certain patterns or outcomes, which will lead to more accurate deterministic forecasts (aka the ones you get on your phone). These are a piece of the puzzle, though, and not one that you will ever actually encounter as a layperson.

[1]: https://sites.research.google/gr/weatherbench/

2. counters
> By having better-dispersed ensemble forecasts, we can more quickly address observation gaps that may be needed to better solidify certain patterns or outcomes, which will lead to more accurate deterministic forecasts.

Sorry - not sure this is a reasonable take-away. The models here are all still initialized from analyses performed by ECMWF; Google is not running an in-house data assimilation product for this. So there is no feedback mechanism between ensemble spread/uncertainty and the observations themselves in this stack. The output of this system could be interrogated using something like Ensemble Sensitivity Analysis, but there's nothing novel about that, and we can already do it with existing ensemble forecast systems.
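
For reference, Ensemble Sensitivity Analysis in its simplest univariate form is just a per-gridpoint regression of a scalar forecast metric onto the initial-condition field across ensemble members. A minimal sketch (the array shapes are assumptions, not any particular system's API):

```python
import numpy as np

def ensemble_sensitivity(init_states, metric):
    """Univariate Ensemble Sensitivity Analysis.

    init_states: (n_members, n_gridpoints) initial-condition field
    metric:      (n_members,) scalar forecast metric per member
    Returns cov(J, x_i) / var(x_i) at each grid point: how much the
    forecast metric shifts per unit perturbation of the initial state.
    """
    x = init_states - init_states.mean(axis=0)
    j = metric - metric.mean()
    cov = (x * j[:, None]).mean(axis=0)
    var = (x ** 2).mean(axis=0)
    return cov / np.where(var > 0, var, np.nan)
```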

3. DoctorOetker
Sorry to hijack you: I have some questions regarding current weather models:

I am personally not interested in predicting the weather as end users expect it; rather, I am interested in representative evolutions of wind patterns. I.e., specify some location (say somewhere in the North Sea, or perhaps on mainland Western Europe) and a date (say Nov 12) without specifying a year, and get the wind patterns at different heights for that location for, say, half an hour. Basically, running with different seeds, I want representative evolutions of the wind vector field (without specifying starting conditions other than location and date, i.e. NO prior weather).

Are there any ML models capable of delivering realistic and representative wind gust simulations?

(The context is structural stability analysis of hypothetical megastructures)

4. counters
I mean - you don't need any ML for that. Just grab random samples from a ~30-day window centered on your day of interest, over your region of interest, from a reanalysis product like ERA5. If the duration of ERA5 isn't sufficient (e.g. you wouldn't expect, on average, to see events with a >100-year return period given the limited temporal extent of the dataset), then you could go one step further and pull from an equilibrium climate model simulation - some of these are published as part of the CMIP inter-comparison, or you could go to purpose-built ensembles like the CESM LENS [1]. You could also use a generative climate downscaling model like NVIDIA's Climate-in-a-bottle, but that's almost certainly overkill for your application.

[1]: https://www.cesm.ucar.edu/community-projects/lens
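
The climatological-sampling recipe above is a few lines of xarray, assuming you've already downloaded an ERA5 subset with wind components to a local NetCDF file (the file name, variable names, and coordinates here are placeholders):

```python
import numpy as np
import xarray as xr

# Hypothetical local ERA5 subset with hourly u/v wind on pressure levels.
ds = xr.open_dataset("era5_winds.nc")

# Keep only times within +/-15 days of Nov 12, across all years
# (circular day-of-year distance so the window wraps around New Year).
doy = ds["time"].dt.dayofyear
target = 316  # Nov 12 in a non-leap year
window = np.abs(((doy - target + 182) % 365) - 182) <= 15
clim = ds.isel(time=window.values)

# Draw random snapshots at a point of interest in the North Sea.
rng = np.random.default_rng(42)
picks = rng.choice(clim["time"].values, size=100, replace=False)
samples = clim.sel(time=picks).sel(latitude=55.0, longitude=3.0,
                                   method="nearest")
print(samples["u"].shape)  # (100, n_levels) if "u" is the zonal wind
```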

5. DoctorOetker
ERA5 seems to give hourly data; with hourly sampling the Nyquist period is 2 hours, so in practice you would only get decent oscillation amplitudes for waves with periods of about 5 hours or more, whereas I am more interested in faster timescales (seconds, minutes), i.e. wind gusts.

Calculating the stability and structural requirements of a super-chimney reaching the tropopause would require representative, higher-temporal-frequency wind fields.

Do you know if I can extract such high time resolution from LENS? A cursory look at ERA5 showed a time resolution of just 1 hour.

The advantage of an ML model is that it's usually possible to calculate the joint probability of a wind field, or to selectively generate a dataset of N-th-percentile wind fields, etc.
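
To make that concrete, here's a toy sketch: fit a simple differentiable density to sampled wind profiles, then evaluate the joint log-probability of any profile and filter candidates by percentile. A multivariate Gaussian stands in for whatever generative model you would actually train; all names and data here are made up:

```python
import numpy as np
from scipy import stats

# Fake "training" profiles: (n_samples, n_levels) wind speeds.
rng = np.random.default_rng(0)
profiles = rng.normal(10.0, 3.0, size=(5000, 20))

# Toy stand-in for a learned density (a real model might be a
# normalizing flow, which also exposes an exact log-probability).
mu = profiles.mean(axis=0)
cov = np.cov(profiles, rowvar=False)
density = stats.multivariate_normal(mean=mu, cov=cov)

print(density.logpdf(profiles[0]))  # joint log-probability of one field

# Percentile-based selection: keep the top 1% most probable candidates.
candidates = rng.normal(10.0, 3.0, size=(1000, 20))
scores = density.logpdf(candidates)
top = candidates[scores >= np.percentile(scores, 99)]
```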

If it's differentiable, and the structural stress assumptions are known, then one can "optimize" towards wind profiles that are simultaneously more dangerous and more probable, to identify what needs addressing. That's why an ML model of local wind patterns would be desirable.

ML is more than just LLMs. The usual complaint about LLMs in this context - that there are no error bars on the output - is not entirely correct: just like differentiable ML models of physical and other phenomena, they allow you to calculate the joint probability of sentences, except that instead of modeling a natural phenomenon they model what humans uttered in the corpus (or the implicit corpus after RLHF etc.). A base-model LLM can quite accurately predict the likelihood of a human expressing a certain phrase, but that's modeling human expression, not its validity. An ML model trained on actual weather data, or on fine-grained simulated weather data, yields comparatively more accurate probability distributions, because physics isn't much of an opinion.
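
And the "dangerous yet probable" search is then just gradient ascent on a combined objective. A heavily simplified sketch - the density and the stress model below are placeholders, not real APIs:

```python
import torch

def log_prob(w):
    # Placeholder density over a wind profile: standard normal.
    # A trained flow's log_prob would go here instead.
    return -0.5 * (w ** 2).sum()

def stress(w):
    # Placeholder structural-load score: penalize vertical shear.
    return (w[1:] - w[:-1]).abs().sum()

torch.manual_seed(0)
w = torch.randn(20, requires_grad=True)  # wind speed at 20 heights
opt = torch.optim.Adam([w], lr=0.05)
lam = 0.1  # trade-off between staying probable and being dangerous

for _ in range(500):
    opt.zero_grad()
    loss = -(stress(w) + lam * log_prob(w))  # ascend both terms
    loss.backward()
    opt.step()

# w now approximates a "plausible worst case" under these toy models.
```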