Date
Thu, 05 May 2022
Time
12:00 - 13:00
Location
L1
Speaker
Davide Faranda
Organisation
Université Paris Saclay

Latent Dirichlet Allocation (LDA) is capable of analyzing thousands of documents in a short time and highlighting important elements, recurrences and anomalies. It is generally used in linguistics to study natural language: its word analysis reveals the theme(s) of a document, each theme being identified by a specific vocabulary or, more precisely, by a particular statistical distribution of word frequency.
In the climatologists' use of LDA, the document is a daily weather map and the word is a pixel of the map. The theme with its corpus of words can become a cyclone or an anticyclone and, more generally, a 'pattern'  that the scientists term motif. Artificial intelligence – a sort of incredibly fast robot meteorologist – looks for correlations both between different places on the same map, and between successive maps over time. In a sense, it 'notices' that a particular location is often correlated with another location, recurrently throughout the database, and this set of correlated locations constitutes a specific pattern.
The algorithm performs statistical analyses at two distinct levels: at the word or pixel level of the map, LDA defines a motif, by assigning a certain weight to each pixel, and thus defines the shape and position of the motif;  LDA breaks down a daily weather map into all these motifs, each of which is assigned a certain weight.
In concrete terms, the basic data are the daily weather maps between 1948 and nowadays over the North Atlantic basin and Europe. LDA identifies a dozen or so spatially defined motifs, many of which are familiar meteorological patterns such as the Azores High, the Genoa Low or even the Scandinavian Blocking. A small combination of those motifs can then be used to describe all the maps. These motifs and the statistical analyses associated with them allow researchers to study weather phenomena such as extreme events, as well as longer-term climate trends, and possibly to understand their mechanisms in order to better predict them in the future.

The preprint of the study is available as:
 Lucas Fery, Berengere Dubrulle, Berengere Podvin, Flavio Pons, Davide Faranda. Learning a weather dictionary of atmospheric patterns using Latent Dirichlet Allocation. 2021. ⟨hal-03258523⟩ https://hal-enpc.archives-ouvertes.fr/X-DEP-MECA/hal-03258523v1
 

Please contact us with feedback and comments about this page. Last updated on 05 May 2022 10:18.