Periodic Reporting for period 1 - MAELSTROM (MAchinE Learning for Scalable meTeoROlogy and cliMate)
Reporting period: 2021-04-01 to 2022-09-30
The MAELSTROM compute system designs test machine learning applications across a range of hardware configurations regarding energy consumption, time-to-solution, numerical precision and solution accuracy. Customised compute systems are designed that are optimised for application needs to strengthen Europe’s high-performance computing portfolio and to pull recent hardware developments, driven by general machine learning applications, toward needs of weather and climate applications.
The MAELSTROM software framework enables scientists to apply and compare machine learning tools and libraries efficiently across a wide range of computer systems. A user interface will link application developers with compute system designers, and automated benchmarking and error detection of machine learning solutions will be performed during the development phase. Tools will be published as open source.
The MAELSTROM machine learning applications cover all important components of the workflow of weather and climate predictions including the processing of observations, the assimilation of observations to generate initial and reference conditions, model simulations, as well as post-processing of model data and the development of forecast products. For each application, benchmark datasets with up to 10 terabytes of data are published online for training and machine learning tool-developments. MAELSTROM machine learning solutions serve as a blueprint for a wide range of machine learning applications on supercomputers in the future.
- The first wave of deliverables were the survey deliverables which outlined the state-of-the-art for machine learning applications (D1.2) software (D2.1) and hardware (D3.1 and D3.2).
- MAELSTROM datasets already comprise 16 TB of data which are documented and published, and are available for download via the internet.
- The development of MAELSTROM software tools progressed as planed and will soon be useable for all MAELSTROM applications.
- The first hardware benchmarks for the MAELSTROM applications have been performed and results have been reported and fed back to the application designers.
- We had the 1st MAELSTROM Dissemination Workshop on 28th March which was organised back-to-back with the Machine Learning Workshop at ECMWF from 29th March to 1st April. The dissemination workshop attracted 208 registered participants.
- The first 1st MAELSTROM hackathon (called MAELSTROM Bootcamp) was very successful with more than 30 participants and 16 scientific advisors meeting from 27th-30th September at JSC.
- MAELSTROM scientists have already provided more than 30 presentations and published 13 papers.
- We have designed a project webpage that provides access to all important information on the project https://www.maelstrom-eurohpc.eu/.
- MAELSTROM applications are already used by industry as reference benchmarks for HPC performance and machine learning application quality, including companies such as GRAPHCORE, Microsoft and ATOS.