Periodic Reporting for period 3 - EOSC Future (EOSC Future)
Berichtszeitraum: 2023-04-01 bis 2023-09-30
The European Commission follows an Open Science strategy with on eight pillars. For example, that scientific publications are freely available, to develop digital skills, to engage citizens in science. For (research) data there are two main goals. First, the FAIR concept which implies that data are Findable (search in a catalogue or Google), Accessible (right to use the data), Interoperable (data can be combined with other data), and Reusable (quality of the data). Second, to realise a platform where such data and tools to work it, can be stored, found, combined and re-used - the European Open Science Cloud, or EOSC.
Complex problems and global crises like COVID require global collaboration, across disciplines, alignment between government, science, industry, and society. COVID concerned genetics, vaccine development, economic measures, behaviour, just to name a few. Vaccines could be developed fast because of years of research on related viruses. But this information should be findable and available and new results should be checked and added quickly to the database. It was also important to share experiences over countries – on research, on which measures where most effective, about effects from the vaccines, etc.
Before you can compare data there must be agreement about definitions, how the research was carried out, to describe the respondents in the clinical trials. To find the data, these must be described in a proper (comparable) way – a computer search on COVID and Corona will give no hits, unless it is somewhere described that these two terms are related (and machines might miss the link between “covid”, “Covid”, “Covid-19” and “SARS-CoV-2”).
Hence, a lot of preparatory work needs to be carried out before heterogenous data can be used and combined. Further, one needs the equipment (or research infrastructure) to handle the data. Building new facilities takes years, and therefore it is good to have an overview of infrastructures that are available in Europe – and globally. There must be organisations that take care of storing, improving, cleaning and describing the data, and there must be storage, networks and (super)computers to operate the data.
Luckily, we don’t need to start from scratch. There are many research facilities available and data (digital, but also DNA, samples, pictures, video) are stored and can go back in time. But making all this ready to share is a different chapter. That is where EOSC comes in: this platform would provide researchers a trusted environment where they can develop, access and use research data, using existing infrastructures, computers and specialist services.
The EOSC Future project is the major implementation project building the EOSC. Combining expertise from major European research infrastructures (computing, network, storage, data, tools) it works on operationalising the EOSC. It also builds on previous EU projects that developed core parts of EOSC. The architecture connects and integrates existing and new technological elements. It brings in (data) content and tools to investigate and analyse. This is carried out by a large consortium of over 90 partners, in collaboration with researchers and major European stakeholders.
The mission of the project is to bring the different research infrastructure communities together to implement an operational EOSC Platform focusing on technology and interoperability, resources, user engagement and user experience.
The EOSC will consist of three technical features, called the Core, the Exchange and the Interoperability Framework, offering base functionalities, tools to work with the data, and protocols for connecting existing facilities to the EOSC.
Content comes in via the Science Clusters, but also from national, regional or thematic research infrastructures and via procurement (existing and new services from commercial providers). There is co-creation between science and industry via so-called Digital Innovation Hubs. Via the Research Data Alliance we connect with activities outside Europe.
Training has two main features: to describe (catalogue) what is already available on training, including checks on quality, and to offer a platform for creating new training based on the material that is available in the EOSC.
Outreach is very relevant, to attract new users and service providers, to align with other stakeholders, to build communities that will last after the project.
Compared to the previous periods, many improvements were achieved:
-EOSC Core and Exchange: marketplace, onboarding of data sources, workflow of the EPOT onboarding committee, and 3rd party resource catalogues, connecting catalogues, self-service integration with EOSC Core functions (helpdesk, monitoring, order management, etc.).
-Interoperability Framework: scaled up capabilities and delivered EOSC Execution Framework and Interoperability Guidelines. The M22 release including the the EOSC Service Management System went operational in January 2023.
-Science projects: producing results on data & services content and integration and via scientific papers.
-Catalogue: enriched with 07-project services, cluster/RI services, and e-Infrastructure services: 293 resource providers and 543 individual services
-Knowledge Hub: in production, creating and delivering the training and learning resources.
-Collaboration with other projects, e.g. with the EOSC-07 projects via 3 thematic groups on technical activities; with the EOSC Steering Board including on EOSC Observatory. Beneficiaries are involved in all 13 EOSC-Association Task Forces.