Methods for Managing Audiovisual Data: Combining Automatic Efficiency with Human Accuracy

Objective

Audiovisual media content created and used in films and videos is key for people to communicate and entertain. It has also become an essential resource of modern history, since a large portion of memories and records of the 20th and 21st centuries are audiovisual. To fully benefit from this asset, fast and effective methods are needed to cope with the rapidly growing audiovisual big data that are collected in digital repositories and used internationally. MeMAD will provide novel methods for an efficient re-use and re-purpose of multilingual audiovisual content which revolutionize video management and digital storytelling in broadcasting and media production. We go far beyond the state-of-the-art automatic video description methods by making the machine learn from the human. The resulting description is thus not only a time-aligned semantic extraction of objects but makes use of the audio and recognizes action sequences. While current methods work mainly for English, MeMAD will handle multilingual source material and produce multilingual descriptions and thus enhance the user experience. Our method interactively integrates the latest research achievements in deep neural network techniques in computer vision with knowledge bases, human and machine translation in a continuously improving machine learning framework. This results in detailed, rich descriptions of the moving images, speech, and audio, which enable people working in the Creative Industries to access and use audiovisual information in more effective ways. Moreover,the intermodal translation from images and sounds into words will attract millions of new users to audiovisual media, including the visually and hearing impaired. Anyone using audiovisual content will also benefit from these verbalisations as they are non-invasive surrogates for visual and auditory information, which can be processed without the need of actually watching or listening, matching the new usage of video consumption on mobile devices.

Fields of science (EuroSciVoc)

CORDIS classifies projects with EuroSciVoc, a multilingual taxonomy of fields of science, through a semi-automatic process based on NLP techniques.

Keywords

See other projects for this call

Sub call

H2020-ICT-2017-1

Funding Scheme

RIA - Research and Innovation action

Coordinator

AALTO KORKEAKOULUSAATIO SR

Net EU contribution

€ 752 207,50

Address

OTAKAARI 1
02150 Espoo
Finland

Region

Manner-Suomi Helsinki-Uusimaa Helsinki-Uusimaa

Activity type

Higher or Secondary Education Establishments

Links

Contact the organisation Website

Participation in EU R&I programmes

HORIZON collaboration network

Total cost

€ 752 207,50

Participants (8)

HELSINGIN YLIOPISTO

Finland

Net EU contribution

€ 397 398,75

EURECOM GIE

France

Net EU contribution

€ 401 598,75

UNIVERSITY OF SURREY

United Kingdom

Net EU contribution

€ 370 680,00

YLEISRADIO OY

Finland

Net EU contribution

€ 354 850,00

LIMECRAFT NV

Belgium

Net EU contribution

€ 633 080,00

LINGSOFT OY

Finland

Net EU contribution

€ 157 717,50

INSTITUT NATIONAL DE L'AUDIOVISUEL

France

Net EU contribution

€ 212 528,75

LINGSOFT LANGUAGE SERVICES OY

Finland

Net EU contribution

€ 151 532,50

Objective

Fields of science (EuroSciVoc)

CORDIS classifies projects with EuroSciVoc, a multilingual taxonomy of fields of science, through a semi-automatic process based on NLP techniques.

Keywords

Programme(s)

Topic(s)

Call for proposal

Sub call

Funding Scheme

Coordinator

Participants (8)

Share this page

Download

Methods for Managing Audiovisual Data: Combining Automatic Efficiency with Human Accuracy

Objective

Fields of science (EuroSciVoc) CORDIS classifies projects with EuroSciVoc, a multilingual taxonomy of fields of science, through a semi-automatic process based on NLP techniques.

Keywords

Programme(s)

Topic(s)

Call for proposal

Sub call

Funding Scheme

Coordinator

Participants (8)

Share this page

Download

Fields of science (EuroSciVoc)

CORDIS classifies projects with EuroSciVoc, a multilingual taxonomy of fields of science, through a semi-automatic process based on NLP techniques.