Skip to main content
European Commission logo
français français
CORDIS - Résultats de la recherche de l’UE
CORDIS
CORDIS Web 30th anniversary CORDIS Web 30th anniversary

“Bridging the technology gap: Integrating Malta into European Research and Innovation efforts for AI-based language technologies”

CORDIS fournit des liens vers les livrables publics et les publications des projets HORIZON.

Les liens vers les livrables et les publications des projets du 7e PC, ainsi que les liens vers certains types de résultats spécifiques tels que les jeux de données et les logiciels, sont récupérés dynamiquement sur OpenAIRE .

Livrables

Publications

Exploring the Impact of Transliteration on NLP Performance: Treating Maltese as an Arabic Dialect

Auteurs: Kurt Micallef, Fadhl Eryani, Nizar Habash, Houda Bouamor, Claudia Borg
Publié dans: Proceedings of the Workshop on Computation and Written Language (CAWL 2023), 2023, Page(s) 22-32
Éditeur: Association for Computational Linguistics
DOI: 10.18653/v1/2023.cawl-1.4

FINDINGS OF THE IWSLT 2023 EVALUATION CAMPAIGN

Auteurs: Milind Agarwal, Sweta Agrawal, Antonios Anastasopoulos, Luisa Bentivogli, Ondřej Bojar, Claudia Borg, Marine Carpuat, Roldano Cattoni, Mauro Cettolo, Mingda Chen, William Chen, Khalid Choukri, Alexandra Chronopoulou, Anna Currey, Thierry Declerck, Qianqian Dong, Kevin Duh, Yannick Estève, Marcello Federico, Souhir Gahbiche, Barry Haddow, Benjamin Hsu, Phu Mon Htut, Hirofumi Inaguma, Dávid Javor
Publié dans: Proceedings of the 20th International Conference on Spoken Language Translation (IWSLT 2023), 2023, Page(s) 1–61
Éditeur: Association for Computational Linguistics
DOI: 10.18653/v1/2023.iwslt-1.1

Face2Text: Collecting an Annotated Image Description Corpus for the Generation of Rich Face Descriptions.

Auteurs: Albert Gatt, Marc Tanti, Adrian Muscat, Patrizia Paggio, Reuben A Farrugia, Claudia Borg, Kenneth P Camilleri, Michael Rosner, and Lonneke van der Plas.
Publié dans: Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), 2018, Page(s) 3323-3328
Éditeur: European Language Resources Association (ELRA).

From Linguistic Linked Data to Big Data

Auteurs: Dimitar Trajanov, Elena Apostol, Radovan Garabik, Katerina Gkirtzou, Dagmar Gromann, Chaya Liebeskind, Cosimo Palma, Michael Rosner, Alexia Sampri, Gilles Sérasset, Blerina Spahiu, Ciprian-Octavian Truică, Giedre Valunaite Oleskeviciene
Publié dans: Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), 2024, Page(s) 7489–7502
Éditeur: ELRA and ICCL

Findings of the 2021 Conference on Machine Translation (WMT21)

Auteurs: Farhad Akhbardeh, Arkady Arkhangorodsky, Magdalena Biesialska, Ondřej Bojar, Rajen Chatterjee, Vishrav Chaudhary, Marta R. Costa-jussa, Cristina España-Bonet, Angela Fan, Christian Federmann, Markus Freitag, Yvette Graham, Roman Grundkiewicz, Barry Haddow, Leonie Harter, Kenneth Heafield, Christopher Homan, Matthias Huck, Kwabena Amponsah-Kaakyire, Jungo Kasai, Daniel Khashabi, Kevin Knight, Tom
Publié dans: Proceedings of the Sixth Conference on Machine Translation, 2021, Page(s) 1–88
Éditeur: Association for Computational Linguistics

Cross-Lingual Transfer from Related Languages: Treating Low-Resource Maltese as Multilingual Code-Switching

Auteurs: Kurt Micallef, Nizar Habash, Claudia Borg, Fadhl Eryani, Houda Bouamor
Publié dans: Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers), 2024, Page(s) 1014–1025
Éditeur: Association for Computational Linguistics

What's the Meaning of Superhuman Performance in Today's NLU?

Auteurs: Simone Tedeschi; Johan Bos; Thierry Declerck; Jan Hajič; Daniel Hershcovich; Eduard Hovy; Alexander Koller; Simon Krek; Steven Schockaert; Rico Sennrich; Ekaterina Shutova; Roberto Navigli
Publié dans: Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023, Page(s) 12471–12491
Éditeur: Association for Computational Linguistics
DOI: 10.48550/arxiv.2305.08414

On the Cusp of Comprehensibility: Can Language Models Distinguish Between Metaphors and Nonsense?

Auteurs: Bernadeta Griciūtė, Marc Tanti, Lucia Donatelli
Publié dans: Proceedings of the 3rd Workshop on Figurative Language Processing (FLP), 2023, Page(s) 173–177
Éditeur: Association for Computational Linguistics
DOI: 10.18653/v1/2022.flp-1.25

A Linked Data Approach for linking and aligning Sign Language and Spoken Language Data

Auteurs: Thierry Declerck, Sam Bigeard, Fahad Khan, Irene Murtagh, Sussi Olsen, Mike Rosner, Ineke Schuurman, Andon Tchechmedjiev, Andy Way
Publié dans: Proceedings of the Second International Workshop on Automatic Translation for Signed and Spoken Languages, 2023, Page(s) 11–21
Éditeur: European Association for Machine Translation

Your Stereotypical Mileage May Vary: Practical Challenges of Evaluating Biases in Multiple Languages and Cultural Contexts

Auteurs: Karen Fort, Laura Alonso Alemany, Luciana Benotti, Julien Bezançon, Claudia Borg, Marthese Borg, Yongjian Chen, Fanny Ducel, Yoann Dupont, Guido Ivetta, Zhijian Li, Margot Mieskes, Marco Naguib, Yuyan Qian, Matteo Radaelli, Wolfgang S. Schmeisser-Nieto, Emma Raimundo Schulz, Thiziri Saci, Sarah Saidi, Javier Torroba Marchante, Shilin Xie, Sergio E. Zanotto, Aurélie Névéol
Publié dans: Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), 2024, Page(s) 17764–17769
Éditeur: ELRA and ICCL

Leveraging DBnary Data to Enrich Information of Multiword Expressions in Wiktionary

Auteurs: Serasset, Gilles; Declerck, Thierry; Bajčetić, Lenka
Publié dans: LDK 2023 – 4th Conference on Language, Data and Knowledge, 2023, Page(s) 49–60
Éditeur: NOVA CLUNL

UM-DFKI Maltese Speech Translation

Auteurs: Aiden Williams, Kurt Abela, Rishu Kumar, Martin Bär, Hannah Billinghurst, Kurt Micallef, Ahnaf Mozib Samin, Andrea DeMarco, Lonneke van der Plas, Claudia Borg
Publié dans: Proceedings of the 20th International Conference on Spoken Language Translation (IWSLT 2023), 2023, Page(s) 433–441
Éditeur: Association for Computational Linguistics
DOI: 10.18653/v1/2023.iwslt-1.41

Cross-lingual Transfer Learning with Persian

Auteurs: Sepideh Mollanorozy, Marc Tanti, Malvina Nissim
Publié dans: Proceedings of the 5th Workshop on Research in Computational Linguistic Typology and Multilingual NLP, 2023, Page(s) 89–95
Éditeur: Association for Computational Linguistics
DOI: 10.18653/v1/2023.sigtyp-1.9

COMET for Low-Resource Machine Translation Evaluation: A Case Study of English-Maltese and Spanish-Basque

Auteurs: Júlia Falcão, Claudia Borg, Nora Aranberri, Kurt Abela
Publié dans: Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), 2024, Page(s) 3553–3565
Éditeur: ELRA and ICCL

Visually grounded generation of entailments from premises

Auteurs: Somayeh Jafaritazehjani, Albert Gatt, Marc Tanti
Publié dans: Proceedings of the 12th International Conference on Natural Language Generation, 2019, Page(s) 178-188
Éditeur: Association for Computational Linguistics
DOI: 10.18653/v1/w19-8625

Towards a Corpus of Spoken Maltese: Korpus tal-Malti Mitkellem, KMM

Auteurs: Alexandra (Sandra) Vella, Sarah Agius, Aiden Williams, Claudia Borg
Publié dans: Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), 2024, Page(s) 16343–16352
Éditeur: ELRA and ICCL

The 2023 WebNLG Shared Task on Low Resource Languages. Overview and Evaluation Results (WebNLG 2023)

Auteurs: Liam Cripwell, Anya Belz, Claire Gardent, Albert Gatt, Claudia Borg, Marthese Borg, John Judge, Michela Lorandi, Anna Nikiforovskaya, William Soto Martinez
Publié dans: Proceedings of the Workshop on Multimodal, Multilingual Natural Language Generation and Multilingual WebNLG Challenge (MM-NLG 2023), 2023, Page(s) 55–66
Éditeur: Association for Computational Linguistics

Linguistic LOD for Interoperable Morphological Description

Auteurs: Michael Rosner, Maxim Ionov
Publié dans: Proceedings of the 9th Workshop on Linked Data in Linguistics @ LREC-COLING 2024, 2024, Page(s) 94–102
Éditeur: ELRA and ICCL

PARSEME multilingual corpus of verbal multiword expressions.

Auteurs: Savary, Agata & Candito, Marie & Mititelu, Verginica & Bejček, Eduard & Cap, Fabienne & Čéplö, Slavomír & Cordeiro, Silvio & Eryiğit, Gülşen & Giouli, Voula & Van Gompel, Maarten & HaCohen-Kerner, Yaakov & Kovalevskaitė, Jolanta & Krek, Simon & Liebeskind, Chaya & Monti, Johanna & Parra Escartín, Carla & Der, Lonneke & Qasemi Zadeh, Behrang & Ramisch, Carlos & Vincze, Veronika.
Publié dans: Multiword expressions at length and in depth: Extended papers from the MWE 2017 workshop, 2018, ISBN 978-3-96110-123-8
Éditeur: Language Science Press

Enriching Multiword Terms in Wiktionary with Pronunciation Information

Auteurs: Lenka Bajcetic, Thierry Declerck, Gilles Sérasset
Publié dans: Proceedings of the 19th Workshop on Multiword Expressions (MWE 2023), 2023, Page(s) 65–72
Éditeur: Association for Computational Linguistics
DOI: 10.18653/v1/2023.mwe-1.10

Pre-training Data Quality and Quantity for a Low-Resource Language: New Corpus and BERT Models for Maltese

Auteurs: Kurt Micallef, Albert Gatt, Marc Tanti, Lonneke van der Plas and Claudia Borg
Publié dans: Proceedings of the Third Workshop on Deep Learning for Low-Resource Natural Language Processing, 2022, Page(s) 90–101
Éditeur: Association for Computational Linguistics
DOI: 10.18653/v1/2022.deeplo-1.10

Beyond Concatenative Morphology: Applying OntoLex-Morph to Maltese

Auteurs: Maxim Ionov, Mike Rosner
Publié dans: Proceedings of the 4th Conference on Language, Data and Knowledge, 2023, Page(s) 385–391
Éditeur: NOVA CLUNL

Topic Classification and Headline Generation for Maltese Using a Public News Corpus

Auteurs: Amit Kumar Chaudhary, Kurt Micallef, Claudia Borg
Publié dans: Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), 2024, Page(s) 16274–16281
Éditeur: ELRA and ICCL

Transfer learning from language models to image caption generators: Better models may not transfer better

Auteurs: Marc Tanti, Albert Gatt and Kenneth Micallef
Publié dans: 2019
Éditeur: arXiv preprint

A Dependency Parser for Maltese - Comparing the impact of transfer learning from Romance and Semitic Languages

Auteurs: Andrei Zammit, Slavomír Čéplö, Lonneke van der Plas, Claudia Borg
Publié dans: Abstract in the 7th International Conference on Maltese Linguistics, 2019
Éditeur: Għaqda Internazzjonali tal-Lingwistika Maltija

Where to put the image in an image caption generator

Auteurs: MARC TANTI, ALBERT GATT, KENNETH P. CAMILLERI
Publié dans: Natural Language Engineering, Numéro 24, 2020, Page(s) 467-489, ISSN 1351-3249
Éditeur: Cambridge University Press
DOI: 10.1017/s1351324918000098

Prosodic and gestural marking of complement fronting in Maltese

Auteurs: Paggio, Patrizia and Galea, Luke and Vella, Alexandra
Publié dans: The languages of Malta, 2018, Page(s) 81–116
Éditeur: Language Science Press
DOI: 10.5281/zenodo.1181805

Quantifying the Amount of Visual Information Used by Neural Caption Generators

Auteurs: Marc Tanti, Albert Gatt, Kenneth P. Camilleri
Publié dans: Lecture Notes in Computer Science, Computer Vision – ECCV 2018 Workshops, Numéro vol 11132, 2023, Page(s) 124-132, ISBN 978-3-030-11017-8
Éditeur: Springer International Publishing
DOI: 10.1007/978-3-030-11018-5_11

Automatic Removal of Identifying Information in Official EU Languages for Public Administrations: The MAPA Project

Auteurs: Lucie Gianola, Ēriks Ajausks, Victoria Arranz, Chomicha Bendahman, Laurent Bié, Claudia Borg, Aleix Cerdà, Khalid Choukri, Montse Cuadros, Ona De Gibert, Hans Degroote, Elena Edelman, Thierry Etchegoyhen, Ángela Franco Torres, Mercedes García Hernandez, Aitor García Pablos, Albert Gatt, Cyril Grouin, Manuel Herranz, Alejandro Adolfo Kohan, Thomas Lavergne, Maite Melero, Patrick Paroubek, Mic
Publié dans: Frontiers in Artificial Intelligence and Applications, Legal Knowledge and Information Systems, Numéro Volume 334: Legal Knowledge and Information Systems, 2020, Page(s) 223 - 226
Éditeur: IOS Press
DOI: 10.3233/faia200869

Pre-gen Metrics: Predicting Caption Quality Metrics Without Generating Captions

Auteurs: Marc Tanti, Albert Gatt, Adrian Muscat
Publié dans: Lecture Notes in Computer Science, Computer Vision – ECCV 2018 Workshops, Numéro vol 11132, 2023, Page(s) 114-123, ISBN 978-3-030-11017-8
Éditeur: Springer International Publishing
DOI: 10.1007/978-3-030-11018-5_10

Recherche de données OpenAIRE...

Une erreur s’est produite lors de la recherche de données OpenAIRE

Aucun résultat disponible