Skip to main content
European Commission logo
English English
CORDIS - EU research results
CORDIS
CORDIS Web 30th anniversary CORDIS Web 30th anniversary

Piloting a Cooperative Open Web Search Infrastructure to Support Europe's Digital Sovereignty

Deliverables

Dissemination, Exploitation and Communication (DEC) Report V1

Dissemination, Exploitation and Communication (DEC) Report first version

ELSA-catalogue & code of conduct for open Web search

ELSA-catalogue & code of conduct for open Web search initial version

Model governance for federating an open search infrastructure V1

Model governance for federating an open search infrastructure Version 1

Report on scientific cooperation, community building and stakeholder involvement V1

Report on scientific cooperation, community building and stakeholder involvement initial version

Report of privacy, transparency, and trust models for search applications V1

Report of privacy, transparency, and trust models for search applications in its first version

Launch of the Pilot infrastructure
Crawler Coordination Software Stack & Demonstrator V1

Open Source Software Stack for coordinating multiple, distributed and usually independent crawlers.

The OpenWebSearch Hub and the Open Web Index V1

The OpenWebSearch Hub and the Open Web Index in a first version indexing common crawls and providing first specifications

Publications

Cross-Market Product-Related Question Answering

Author(s): Ghasemi, Negin; Aliannejadi, Mohammad; Bonab, Hamed; Kanoulas, Evangelos; de Vries, Arjen P.; Allan, James; Hiemstra, Djoerd
Published in: SIGIR 2023 - Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023
Publisher: ACM
DOI: 10.1145/3539618.3591658

A User Study on the Acceptance of Native Advertising in Generative IR

Author(s): Ines Zelch, Matthias Hagen and Martin Potthast
Published in: Proceedings of the 2024 Conference on Human Information Interaction and Retrieval (CHIIR '24), 2024, ISBN 979-8-4007-0434-5
Publisher: ACM
DOI: 10.1145/3627508.3638316

Challenges of Index Exchange for Search Engine Interoperability

Author(s): Hiemstra, D., Hendriksen, G., Kamphuis, C., & de Vries, A. P.
Published in: Proceedings of 5th International Open Search Symposium (OSSYM2023), 2023, ISBN 978-92-9083-653-7
Publisher: CERN
DOI: 10.5281/zenodo.10529619

Overview of Touché 2023: Argument and Causal Retrieval

Author(s): Alexander Bondarenko, Maik Fröbe, Johannes Kiesel, Ferdinand Schlatt, Valentin Barriere, Brian Ravenet, Léo Hemamou, Simon Luck, Jan Heinrich Reimer, Benno Stein, Martin Potthast, and Matthias Hagen
Published in: Experimental IR Meets Multilinguality, Multimodality, and Interaction. CLEF 2023. Lecture Notes in Computer Science, 2023, ISBN 978-3-031-42447-2
Publisher: Springer, Cham
DOI: 10.1007/978-3-031-42448-9_31

On Stance Detection in Image Retrieval for Argumentation

Author(s): Carnot, Miriam Louise; Schreieder, Tobias; Heinemann, Lorenz; Kiesel, Johannes; Braker, Jan; Fröbe, Maik; Potthast, Martin; Stein, Benno
Published in: SIGIR 2023 - Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023
Publisher: ACM
DOI: 10.1145/3539618.3591917

Overview of PAN 2023: Authorship Verification, Multi-Author Writing Style Analysis, Profiling Cryptocurrency Influencers, and Trigger Detection

Author(s): Janek Bevendorff, Ian Borrego-Obrador, Mara Chinea-Ríos, Marc Franco-Salvador, Maik Fröbe, Annina Heini, Krzysztof Kredens, Maximilian Mayerl, Piotr Pęzik, Martin Potthast, Francisco Rangel, Paolo Rosso, Efstathios Stamatatos, Benno Stein, Matti Wiegmann,
Published in: Experimental IR Meets Multilinguality, Multimodality, and Interaction. CLEF 2023. Lecture Notes in Computer Science, 2023, ISBN 978-3-031-42447-2
Publisher: Springer, Cham
DOI: 10.1007/978-3-031-42448-9_29

Conceptual Design and Implementation of a Prototype Search Application using the Open Web Search Index

Author(s): Nussbaumer, A., Kaushik, R., Hendriksen, G., Gürtl, S., & Gütl, C.
Published in: Proceedings of 5th International Open Search Symposium (OSSYM2023), 2023, ISBN 978-92-9083-653-7
Publisher: CERN
DOI: 10.5281/zenodo.10636166

Generating Natural Language Queries for More Effective Systematic Review Screening Prioritisation

Author(s): Shuai Wang; Harrisen Scells; Bevan Koopman; Martin Potthast; Guido Zuccon
Published in: SIGIR-AP '23: Proceedings of the Annual International ACM SIGIR Conference on Research and Development in Information Retrieval in the Asia Pacific Region, 2023
DOI: 10.48550/arxiv.2309.05238

Simulating Follow-up Questions in Conversational Search

Author(s): Kiesel, J., Gohsen, M., Mirzakhmedova, N., Hagen, M., Stein, B.
Published in: Advances in Information Retrieval. ECIR 2024. Lecture Notes in Computer Science, Issue 14609, 2024, ISBN 978-3-031-56059-0
Publisher: Springer, Cham
DOI: 10.1007/978-3-031-56060-6_25

The Information Retrieval Experiment Platform

Author(s): Fröbe, Maik; Deckers, Niklas; Stein, Benno; Reimer, Jan Heinrich; Reich, Simon; Hagen, Matthias; MacAvaney, Sean; Bevendorff, Janek; Potthast, Martin
Published in: SIGIR 2023 - Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023
Publisher: ACM
DOI: 10.48550/arXiv.2305.18932

Indicative Summarization of Long Discussions

Author(s): Syed, Shahbaz; Schwabe, Dominik; Al-Khatib, Khalid; Potthast, Martin
Published in: Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Publisher: ACL
DOI: 10.48550/arxiv.2311.01882

Continuous Integration for Reproducible Shared Tasks with TIRA.io

Author(s): Maik Fröbe, Matti Wiegmann, Nikolay Kolyada, Bastian Grahm, Theresa Elstner, Frank Loebe, Matthias Hagen, Benno Stein, and Martin Potthast
Published in: Advances in Information Retrieval. 45th European Conference on IR Research (ECIR 2023), 2023, ISBN 978-3-031-28240-9
Publisher: Springer
DOI: 10.1007/978-3-031-28241-6_20

SemEval-2023 Task 5: Clickbait Spoiling

Author(s): Maik Fröbe, Tim Gollub, Benno Stein, Matthias Hagen, and Martin Potthast
Published in: Proceedings of 17th International Workshop on Semantic Evaluation (SemEval 2023), 2023
Publisher: ACL
DOI: 10.18653/v1/2023.semeval-1.315

An Empirical Comparison of Web Content Extraction Algorithms

Author(s): Bevendorff, Janek; Kiesel, Johannes; Gupta, Sanket; Stein, Benno
Published in: SIGIR 2023 - Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023, ISBN 978-1-4503-9408-6
Publisher: ACM
DOI: 10.1145/3539618.3591920

The Archive Query Log: Mining Millions of Search Result Pages of Hundreds of Search Engines from 25 Years of Web Archives

Author(s): Reimer, Jan Heinrich; Gienapp, Lukas; Schmidt, Sebastian; Scells, Harrisen; Fröbe, Maik; Stein, Benno; Hagen, Matthias; Potthast, Martin
Published in: SIGIR 2023 - Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023
Publisher: ACM
DOI: 10.48550/arXiv.2304.00413

MMEAD: MS MARCO Entity Annotations and Disambiguations

Author(s): Kamphuis, Chris; Lin, Jimmy; Lin, Aileen; de Vries, Arjen P.; Yang, Siwen; Hasibi, Faegheh
Published in: SIGIR 2023 - Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023
Publisher: ACM
DOI: 10.1145/3539618.3591887

Overview of Touché 2024: Argumentation Systems

Author(s): Kiesel, J. et al.
Published in: Advances in Information Retrieval. ECIR 2024. Lecture Notes in Computer Science, Issue 14612, 2024, ISBN 978-3-031-56068-2
Publisher: Springer, Cham
DOI: 10.1007/978-3-031-56069-9_64

Weighted AUReC: Handling Skew in Shard Map Quality Estimation for Selective Search

Author(s): Hendriksen, G., Hiemstra, D., de Vries, A.P.
Published in: Advances in Information Retrieval. ECIR 2024. Lecture Notes in Computer Science, Issue 14611, 2024, ISBN 978-3-031-56065-1
Publisher: Springer, Cham
DOI: 10.1007/978-3-031-56066-8_10

Citance-Contextualized Summarization of Scientific Papers

Author(s): Syed, Shahbaz; Hakimi, Ahmad Dawar; Al-Khatib, Khalid; Potthast, Martin
Published in: Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Publisher: ACL
DOI: 10.48550/arxiv.2311.02408

Analyzing Adversarial Attacks on Sequence-to-Sequence Relevance Models

Author(s): Parry, A., Fröbe, M., MacAvaney, S., Potthast, M., Hagen, M.
Published in: Advances in Information Retrieval. ECIR 2024. Lecture Notes in Computer Science, Issue 14609, 2024, ISBN 978-3-031-56059-0
Publisher: Springer, Cham
DOI: 10.48550/arXiv.2403.07654

Bootstrapped nDCG Estimation in the Presence of Unjudged Documents

Author(s): Maik Fröbe, Lukas Gienapp, Martin Potthast, and Matthias Hagen
Published in: Advances in Information Retrieval. 45th European Conference on IR Research (ECIR 2023), 2023, ISBN 978-3-031-28243-0
Publisher: Springer
DOI: 10.1007/978-3-031-28244-7_20

OWler: Preliminary results for building a Collaborative Open Web Crawler

Author(s): Dinzinger, M., Al-Maamari, M., Zerhoudi, S., Istaiti, M., Mitrović, J., & Granitzer, M.
Published in: Proceedings of 5th International Open Search Symposium (OSSYM2023), 2023, ISBN 978-92-9083-653-7
Publisher: CERN
DOI: 10.5281/zenodo.10581841

Understanding and Mitigating Cognitive Bias during Web Search

Author(s): Hitzginger, S., Nussbaumer, A., Gütl, C., & Ruß-Baumann, C.
Published in: Proceedings of 5th International Open Search Symposium (OSSYM2023), 2023, ISBN 978-92-9083-653-7
Publisher: CERN
DOI: 10.5281/zenodo.10607402

Trigger Warnings: Bootstrapping a Violence Detector for Fan Fiction

Author(s): Magdalena Wolska, Matti Wiegmann, Christopher Schröder, Ole Borchardt, Benno Stein, and Martin Potthast
Published in: Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Publisher: ACL
DOI: 10.18653/v1/2023.findings-emnlp.41

Commercialized Generative AI: A Critical Study of the Feasibility and Ethics of Generating Native Advertising Using Large Language Models in Conversational Web Search

Author(s): Zelch, I., Hagen, M., and Potthast, M.
Published in: Proceedings of 5th International Open Search Symposium (OSSYM2023), 2023, ISBN 978-92-9083-653-7
Publisher: CERN
DOI: 10.48550/arXiv.2310.04892

Investigating the Effects of Sparse Attention on Cross-Encoders

Author(s): Schlatt, F., Fröbe, M., Hagen, M.
Published in: Advances in Information Retrieval. ECIR 2024. Lecture Notes in Computer Science, Issue 14608, 2024, ISBN 978-3-031-56027-9
Publisher: Springer, Cham
DOI: 10.48550/arXiv.2312.17649

A Comprehensive Dataset for Webpage Classification

Author(s): Al-Maamari, M., Istaiti, M., Zerhoudi, S., Dinzinger, M., Granitzer, M., & Mitrović, J.
Published in: Proceedings of 5th International Open Search Symposium (OSSYM2023), 2023, ISBN 978-92-9083-653-7
Publisher: CERN
DOI: 10.5281/zenodo.10594210

Overview of PAN 2024: Multi-Author Writing Style Analysis, Multilingual Text Detoxification, Oppositional Thinking Analysis, and Generative AI Authorship Verification

Author(s): Bevendorff et al.
Published in: Advances in Information Retrieval. ECIR 2024. Lecture Notes in Computer Science, Issue 14613, 2024, ISBN 978-3-031-56071-2
Publisher: Springer, Cham
DOI: 10.1007/978-3-031-56072-9_1

Smooth Operators for Effective Systematic Review Queries

Author(s): Scells, Harrisen; Schlatt, Ferdinand; Potthast, Martin
Published in: SIGIR 2023 - Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023
Publisher: ACM
DOI: 10.1145/3539618.3591768

Geoparsing at Web-scale - Challenges and Opportunities

Author(s): Farzana, Sheikh Mastura; Hecking, Tobias
Published in: GeoExT 2023: First International Workshop on Geographic Information Extraction from Texts at ECIR 2023 (CEUR Workshop Proceedings), Issue 3385, 2023, ISSN 1613-0073
Publisher: CEUR-WS

Product Spam On YouTube: a Case Study

Author(s): Bevendorff, J., Wiegmann, M., Potthast, M., & Stein, B.
Published in: Proceedings of 5th International Open Search Symposium (OSSYM2023), 2023, ISBN 978-92-9083-653-7
Publisher: CERN
DOI: 10.5281/zenodo.10498306

UNFair: Search Engine Manipulation, Undetectable by Amortized Inequity

Author(s): De Jonge, Tim; Hiemstra, Djoerd
Published in: FAccT 2023 - Proceedings of the 2023 ACM Conference on Fairness, Accountability, and Transparency, 2023
Publisher: ACM
DOI: 10.1145/3593013.3594046

Pybool_ir: A Toolkit for Domain-Specific Search Experiments

Author(s): Scells, Harrisen; Potthast, Martin
Published in: SIGIR 2023 - Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023
Publisher: ACM
DOI: 10.1145/3539618.3591819

Is Google Getting Worse? A Longitudinal Investigation of SEO Spam in Search Engines

Author(s): Bevendorff, J., Wiegmann, M., Potthast, M., Stein, B.
Published in: Advances in Information Retrieval. ECIR 2024. Lecture Notes in Computer Science, Issue 14610, 2024, ISBN 978-3-031-56062-0
Publisher: Springer, Cham
DOI: 10.1007/978-3-031-56063-7_4

Advancing Multimedia Retrieval in Medical, Social Media and Content Recommendation Applications with ImageCLEF 2024

Author(s): Ionescu, B. et al.
Published in: Advances in Information Retrieval. ECIR 2024. Lecture Notes in Computer Science, Issue 14613, 2024, ISBN 978-3-031-56071-2
Publisher: Springer, Cham
DOI: 10.1007/978-3-031-56072-9_6

Impact and development of an Open Web Index for Open Web Search

Author(s): Granitzer Michael; Voigt Stefan; Noor Afshan Fathima; Golasowski Martin; Guetl Christian; Hecking Tobias; Gijs Hendriksen; Djoerd Hiemstra; Jan Martinovič; Jelena Mitrović; Izidor Mlakar; Stavros Moiras; Alexander Nussbaumer; Per Öster; Martin Potthast; Marjana Senčar Srdič; Sharikadze Megi; Kateřina Slaninová; Benno Stein; Arjen P. de Vries; Vít Vondrák; Andreas Wagner; Saber Zerhoudi
Published in: JASIST, 2023, ISSN 2330-1635
Publisher: Willey
DOI: 10.1002/asi.24818

Evaluating Generative Ad Hoc Information Retrieval

Author(s): Gienapp, Lukas; Scells, Harrisen; Deckers, Niklas; Bevendorff, Janek; Wang, Shuai; Kiesel, Johannes; Syed, Shahbaz; Fröbe, Maik; Zuccon, Guido; Stein, Benno; Hagen, Matthias; Potthast, Martin
Published in: Computing Research Repository (CoRR) in arXiv, 2023
Publisher: ArXiv
DOI: 10.48550/arxiv.2311.04694

Prototyping Open Web Search Applications with TIRA: A Case Study in Research-oriented Teaching

Author(s): Fröbe, M., Elstner, T., Scells, H., Stein, B., & Potthast, M.
Published in: Proceedings of 5th International Open Search Symposium (OSSYM2023), 2023, ISBN 978-92-9083-653-7
Publisher: CERN
DOI: 10.5281/zenodo.10557539

Searching for OpenAIRE data...

There was an error trying to search data from OpenAIRE

No results available