A zero-shot monolingual dual stage information retrieval system for Spanish biomedical systematic literature reviews.

Ofori-Boateng, Regina; Aceves-Martins, Magaly; Wiratunga, Nirmalie; Moreno-Garcia, Carlos Francisco

Authors

Magaly Aceves-Martins



Contributors

Kevin Duh (Editor)

Helena Gomez (Editor)

Steven Bethard (Editor)

Abstract

Systematic Reviews (SRs) are foundational in healthcare for synthesising evidence to inform clinical practices. Traditionally skewed towards English-language databases, SRs often exclude significant research in other languages, leading to potential biases. This study addresses this gap by focusing on Spanish, a language notably underrepresented in SRs. We present a foundational zero-shot dual information retrieval (IR) baseline system, integrating traditional retrieval methods with pre-trained language models and cross-attention re-rankers for enhanced accuracy in Spanish biomedical literature retrieval. Utilising the LILACS database, known for its comprehensive coverage of Latin American and Caribbean biomedical literature, we evaluate the approach with three real-life case studies in Spanish SRs. The findings demonstrate the system's efficacy and underscore the importance of query formulation. This study contributes to the field of IR by promoting language inclusivity and supports the development of more comprehensive and globally representative healthcare guidelines.
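The dual-stage design described in the abstract (a traditional first-stage retriever followed by a re-ranker) can be illustrated with a minimal sketch. This record does not specify the models or parameters the authors used, so everything below is an assumption for illustration: Okapi BM25 stands in for the "traditional retrieval methods", and a simple token-overlap scorer (`overlap_reranker`, a hypothetical placeholder) stands in for the cross-attention re-ranker, which in practice would be a pre-trained model scoring each query-document pair jointly. The function names and the toy Spanish titles are likewise invented.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase whitespace tokenizer (a simplification for this sketch)."""
    return text.lower().split()


def bm25_scores(query, docs, k1=1.5, b=0.75):
    """Stage 1: score every document against the query with Okapi BM25."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    avg_len = sum(len(t) for t in tokenized) / n_docs
    # Document frequency: number of documents containing each term.
    df = Counter(term for toks in tokenized for term in set(toks))
    scores = []
    for toks in tokenized:
        tf = Counter(toks)
        score = 0.0
        for term in tokenize(query):
            if term not in tf:
                continue
            idf = math.log(1 + (n_docs - df[term] + 0.5) / (df[term] + 0.5))
            norm = tf[term] + k1 * (1 - b + b * len(toks) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return scores


def overlap_reranker(query, doc):
    """Stage 2 placeholder (assumption): Jaccard token overlap.

    A real system would call a cross-attention re-ranker here instead.
    """
    q, d = set(tokenize(query)), set(tokenize(doc))
    return len(q & d) / len(q | d) if (q | d) else 0.0


def two_stage_search(query, docs, top_k=3, reranker=overlap_reranker):
    """Retrieve top_k candidates with BM25, then reorder them with the reranker."""
    first_stage = bm25_scores(query, docs)
    candidates = sorted(
        range(len(docs)), key=lambda i: first_stage[i], reverse=True
    )[:top_k]
    return sorted(candidates, key=lambda i: reranker(query, docs[i]), reverse=True)


# Toy corpus of invented Spanish titles, for illustration only.
docs = [
    "obesidad infantil en mexico",
    "tratamiento de la diabetes tipo 2",
    "prevencion de la obesidad infantil en escuelas",
    "cancer de pulmon",
]
ranking = two_stage_search("obesidad infantil", docs, top_k=2)
print([docs[i] for i in ranking])
```

Because the re-ranker is passed in as a parameter, the first stage can be kept cheap and broad while the second stage is swapped for a more expensive model that only sees the short candidate list.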

Citation

OFORI-BOATENG, R., ACEVES-MARTINS, M., WIRATUNGA, N. and MORENO-GARCIA, C. 2024. A zero-shot monolingual dual stage information retrieval system for Spanish biomedical systematic literature reviews. In Duh, K., Gomez, H. and Bethard, S. (eds.) Proceedings of the 2024 North American Chapter of the Association for Computational Linguistics conference (NAACL 2024): human language technologies, 16-21 June 2024, Mexico City, Mexico. Stroudsburg, PA: ACL [online], volume 1: long papers, pages 3725-3736. Available from: https://doi.org/10.18653/v1/2024.naacl-long.206

Presentation Conference Type: Conference Paper (published)
Conference Name: 2024 North American Chapter of the Association for Computational Linguistics conference (NAACL 2024): human language technologies
Start Date: Jun 16, 2024
End Date: Jun 21, 2024
Acceptance Date: Mar 1, 2024
Online Publication Date: Jun 30, 2024
Publication Date: Jun 30, 2024
Deposit Date: Jun 20, 2024
Publicly Available Date: Jul 2, 2024
Publisher: Association for Computational Linguistics (ACL)
Peer Reviewed: Peer Reviewed
Volume: 1: long papers
Pages: 3725-3736
ISBN: 9798891761148
DOI: https://doi.org/10.18653/v1/2024.naacl-long.206
Keywords: Systematic reviews; Healthcare; Clinical practices
Public URL: https://rgu-repository.worktribe.com/output/2377852
Publisher URL: https://aclanthology.org/2024.naacl-long.206
