NFDI4DS | UHH-SEMS - Publication Details

AfriQA: Cross-lingual Open-Retrieval Question Answering for African Languages

FOS: Computer and information sciences Computer Science - Computation and Language Artificial Intelligence (cs.AI) Computer Science - Artificial Intelligence Computation and Language (cs.CL) Information Retrieval (cs.IR) Computer Science - Information Retrieval

DOI: 10.48550/arxiv.2305.06897 Publication Date: 2023-01-01

Abstract Supplemental Material References Cited by

AUTHORS (52)

Ogundepo, Odunayo

Gwadabe, Tajuddee...

Rivera, Clara E.

Clark, Jonathan H.

Ruder, Sebastian

Adelani, David If...

Dossou, Bonaventu...

DIOP, Abdou Aziz

Sikasote, Claytone

Hacheme, Gilles

Buzaaba, Happy

Ezeani, Ignatius

Mabuya, Rooweither

Osei, Salomey

Emezue, Chris

Kahira, Albert Nj...

Muhammad, Shamsud...

Oladipo, Akintunde

Owodunni, Abraham...

Tonja, Atnafu Lam...

Shode, Iyanuoluwa

Asai, Akari

Ajayi, Tunde Oluw...

Siro, Clemencia

Arthur, Steven

Adeyemi, Mofetoluwa

Ahia, Orevaoghene

Aremu, Anuoluwapo

Awosan, Oyinkansola

Chukwuneke, Chiamaka

Opoku, Bernard

Ayodele, Awokoya

Otiende, Verrah

Mwase, Christine

Sinkala, Boyd

Rubungo, Andre Ni...

Ajisafe, Daniel A.

Onwuegbuzia, Emek...

Mbow, Habib

Niyomutabazi, Emile

Mukonde, Eunice

Lawan, Falalu Ibr...

Ahmad, Ibrahim Said

Alabi, Jesujoba O.

Namukombo, Martin

Chinedu, Mbonu

Phiri, Mofya

Putini, Neo

Mngoma, Ndumiso

Amuok, Priscilla A.

Iro, Ruqayya Nasir

Adhiambo, Sonia

ABSTRACT

African languages have far less in-language content available digitally, making it challenging for question answering systems to satisfy the information needs of users. Cross-lingual open-retrieval question answering (XOR QA) systems -- those that retrieve answer content from other languages while serving people in their native language -- offer a means of filling this gap. To this end, we create AfriQA, the first cross-lingual QA dataset with a focus on African languages. AfriQA includes 12,000+ XOR QA examples across 10 African languages. While previous datasets have focused primarily on languages where cross-lingual QA augments coverage from the target language, AfriQA focuses on languages where cross-lingual answer content is the only high-coverage source of answer content. Because of this, we argue that African languages are one of the most important and realistic use cases for XOR QA. Our experiments demonstrate the poor performance of automatic translation and multilingual retrieval methods. Overall, AfriQA proves challenging for state-of-the-art QA models. We hope that the dataset enables the development of more equitable QA technology.

SUPPLEMENTAL MATERIAL

Coming soon ....

REFERENCES ()

CITATIONS ()

EXTERNAL LINKS

OPENAIRE - Products

PlumX Metrics

AfriQA: Cross-lingual Open-Retrieval Question Answering for African Languages

RECOMMENDATIONS

FAIR ASSESSMENT

Coming soon ....

JUPYTER LAB

Coming soon ....