Natural Language Processing Research Group

BUKNLP is a Research Group for Natural Language Processing and Machine Learning at Bayero University, Kano-Nigeria. The research group consists of academic researchers from computer science, linguistic, and students

The group research activities include sentiment analysis, social media analysis, machine translation, Computational Social Science, information retrieval, textual analysis, multilingual natural language processing as well as the creation of linguistic resources (dictionaries and annotated corpora) for applications of various types. Recently, the group focused on natural language processing for low-resource languages and related task.

Research Areas

Natural Language Processing

Machine Learning

HausaNLP

Meet the Team

Researchers

Avatar

Bello Shehu Bello

Lecturer in Computer Science

Machine Learning, Social Media Analysis, Natural Language Processing, Computational Social Science

Avatar

Ibrahim Said Ahmad

Lecturer in Information Technology

Data Mining, Machine Learning, Sentiment Analysis

Avatar

Jaafar Zubairu Maitama

Lecturer in Computer Science

Natural Language Processing, Summarization, Machine learning, Sentiment analysis

Avatar

Mahmud Yusuf Ahmad

Lecturer in Computer Science

Data mining, Machine Learning, Learning Analytics, Big data

Avatar

Shamsuddeen Hassan Muhammad

Lecturer in Computer Science

Sentimemnt Analysis, Machine Learning, Data Science, Low-resource NLP

Avatar

Suhail Kamal

Lecturer in Information Technology

Sign Language Recognition, Sign Language Translation, Machine Translation

Collaborators

Avatar

Lecturer in Computer Science

Machine Translation, Natural Language Processing

Avatar

Ahamdu Shehu

Assistant Professor of English and Literature at the American University of Nigeria, Yola

Cognitive Linguistics, Cultural Linguistics, Structural aspects of African languages

Avatar

Idris Abdulmuminu

Lecturer in Computer Science at Ahmadu Bello University, Zaria (ABU)

Neural Machine Translation, Low Resource Languages

Recent Publications

Quickly discover relevant content by filtering publications.

Participatory Research for Low-resourced Machine Translation: A Case Study in African Languages

Research in NLP lacks geographic diversity, and the question of how NLP can be scaled to low-resourced languages has not yet been adequately solved. “Low-resourced”-ness is a complex problem going beyond data availability and reflects systemic problems in society. In this paper, we focus on the task of Machine Translation (MT), that plays a crucial role for information accessibility and communication worldwide. Despite immense improvements in MT over the past decade, MT is centered around a few high-resourced languages. As MT researchers cannot solve the problem of low-resourcedness alone, we propose participatory research as a means to involve all necessary agents required in the MT development process. We demonstrate the feasibility and scalability of participatory research with a case study on MT for African languages. Its implementation leads to a collection of novel translation datasets, MT benchmarks for over 30 languages, with human evaluations for a third of them, and enables participants without formal training to make a unique scientific contribution. Benchmarks, models, data, code, and evaluation results are released under https://github.com/masakhane-io/masakhane-mt.

Projects

HausaNLP

This project aims to develop Hausa language resource for natural language processing task such as Hausa Social Media Corpus, Hausa Sentiment Lexicon , HausaNER , and POS.

Join Us?

We are always open for collaboration with motivated researchers and students with passion in our relevant research interest.