GitHub - Yanmi01/mini-SabiYarn

Pretraining a foundational model after SabiYarn

This repository is built, mimicking the SabiYarn, a Large Language model pre-trained on major Nigerian languages; pidgin, Hausa, Igbo, Yoruba, and English. It uses the architecture of GPT-J and is pre-trained on a larger dataset.

However, the model in this repository uses the GPT-2 architecture and is pre-trained on Hausa, Yoruba, Igbo, and English datasets.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
readme.md		readme.md
sabiyarn_with_huggingface_library.py		sabiyarn_with_huggingface_library.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Pretraining a foundational model after SabiYarn

About

Releases

Packages

Languages

Yanmi01/mini-SabiYarn

Folders and files

Latest commit

History

Repository files navigation

Pretraining a foundational model after SabiYarn

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages