From 95e3103f88f48529ee751c143975df558252721a Mon Sep 17 00:00:00 2001 From: Ichimaru Gin <35499653+1chimaruGin@users.noreply.github.com> Date: Tue, 30 Apr 2024 14:15:14 +0700 Subject: [PATCH] Burmese Language dict and corpus (#12020) * updated bm_dict * ppocr/utils/dict/README.md added * minor fix --------- Co-authored-by: Zhang Jun --- ppocr/utils/dict/README.md | 5 ++ ppocr/utils/dict/bm_dict.txt | 160 +++++++++++++++++++++++++++++++++++ 2 files changed, 165 insertions(+) create mode 100644 ppocr/utils/dict/README.md create mode 100644 ppocr/utils/dict/bm_dict.txt diff --git a/ppocr/utils/dict/README.md b/ppocr/utils/dict/README.md new file mode 100644 index 0000000000..a552445dc8 --- /dev/null +++ b/ppocr/utils/dict/README.md @@ -0,0 +1,5 @@ +## Dictionary and Corpus + +Dictionary files (usually character level vocabulary) are included here for easier configuration. Corpus contributed by OSS contirbutors are listed here, please respect copyrights when using them at your own risk. + +- Burmese corpus: https://github.com/1chimaruGin/BurmeseCorpus diff --git a/ppocr/utils/dict/bm_dict.txt b/ppocr/utils/dict/bm_dict.txt new file mode 100644 index 0000000000..bd68de9354 --- /dev/null +++ b/ppocr/utils/dict/bm_dict.txt @@ -0,0 +1,160 @@ +က +ခ +ဂ +ဃ +င +စ +ဆ +ဇ +ဈ +ဉ +ည +ဋ +ဌ +ဍ +ဎ +ဏ +တ +ထ +ဒ +ဓ +န +ပ +ဖ +ဗ +ဘ +မ +ယ +ရ +လ +ဝ +သ +ဟ +ဠ +အ +ဢ +ဣ +ဤ +ဥ +ဦ +ဧ +ဨ +ဩ +ဪ +ါ +ာ +ိ +ီ +ု +ူ +ေ +ဲ +ဳ +ဴ +ဵ +ံ +့ +း +္ +် +ျ +ြ +ွ +ှ +ဿ +၀ +၁ +၂ +၃ +၄ +၅ +၆ +၇ +၈ +၉ +၊ +။ +၌ +၍ +၎ +၏ +ၐ +ၑ +ၒ +ၓ +ၔ +ၕ +ၖ +ၗ +ၘ +ၙ +ၚ +ၛ +ၜ +ၝ +ၞ +ၟ +ၠ +ၡ +ၢ +ၣ +ၤ +ၥ +ၦ +ၧ +ၨ +ၩ +ၪ +ၫ +ၬ +ၭ +ၮ +ၯ +ၰ +ၱ +ၲ +ၳ +ၴ +ၵ +ၶ +ၷ +ၸ +ၹ +ၺ +ၻ +ၼ +ၽ +ၾ +ၿ +ႀ +ႁ +ႂ +ႃ +ႄ +ႅ +ႆ +ႇ +ႈ +ႉ +ႊ +ႋ +ႌ +ႍ +ႎ +ႏ +႐ +႑ +႒ +႓ +႔ +႕ +႖ +႗ +႘ +႙ +ႚ +ႛ +ႜ +ႝ +႞ +႟