    Repositories list

    • A collection of tools, datasets and resources on Bangla computing
      19950700 Updated Apr 29, 2024
    • .github

      Public
      0000 Updated Dec 19, 2023
    • A synthetic data generator for text recognition
      Python
      MIT License
      966000 Updated Dec 10, 2023
    • bondhon

      Public archive
      Bondhon, Bengali for "Connection", is a Python module for converting between popular modern and legacy Bengali character encodings.
      Python
      MIT License
      63430 Updated Oct 25, 2023
    • Bengali Soundex (Phonetic Similarity Algorithm) Implementation
      Python
      MIT License
      64200 Updated Jul 12, 2021
    • bengali-stemmer

      Public archive
      A library of implementations of published stemming methods for the Bengali language.
      Python
      MIT License
      143220 Updated Jul 12, 2021
    • A Python package to convert numbers to Bengali words.
      Python
      MIT License
      4520 Updated Jan 26, 2021
    • bengali-ner-data

      Public archive
      Annotated dataset for training an NER for Bengali
      Creative Commons Zero v1.0 Universal
      1300 Updated Sep 19, 2020
    • 62510 Updated Sep 15, 2020
    • Transliteration data for tasks between Bengali written in Roman and Bengali script
      Jupyter Notebook
      MIT License
      0400 Updated Jun 13, 2020
    • translit-rnn

      Public archive
      Automatic transliteration with LSTM
      Python
      20300 Updated Mar 10, 2020
    • A rule-based lemmatizer for Bengali / Bangla, written in Python. Under active development.
      Python
      52410 Updated Dec 28, 2019
    • Jupyter Notebook
      MIT License
      0600 Updated Dec 23, 2019
    • Simple Rule-Based Sentence Boundary Detection for Bengali
      Python
      0100 Updated Dec 23, 2019
    • Guidelines for existing and new contributors working with BanglaKit
      Apache License 2.0
      0200 Updated Nov 25, 2019
    • bondhon-docx

      Public archive
      Python module for converting Office Open XML (DOCX) files between legacy and modern Bengali character encodings.
      Python
      MIT License
      0200 Updated Mar 12, 2019
    • 1000 Updated Mar 11, 2019
    • Raw vocabulary, word-lists and related scripts
      Other
      1601 Updated Oct 15, 2018
    • A toolkit for compiling a corpus from various sources
      Python
      MIT License
      154410 Updated Aug 24, 2018
    • A library of functions in different programming languages for sorting according to the standard sorting order defined by Bangla Academy (বাংলা একাডেমী).
      Python
      MIT License
      21510 Updated Jul 30, 2018