Skip to content

Conversation

@mmcky
Copy link
Member

@mmcky mmcky commented Oct 20, 2025

Summary

This PR adds three high-priority languages with 540M+ combined speakers, significantly expanding sphinx-proof's reach in South Asian and Southeast Asian academic communities.

New Languages Added

Language Code Speakers Significance
Bengali bn 230M+ 7th most spoken language worldwide
Urdu ur 230M+ Official language of Pakistan
Malay ms 80M Official in Malaysia and Brunei (distinct from Indonesian)

Complete Coverage

All 15 directive types now have complete translations in these three languages:

  • Algorithm
  • Assumption
  • Axiom
  • Conjecture
  • Corollary
  • Criterion
  • Definition
  • Example
  • Lemma
  • Notation
  • Observation
  • Property
  • Proposition
  • Remark
  • Theorem

Example Translations

Theorem:

  • Bengali: উপপাদ্য
  • Urdu: نظریہ
  • Malay: Teorem

Definition:

  • Bengali: সংজ্ঞা
  • Urdu: تعریف
  • Malay: Definisi

Files Changed

  • Modified all 15 JSON translation files
  • Generated locale files for bn, ur, and ms
  • All translations maintain alphabetical ordering

Impact

📊 Total language support: 33 languages (English + 32 translations)

🌏 Major geographic expansion:

  • South Asia: Added significant coverage for Bangladesh and Pakistan
  • Southeast Asia: Complemented Indonesian with Malay variant

Testing

Locale files generated successfully with proper encoding (UTF-8) for Bengali (Bangla script), Urdu (Perso-Arabic script), and Malay (Latin script).

Added three high-priority languages with 540M+ combined speakers:

- Bengali (bn) - 230M+ speakers, 7th most spoken language
- Urdu (ur) - 230M+ speakers, official language of Pakistan
- Malay (ms) - 80M speakers, official in Malaysia/Brunei

All 15 directive types now have complete translations:
- Algorithm (অ্যালগরিদম / الگورتھم / Algoritma)
- Assumption (অনুমান / مفروضہ / Andaian)
- Axiom (স্বতঃসিদ্ধ / مسلمہ / Aksiom)
- Conjecture (অনুমিত সিদ্ধান্ত / قیاس / Konjektur)
- Corollary (অনুসিদ্ধান্ত / نتیجہ / Akibat)
- Criterion (মানদণ্ড / معیار / Kriteria)
- Definition (সংজ্ঞা / تعریف / Definisi)
- Example (উদাহরণ / مثال / Contoh)
- Lemma (লেমা / لیما / Lema)
- Notation (চিহ্ন / علامت / Notasi)
- Observation (পর্যবেক্ষণ / مشاہدہ / Pemerhatian)
- Property (বৈশিষ্ট্য / خاصیت / Sifat)
- Proposition (প্রস্তাবনা / مقدمہ / Proposisi)
- Remark (মন্তব্য / تبصرہ / Catatan)
- Theorem (উপপাদ্য / نظریہ / Teorem)

Generated locale files:
- sphinx_proof/translations/locales/bn/ (Bengali)
- sphinx_proof/translations/locales/ur/ (Urdu)
- sphinx_proof/translations/locales/ms/ (Malay)

Total language support: 33 languages (English + 32 translations)

This significantly expands reach in South Asian and Southeast Asian
academic communities.
@mmcky mmcky merged commit 7459987 into main Oct 20, 2025
7 checks passed
@mmcky mmcky deleted the add-bengali-urdu-malay-translations branch October 20, 2025 03:02
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants