Dharmamitra: Open Tools for Translation and Digital Philology of Ancient Asian Languages
Accelerating research on Classical Asian languages with modern deep‑learning methods.
About Dharmamitra
Dharmamitra is a meta‑platform that bundles state‑of‑the‑art NLP, OCR, information‑retrieval, and intertextuality‑exploration components for anybody working with the Ancient Asian languages Sanskrit, Pāli, Classical Chinese, and Tibetan. All code in this organisation is released under permissive licenses, and we provide large datasets that are either in the public domain or available under Creative Commons licenses.
The user guides currently available on this website do not yet reflect the updated Dharmamitra user interface. We appreciate your patience while our team revises them.
News
- May 18, 2026: Dharmamitra: Buddhist Philology in the Age of AI
- May 16, 2026: Artificial Intelligence in Buddhist Studies
- May 11, 2026: Introducing ‘Segment View’, Expanded Datasets, and More!
- April 2026: Dharmamitra Platform Update – Introducing Explore
- April 2026: Dharmamitra and Buddha Nexus Ecosystems in the age of AI (Workshop)
- March 2026: OCR and Beyond Workshop at Tohoku University
- March 2026: Dharmamitra: A Platform to Support Research across Language Boundaries on Buddhist Textual Material
- March 2026: DharmaNexus as a Multilingual Graph of Buddhist Intertextuality: Design Choices, Research Uses, and Future Applications
- March 13, 2026: Is this the end of (Buddhist) philology as we know it? If so, what’s next?
- February 2026: Dharmamitra Board of Advisors
- January 2026: Dharmamitra: A data-driven platform for the research of Buddhist texts in multiple languages using advanced NLP methods
- January 2026: AI and Indological/Buddhological researches: Dharmamitra/Dharmanexus and its Application
- December 2025: Translation, OCR, and Semantic Retrieval: Current Status and Future Outlook of the Dharmamitra Ecosystem
- December 2025: Dharmamitra: A Platform that Makes Translation and Discovery of Buddhist Texts Possible Across Language Barriers
- December 2025: Building the Foundations of Buddhist Philology through Digital Humanities: Exploring the Potential of the Tohoku University Digital Archives (ToUDA)
- November 2025: From OCR via Machine Translation to Semantic Search: The Dharmamitra AI stack for Multilingual Buddhist Philology
- November 2025: Integration of Digital Dictionary of Buddhism
- October 2025: Integration of Christian Steinert's Tibetan-English-Sanskrit Dictionary
- October 2025: Dharmamitra Team Update
- October 2025: Buddhist Philology and AI
- September 2025: Dharmamitra and DharmaNexus presentation at the National Taiwan University
- August 2025: Dharmamitra & DharmaNexus: A New Set of Digital Tools for the Philological Study of Buddhist Texts
- August 2025: MITRA at IABS Conference, Leipzig
- July 2025: Announcing the launch of MITRA Search, MITRA Deep Research, and DharmaNexus
- June 2025: Deep Neural Embeddings at Hanmun Lab Workshop
- June 2025: From Sthiramati to Dharmamitra at Keio University
- March 2025: Machine Translation for Asian Studies Workshop
- March 2025: MITRA Search at CEAL Technology Forum
- 2025: MITRA-zh-eval Paper Published