TMT — Translation Management Tool
TMT is a high-performance command-line utility for translating structured documents while preserving their original formatting and metadata. It supports CSV, TSV, DOCX, and PDF formats with first-class handling of complex scripts such as Devanagari.
Translation is powered by the TMT API from Kathmandu University’s ILPRL lab; the API supports translation between English, Nepali, and Tamang.
Supported Formats
| Format | Extension | Notes |
|---|---|---|
| Comma-separated values | .csv | All cells translated |
| Tab-separated values | .tsv | All cells translated |
| Word document | .docx | Paragraph-level translation |
| PDF document | .pdf | Requires pdf feature flag; needs a font for Devanagari rendering |
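To illustrate what "All cells translated" means for CSV and TSV input, here is a minimal Python sketch. It is not the tool's own code: the function name `translate_delimited` and the `translate` callback are hypothetical, and the callback stands in for a call to the translation API. The sketch assumes that empty cells are passed through unchanged.

```python
import csv
import io
from typing import Callable


def translate_delimited(text: str, delimiter: str,
                        translate: Callable[[str], str]) -> str:
    """Translate every non-empty cell while keeping the row/column layout.

    `translate` is a hypothetical stand-in for the real translation call.
    """
    reader = csv.reader(io.StringIO(text, newline=""), delimiter=delimiter)
    out = io.StringIO(newline="")
    writer = csv.writer(out, delimiter=delimiter, lineterminator="\n")
    for row in reader:
        # Assumption: blank cells are left as-is rather than sent for translation.
        writer.writerow([translate(cell) if cell.strip() else cell
                         for cell in row])
    return out.getvalue()


if __name__ == "__main__":
    # str.upper plays the role of the translator so the sketch runs offline.
    print(translate_delimited("hello,world\nfoo,\n", ",", str.upper))
```

Using the `csv` module rather than splitting on the delimiter keeps quoted cells that contain commas or tabs intact on both the read and the write side.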
Supported Languages
| Code | Language |
|---|---|
| en | English |
| ne | Nepali |
| tmg | Tamang |
Design Philosophy
The tool follows a strict Parse → Validate → Execute lifecycle:
- CLI arguments are parsed and validated into a RuntimeConfig.
- The validated config is used to dispatch to a format-specific handler.
- Each handler extracts text, delegates translation to the TranslationService, and reconstructs the output document.
This separation keeps format concerns isolated from network and rate-limiting logic, making each layer independently testable and extensible.
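The lifecycle can be sketched roughly as follows. This is an illustrative Python outline, not the project's implementation: only the names RuntimeConfig and TranslationService come from the description above, while the `--source`/`--target` flags, the handler functions, and the fake translation output are all assumptions made for the example.

```python
import argparse
from dataclasses import dataclass
from pathlib import Path
from typing import Callable, Dict, List

SUPPORTED_LANGS = {"en", "ne", "tmg"}
SUPPORTED_EXTS = {".csv", ".tsv", ".docx", ".pdf"}


@dataclass(frozen=True)
class RuntimeConfig:
    input_path: Path
    source: str
    target: str


def parse_and_validate(argv: List[str]) -> RuntimeConfig:
    """Parse, then validate. The flag names here are hypothetical."""
    parser = argparse.ArgumentParser(prog="tmt-sketch")
    parser.add_argument("input")
    parser.add_argument("--source", required=True)
    parser.add_argument("--target", required=True)
    args = parser.parse_args(argv)

    path = Path(args.input)
    if path.suffix.lower() not in SUPPORTED_EXTS:
        raise ValueError(f"unsupported format: {path.suffix}")
    for code in (args.source, args.target):
        if code not in SUPPORTED_LANGS:
            raise ValueError(f"unsupported language code: {code}")
    if args.source == args.target:
        raise ValueError("source and target languages must differ")
    return RuntimeConfig(path, args.source, args.target)


class TranslationService:
    """Stand-in for the network layer; a real one would call the API
    and apply rate limiting. Here it only tags the text."""

    def translate(self, text: str, source: str, target: str) -> str:
        return f"[{source}->{target}] {text}"


def handle_delimited(content: str, config: RuntimeConfig,
                     service: TranslationService) -> str:
    # Naive split for brevity; it ignores quoting rules.
    sep = "\t" if config.input_path.suffix.lower() == ".tsv" else ","
    lines = []
    for line in content.splitlines():
        cells = [service.translate(c, config.source, config.target)
                 for c in line.split(sep)]
        lines.append(sep.join(cells))
    return "\n".join(lines)


Handler = Callable[[str, RuntimeConfig, TranslationService], str]
HANDLERS: Dict[str, Handler] = {".csv": handle_delimited,
                                ".tsv": handle_delimited}


def execute(config: RuntimeConfig, content: str,
            service: TranslationService) -> str:
    """Dispatch to the handler registered for the input's extension."""
    handler = HANDLERS.get(config.input_path.suffix.lower())
    if handler is None:
        raise NotImplementedError(f"no handler in this sketch for "
                                  f"{config.input_path.suffix}")
    return handler(content, config, service)


if __name__ == "__main__":
    cfg = parse_and_validate(["notes.csv", "--source", "en", "--target", "ne"])
    print(execute(cfg, "hello,world", TranslationService()))
```

Because handlers receive the service as an argument, a test can pass in a stub like the one above and exercise format logic without any network access.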
Source Repository
The source code is available at github.com/razzat008/tmt-hackathon.
Created by github/razzat008.