We present the new THOT toolkit for fully-automatic and interactive statistical machine translation (SMT). Initial public versions of THOT date back to 2005 and did only include estimation of phrase-based models. By contrast, the new version offers several new features that had not been previously incorporated. The key innovations provided by the toolkit are computer-aided translation, including post-editing and interactive SMT, incremental learning and robust generation of alignments at phrase level. In addition to this, the toolkit also provides standard SMT features such as fully-automatic translation, scalable and parallel algorithms for model training, client-server implementation of the translation functionality, etc. The toolkit can be compiled in Unix-like and Windows platforms and it is released under the GNU Lesser General Public License (LGPL).
Mendeley helps you to discover research relevant for your work.
CITATION STYLE
Ortiz-Martínez, D., & Casacuberta, F. (2014). The New THOT Toolkit for Fully-Automatic and Interactive Statistical Machine Translation. In EACL 2014 - Proceedings of the Demonstrations at the 14th Conference of the European Chapter of the Association for Computational Linguistics (pp. 45–48). Association for Computational Linguistics (ACL). https://doi.org/10.3115/v1/e14-2012