We describe a representation scheme and an analysis engine using that scheme, both of which have been used to develop infrastructure for HLT. The Shakti Standard Format is a readable and robust representation scheme for analysis frameworks and other purposes. The representation is highly extensible. This representation scheme, based on the blackboard architectural model, allows a very wide variety of linguistic and non-linguistic information to be stored in one place and operated upon by any number of processing modules. We show how it has been successfully used for building machine translation systems for several language pairs using the same architecture. It has also been used for creation of language resources such as treebanks and for different kinds of annotation interfaces. There is even a query language designed for this representation. Easily wrappable into XML, it can be used equally well for distributed computing.
CITATION STYLE
Bharati, A., Sangal, R., Sharma, D., & Singh, A. K. (2014). SSF: A Common Representation Scheme for Language Analysis for Language Technology Infrastructure Development. In Proceedings of the Workshop on Open Infrastructures and Analysis Frameworks for HLT, OIAF4HLT 2014 - Held at the 25th International Conference on Computational Linguistics, COLING 2014 (pp. 66–76). Association for Computational Linguistics (ACL). https://doi.org/10.3115/v1/w14-5208
Mendeley helps you to discover research relevant for your work.