This paper addresses the extraction of contexts for keywords found in text, in particular in Automatic Speech Recognition (ASR) output. We propose using a syntactic parser to find contexts by analysing sentence structure, rather than taking a fixed window of several words to the left and right of the keyword, or the whole sentence. This method yields concise but meaningful contexts that are easy for humans to read and can also be used in applications such as thematic clustering. We describe SemSin, a system for Russian that combines a syntactic dependency parser with elements of a semantic ontology. We demonstrate the use of SemSin for our task on both ordinary text and recognition output, and outline suggestions for future development of the method.
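The contrast between a fixed word window and a structure-based context can be illustrated with a minimal sketch. This is not the SemSin system: the function names `window_context` and `dependency_context`, the toy English sentence, and the hand-written head indices are all assumptions made for illustration, and a real pipeline would obtain the heads from a dependency parser.

```python
from typing import List, Optional


def window_context(tokens: List[str], idx: int, size: int = 2) -> List[str]:
    """Baseline: up to `size` words on each side of the keyword."""
    lo = max(0, idx - size)
    return tokens[lo: idx + size + 1]


def dependency_context(
    tokens: List[str], heads: List[Optional[int]], idx: int
) -> List[str]:
    """Keyword plus its syntactic head and direct dependents, in sentence order.

    heads[i] is the index of the head of token i, or None for the root.
    """
    keep = {idx}
    if heads[idx] is not None:
        keep.add(heads[idx])
    keep.update(i for i, h in enumerate(heads) if h == idx)
    return [tokens[i] for i in sorted(keep)]


# Hypothetical parse, written by hand; "raised" is the root.
tokens = ["yesterday", "the", "bank", "raised", "interest", "rates", "again"]
heads = [3, 2, 3, None, 5, 3, 3]

keyword = tokens.index("rates")
print(window_context(tokens, keyword))             # words near the keyword
print(dependency_context(tokens, heads, keyword))  # words attached to it
```

In this toy case the window picks up the adverb "again" simply because it is adjacent, while the dependency-based context keeps only the words syntactically linked to the keyword. Which relations to follow, and how far, is a design choice that this sketch does not attempt to settle.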
Khomitsevich, O., Boyarsky, K., Kanevsky, E., Bulusheva, A., & Mendelev, V. (2017). Flexible context extraction for keywords in Russian automatic speech recognition results. In Communications in Computer and Information Science (Vol. 661, pp. 145–154). Springer Verlag. https://doi.org/10.1007/978-3-319-52920-2_14