Using sliding-window rule application and extraction filtering techniques, we propose a framework for extracting semantic frames from Thai textual phrases with unknown boundaries based on patterns of triggering terms. A supervised rule learning algorithm is used for constructing multi-slot extraction rules from hand-tagged training phrases. A filtering module is introduced for predicting rule application across phrase boundaries based on instantiation features of rule internal wildcards. The framework is applied to text documents in three domains with different target-phrase density and average lengths. The experimental results show that the filtering module improves precision and preserves high recall satisfactorily, yielding extraction performance comparable to frame extraction with manually identified phrase boundaries. © Springer-Verlag Berlin Heidelberg 2009.
CITATION STYLE
Intarapaiboon, P., Nantajeewarawat, E., & Theeramunkong, T. (2009). Information extraction from thai text with unknown phrase boundaries. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5476 LNAI, pp. 525–532). https://doi.org/10.1007/978-3-642-01307-2_49
Mendeley helps you to discover research relevant for your work.