Abstract
This paper describes a novel classification method for multi-stream conversational documents. Documents of contact center dialogues or meetings are often composed of multiple source documents that are transcriptions of the recordings of each speaker's channel. To enhance the classification performance of such multi-stream conversational documents, three main advances over the previous method are introduced. The first is a parallel hierarchical attention network (PHAN) for multi-stream conversational document modeling. PHAN can precisely capture word and sentence structures of individual source documents and efficiently integrate them. The second is a shared memory reader that can yield a shared attention mechanism. The shared memory reader highlights common important information in a conversation. Our experiments on a call category classification in contact center dialogues show that PHAN together with the shared memory reader outperforms the single document modeling method and previous multi-stream document modeling method.
Author supplied keywords
Cite
CITATION STYLE
Sawada, N., Masumura, R., & Nishizaki, H. (2017). Parallel hierarchical attention networks with shared memory reader for multi-stream conversational document classification. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (Vol. 2017-August, pp. 3311–3315). International Speech Communication Association. https://doi.org/10.21437/Interspeech.2017-259
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.