We propose a new metric, Voted Appropriateness, for automatically evaluating dialogue policy decisions once some wizard data has been collected. We show that this metric outperforms a previously proposed metric, Weak Agreement. We also present a taxonomy of dialogue model evaluation schemas and situate our new metric within this taxonomy.
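The abstract does not spell out how the two metrics are computed, but their names suggest vote-based scoring of a policy's chosen response against responses selected by human wizards for the same context. The sketch below is an illustrative guess, not the paper's definition: both function names, the fractional-vote scoring, and the any-match rule for Weak Agreement are assumptions.

```python
from collections import Counter

def voted_appropriateness(policy_choice: str, wizard_choices: list[str]) -> float:
    """Hypothetical sketch: score the policy's chosen response by the
    fraction of wizard votes that picked the same response for this context."""
    votes = Counter(wizard_choices)
    return votes[policy_choice] / len(wizard_choices)

def weak_agreement(policy_choice: str, wizard_choices: list[str]) -> float:
    """Hypothetical sketch: credit the policy if at least one wizard
    chose the same response, regardless of how many agreed."""
    return 1.0 if policy_choice in wizard_choices else 0.0
```

Under this reading, Voted Appropriateness is a graded score that rewards consensus among wizards, whereas Weak Agreement is binary and saturates as soon as a single wizard matches.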
Citation:
Gandhe, S., & Traum, D. (2016). A Semi-automated Evaluation Metric for Dialogue Model Coherence (pp. 217–225). https://doi.org/10.1007/978-3-319-21834-2_19