Abstract
Testing is a promising way to gain trust in a learned action policy €, in particular if πis a neural network. A "bug"in this context constitutes undesirable or fatal policy behavior, e.g., satisfying a failure condition. But how do we distinguish whether such behavior is due to bad policy decisions, or whether it is actually unavoidable under the given circumstances? This requires knowledge about optimal solutions, which defeats the scalability of testing. Related problems occur in software testing when the correct program output is not known. Metamorphic testing addresses this issue through metamorphic relations, specifying how a given change to the input should affect the output, thus providing an oracle for the correct output. Yet, how do we obtain such metamorphic relations for action policies? Here, we show that the well explored concept of relaxations in the Artificial Intelligence community can serve this purpose. In particular, if state s′ is a relaxation of state s, i.e., s′ is easier to solve than s, and πfails on easier s′ but does not fail on harder s, then we know that πcontains a bug manifested on s′. We contribute the first exploration of this idea in the context of failure testing of neural network policies πlearned by reinforcement learning in simulated environments. We design fuzzing strategies for test-case generation as well as metamorphic oracles leveraging simple, manually designed relaxations. In experiments on three single-Agent games, our technology is able to effectively identify true bugs, i.e., avoidable failures of €, which has not been possible until now.
Author supplied keywords
Cite
CITATION STYLE
Eniser, H. F., Gros, T. P., Wüstholz, V., Hoffmann, J., & Christakis, M. (2022). Metamorphic relations via relaxations: An approach to obtain oracles for action-policy testing. In ISSTA 2022 - Proceedings of the 31st ACM SIGSOFT International Symposium on Software Testing and Analysis (pp. 52–63). Association for Computing Machinery, Inc. https://doi.org/10.1145/3533767.3534392
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.