Towards a framework combining machine ethics and machine explainability

Abstract

We find ourselves surrounded by a rapidly increasing number of autonomous and semi-autonomous systems. Two grand challenges arise from this development: Machine Ethics and Machine Explainability. Machine Ethics, on the one hand, is concerned with behavioral constraints for systems, so that morally acceptable, restricted behavior results; Machine Explainability, on the other hand, enables systems to explain their actions and argue for their decisions in a way that human users can understand, so that they can justifiably trust these systems. In this paper, we motivate and work towards a framework combining Machine Ethics and Machine Explainability. Starting from a toy example, we identify various desiderata of such a framework and argue why they should, and how they could, be incorporated in autonomous systems. Our main idea is to apply a framework of formal argumentation theory both for decision-making under ethically motivated constraints and for the task of generating useful explanations based on these constraints given only limited knowledge of the world. The result of our deliberations can be described as a first version of an ethically motivated, principle-governed framework combining Machine Ethics and Machine Explainability.

Cite

Baum, K., Hermanns, H., & Speith, T. (2019). Towards a framework combining machine ethics and machine explainability. In Electronic Proceedings in Theoretical Computer Science, EPTCS (Vol. 286, pp. 34–49). Open Publishing Association. https://doi.org/10.4204/EPTCS.286.4
