Error-Correction for AI Safety

Aliman, Nadisha Marie; Elands, Pieter; Hürst, Wolfgang; Kester, Leon; Thórisson, Kristinn R.; Werkhoven, Peter; Yampolskiy, Roman; Ziesche, Soenke

doi:https://doi.org/10.1007/978-3-030-52152-3_2

Error-Correction for AI Safety

Files

Aliman2020_Chapter_Error_CorrectionForAISafety.pdf (775 KB)

Publication date

2020-01-01

Authors

Aliman, Nadisha Marie

Elands, Pieter

Hürst, Wolfgang

Kester, Leon

Thórisson, Kristinn R.

Werkhoven, Peter

Yampolskiy, Roman

Ziesche, Soenke

Editors

Goertzel, Ben

Potapov, Alexey

Panov, Aleksandr I.

Yampolskiy, Roman

DOI

https://doi.org/10.1007/978-3-030-52152-3_2

Document Type

Part of book

Metadata

Show full item record

Collections

Utrecht University Repository

License

taverne

Abstract

The complex socio-technological debate underlying safety-critical and ethically relevant issues pertaining to AI development and deployment extends across heterogeneous research subfields and involves in part conflicting positions. In this context, it seems expedient to generate a minimalistic joint transdisciplinary basis disambiguating the references to specific subtypes of AI properties and risks for an error-correction in the transmission of ideas. In this paper, we introduce a high-level transdisciplinary system clustering of ethical distinction between antithetical clusters of Type I and Type II systems which extends a cybersecurity-oriented AI safety taxonomy with considerations from psychology. Moreover, we review relevant Type I AI risks, reflect upon possible epistemological origins of hypothetical Type II AI from a cognitive sciences perspective and discuss the related human moral perception. Strikingly, our nuanced transdisciplinary analysis yields the figurative formulation of the so-called AI safety paradox identifying AI control and value alignment as conjugate requirements in AI safety. Against this backdrop, we craft versatile multidisciplinary recommendations with ethical dimensions tailored to Type II AI safety. Overall, we suggest proactive and importantly corrective instead of prohibitive methods as common basis for both Type I and Type II AI safety.

Keywords

AI ethics, AI safety paradox, Error-correction, Taverne, Theoretical Computer Science, General Computer Science

Citation

Aliman, N M, Elands, P, Hürst, W, Kester, L, Thórisson, K R, Werkhoven, P, Yampolskiy, R & Ziesche, S 2020, Error-Correction for AI Safety. in B Goertzel, A Potapov, A I Panov & R Yampolskiy (eds), Artificial General Intelligence : 13th International Conference, AGI 2020, St. Petersburg, Russia, September 16–19, 2020, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 12177 LNAI, Springer, pp. 12-22, 13th International Conference on Artificial General Intelligence, AGI 2020, St. Petersburg, Russian Federation, 16/09/20. https://doi.org/10.1007/978-3-030-52152-3_2, conference

URI

https://dspace.library.uu.nl/handle/1874/414626

Error-Correction for AI Safety

Files

Publication date

Authors

Editors

Advisors

Supervisors

DOI

Document Type

Metadata

Collections

License

Abstract

Keywords

Citation

URI