Epistemic defenses against scientific and empirical adversarial AI attacks
Files
Publication date
2021
Editors
Advisors
Supervisors
DOI
Document Type
/dk/atira/pure/researchoutput/researchoutputtypes/contributiontojournal/conferencearticle
Metadata
Show full item recordCollections
License
cc_by
Abstract
In this paper, we introduce “scientific and empirical adversarial AI attacks” (SEA AI attacks) as umbrella term for not yet prevalent but technically feasible deliberate malicious acts of specifically crafting AI-generated samples to achieve an epistemic distortion in (applied) science or engineering contexts. In view of possible socio-psycho-technological impacts, it seems responsible to ponder countermeasures from the onset on and not in hindsight. In this vein, we consider two illustrative use cases: the example of AI-produced data to mislead security engineering practices and the conceivable prospect of AI-generated contents to manipulate scientific writing processes. Firstly, we contextualize the epistemic challenges that such future SEA AI attacks could pose to society in the light of broader i.a. AI safety, AI ethics and cybersecurity-relevant efforts. Secondly, we set forth a corresponding supportive generic epistemic defense approach. Thirdly, we effect a threat modelling for the two use cases and propose tailor-made defenses based on the foregoing generic deliberations. Strikingly, our transdisciplinary analysis suggests that employing distinct explanation-anchored, trust-disentangled and adversarial strategies is one possible principled complementary epistemic defense against SEA AI attacks - albeit with caveats yielding incentives for future work.
Keywords
General Computer Science
Citation
Aliman, N M & Kester, L 2021, 'Epistemic defenses against scientific and empirical adversarial AI attacks', CEUR Workshop Proceedings, vol. 2916. < http://ceur-ws.org/Vol-2916/ >