NFDI4DS | UHH-SEMS - Publication Details

Matija Franklin

ORCID: 0000-0003-1846-8907

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5030262115

Research Areas

Ethics and Social Impacts of AI
Explainable Artificial Intelligence (XAI)
Psychology of Moral and Emotional Judgment
Artificial Intelligence in Healthcare and Education
Autonomous Vehicle Technology and Safety
Law, AI, and Intellectual Property
Decision-Making and Behavioral Economics
Dementia and Cognitive Impairment Research
AI-based Problem Solving and Planning
Multi-Agent Systems and Negotiation
Human-Automation Interaction and Safety
Traffic and Road Safety
Blockchain Technology Applications and Security
Behavioral Health and Interventions
Neuroethics, Human Enhancement, Biomedical Innovations
Reinforcement Learning in Robotics
Free Will and Agency
Death Anxiety and Social Exclusion
Context-Aware Activity Recognition Systems
Climate Change Communication and Perception
Bayesian Modeling and Causal Inference
Social Robot Interaction and HRI
Technology Use by Older Adults
Legal and Policy Issues
Adversarial Robustness in Machine Learning

University College London
2020-2024

Cambridge Cognition (United Kingdom)
2023

Public Health England
2022

The Ethics of Advanced AI Assistants

OPENALEX - Publications

Iason Gabriel Arianna Manzini Geoff Keeling Lisa Anne Hendricks Verena Rieser and 52 more

This paper focuses on the opportunities and ethical societal risks posed by advanced AI assistants. We define assistants as artificial agents with natural language interfaces, whose function is to plan execute sequences of actions behalf a user, across one or more domains, in line user's expectations. The starts considering technology itself, providing an overview assistants, their technical foundations potential range applications. It then explores questions around value alignment,...

10.48550/arxiv.2404.16244 preprint EN arXiv (Cornell University) 2024-04-24

Multi-Agent Risks from Advanced AI

OPENALEX - Publications

Lewis Hammond Alan Chan Jesse Clifton Jason Hoelscher-Obermaier Akbir Khan and 39 more

The rapid development of advanced AI agents and the imminent deployment many instances these will give rise to multi-agent systems unprecedented complexity. These pose novel under-explored risks. In this report, we provide a structured taxonomy risks by identifying three key failure modes (miscoordination, conflict, collusion) based on agents' incentives, as well seven risk factors (information asymmetries, network effects, selection pressures, destabilising dynamics, commitment problems,...

10.48550/arxiv.2502.14143 preprint EN arXiv (Cornell University) 2025-02-19

Blaming automated vehicles in difficult situations

OPENALEX - Publications

Matija Franklin Edmond Awad David A. Lagnado

Automated vehicles (AVs) have made huge strides toward large-scale deployment. Despite this progress, AVs continue to make mistakes, some resulting in death. Although mistakes are avoidable, others hard avoid even by highly skilled drivers. As these shape attitudes AVs, we need understand whether people differentiate between them. We ask the following two questions. When an AV makes a mistake, does perceived difficulty or novelty of situation predict blame attributed it? How that attribution...

10.1016/j.isci.2021.102252 article EN cc-by iScience 2021-03-01

A Proposal for a Definition of General Purpose Artificial Intelligence Systems

OPENALEX - Publications

Carlos Ignacio Gutierrez Anthony Aguirre Risto Uuk Claire Boine Matija Franklin

Abstract The European Union (EU) is in the middle of comprehensively regulating artificial intelligence (AI) through an effort known as AI Act. Within vast spectrum issues under Act’s aegis, treatment technologies classified general purpose systems (GPAIS) merits special consideration. Particularly, existing proposals to define GPAIS do not provide sufficient guidance distinguish these from those designed perform specific tasks, denominated fixed-purpose. Thus, our working paper has three...

10.1007/s44206-023-00068-w article EN cc-by Deleted Journal 2023-09-12

Missing Mechanisms of Manipulation in the EU AI Act

OPENALEX - Publications

Matija Franklin Hal Ashton Rebecca Gorman Stuart Armstrong

The European Union Artificial Intelligence (AI) Act proposes to ban AI systems that ”manipulate persons through subliminal techniques or exploit the fragility of vulnerable individuals, and could potentially harm manipulated individual third person”. This article takes perspective cognitive psychology analyze understand what algorithmic manipulation consists of, who individuals may be, is considered as harm. Subliminal are expanded with concepts from behavioral science study preference...

10.32473/flairs.v35i.130723 article EN cc-by-nc Proceedings of the ... International Florida Artificial Intelligence Research Society Conference 2022-05-04

Causal Framework of Artificial Autonomous Agent Responsibility

OPENALEX - Publications

Matija Franklin Hal Ashton Edmond Awad David A. Lagnado

Recent empirical work on people's attributions of responsibility toward artificial autonomous agents (such as Artificial Intelligence or robots) has delivered mixed findings. The conflicting results reflect differences in context, the roles AI and human agents, domain application. In this article, we outline a causal framework attribution which integrates these It outlines nine factors that influence - causality, role, knowledge, objective foreseeability, capability, intent, desire,...

10.1145/3514094.3534140 article EN 2022-07-26

AI Governance through Markets

OPENALEX - Publications

Philip Moreira Tomei Rakesh Jain Matija Franklin

This paper argues that market governance mechanisms should be considered a key approach in the of artificial intelligence (AI), alongside traditional regulatory frameworks. While current approaches have predominantly focused on regulation, we contend market-based offer effective incentives for responsible AI development. We examine four emerging vectors governance: insurance, auditing, procurement, and due diligence, demonstrating how these can affirm relationship between risk financial...

10.48550/arxiv.2501.17755 preprint EN arXiv (Cornell University) 2025-01-29

Model-Free RL Agents Demonstrate System 1-Like Intentionality

OPENALEX - Publications

Hal Ashton Matija Franklin

This paper argues that model-free reinforcement learning (RL) agents, while lacking explicit planning mechanisms, exhibit behaviours can be analogised to System 1 ("thinking fast") processes in human cognition. Unlike model-based RL which operate akin 2 slow") reasoning by leveraging internal representations for planning, agents react environmental stimuli without anticipatory modelling. We propose a novel framework linking the dichotomy of and distinction between RL. framing challenges...

10.48550/arxiv.2501.18299 preprint EN arXiv (Cornell University) 2025-01-30

Defense Against the Dark Prompts: Mitigating Best-of-N Jailbreaking with Prompt Evaluation

OPENALEX - Publications

Stuart Armstrong Matija Franklin Connor Stevens Rebecca Gorman

Recent work showed Best-of-N (BoN) jailbreaking using repeated use of random augmentations (such as capitalization, punctuation, etc) is effective against all major large language models (LLMs). We have found that $100\%$ the BoN paper's successful jailbreaks (confidence interval $[99.65\%, 100.00\%]$) and $99.8\%$ in our replication $[99.28\%, 99.98\%]$) were blocked with Defense Against The Dark Prompts (DATDP) method. DATDP algorithm works by repeatedly utilizing an evaluation LLM to...

10.48550/arxiv.2502.00580 preprint EN arXiv (Cornell University) 2025-02-01

Beyond Preferences in AI Alignment

OPENALEX - Publications

Tan Zhi‐Xuan Micah Carroll Matija Franklin Hal Ashton

Abstract The dominant practice of AI alignment assumes (1) that preferences are an adequate representation human values, (2) rationality can be understood in terms maximizing the satisfaction preferences, and (3) systems should aligned with one or more humans to ensure they behave safely accordance our values. Whether implicitly followed explicitly endorsed, these commitments constitute what we term a preferentist approach alignment. In this paper, characterize challenge approach, describing...

10.1007/s11098-024-02249-w article EN cc-by Philosophical Studies 2024-11-09

A Proposal for a Definition of General Purpose Artificial Intelligence Systems

OPENALEX - Publications

Carlos Ignacio Gutierrez Anthony Aguirre Risto Uuk Claire Boine Matija Franklin

The European Union (EU) is in the middle of comprehensively regulating artificial intelligence (AI) through an effort known as AI Act. Within vast spectrum issues under Act's aegis, treatment technologies classified general purpose systems (GPAIS) merits special consideration. Particularly, existing proposals to define GPAIS do not provide sufficient guidance distinguish these from those designed perform specific tasks, denominated fixed-purpose. Thus, our working paper has three objectives....

10.2139/ssrn.4238951 article EN SSRN Electronic Journal 2022-01-01

A Mechanism-Based Approach to Mitigating Harms from Persuasive Generative AI

OPENALEX - Publications

Seliem El-Sayed Canfer Akbulut Amanda McCroskery Geoff Keeling Zachary Kenton and 15 more

Recent generative AI systems have demonstrated more advanced persuasive capabilities and are increasingly permeating areas of life where they can influence decision-making. Generative presents a new risk profile persuasion due the opportunity for reciprocal exchange prolonged interactions. This has led to growing concerns about harms from how be mitigated, highlighting need systematic study persuasion. The current definitions unclear related insufficiently studied. Existing harm mitigation...

10.48550/arxiv.2404.15058 preprint EN arXiv (Cornell University) 2024-04-23

Are autonomous vehicles blamed differently?

OPENALEX - Publications

Darko Stojilović Matija Franklin Bertram F Malle Carlos Fernandez‐Basso Edmond Awad and 1 more

This study investigates how people assign blame to autonomous vehicles (AVs) when involved in an accident. Our experiment (N = 2647) revealed that placed more on AVs than human drivers accident details were unspecified. To examine whether assess major classes of blame-relevant information differently for and humans, we developed a causal model introduced novel concept prevention effort, which emerged as crucial factor or judgement alongside intentionality. Finally, addressed the “many hands”...

10.31234/osf.io/ncrsh preprint EN 2024-06-05

Boosting human competences with interpretable and explainable artificial intelligence.

OPENALEX - Publications

Stefan M. Herzog Matija Franklin

10.1037/dec0000250 article EN Decision 2024-10-01

Recognising the importance of preference change: A call for a coordinated multidisciplinary research effort in the age of AI

OPENALEX - Publications

Matija Franklin Hal Ashton Rebecca Gorman Stuart Armstrong

As artificial intelligence becomes more powerful and a ubiquitous presence in daily life, it is imperative to understand manage the impact of AI systems on our lives decisions. Modern ML often change user behavior (e.g. personalized recommender learn preferences deliver recommendations that online behavior). An externality preference change. This article argues for establishment multidisciplinary endeavor focused understanding how preference: Preference Science. We operationalize incorporate...

10.48550/arxiv.2203.10525 preprint EN cc-by arXiv (Cornell University) 2022-01-01

Designing Memory Aids for Dementia Patients using Earables

OPENALEX - Publications

Matija Franklin David A. Lagnado Chulhong Min Akhil Mathur Fahim Kawsar

Globally around 50 million people are currently living with dementia, and there nearly 10 new cases every year. The decline of memory and, it, lack self-confidence continuous confusion have a devastating effect on this disease. Dementia patients even struggle to accomplish mundane chores require assistance for daily social connectedness. Over the past decade, we seen remarkable growth in wearable technologies manage our health wellbeing improve awareness However, ask why wearables not...

10.1145/3460418.3479324 article EN 2021-09-21

Blaming Automated Vehicles in Difficult Situations

OPENALEX - Publications

Matija Franklin Edmond Awad David A. Lagnado

The third driverless car competition of the DARPA Grand Challenge (Urban Challenge) in 2007 saw six autonomous vehicle teams finishing event successfully. Since then, Automated Vehicles (AVs) made huge strides towards deployment on a large scale. Despite all this progress, AVs continue to make mistakes, some which have resulted deaths passengers and pedestrians. These crashes received wide coverage media drew parallel bleak picture public’s lack enthusiasm for technology. However, not...

10.2139/ssrn.3701256 article EN SSRN Electronic Journal 2020-01-01

Using text and charts to provide social norm feedback to general practices with high overall and high broad-spectrum antibiotic prescribing: a series of national randomised controlled trials

OPENALEX - Publications

Natalie Gold Anna Sallis Ayoub Saei Rohan Arambepola Robin Watson and 3 more

Abstract Background Sending a social norms feedback letter to general practitioners who are high prescribers of antibiotics has been shown reduce antibiotic prescribing. The 2017-9 Quality Premium for primary care in England sets target broad-spectrum prescribing, which should be at or below 10% total We tested norm that targeted prescribing and the addition chart text-only overall Methods conducted three 2-armed randomised controlled trials, on different groups practices: Trial A compared...

10.1186/s13063-022-06373-y article EN cc-by Trials 2022-06-18

The Corrupting Influence of AI as a Boss or Counterparty

OPENALEX - Publications

H A Ashton Matija Franklin

In a recent article K ̈obis, Bonnefon, and Rahwan (2021) propose framework to identify four different primary roles in which Artificial Intelligence (AI) cause unethical or corrupt human behaviour; namely - role model, delegate, partner, advisor. this we two further AI as boss counterparty. We argue that the exerts coercive power over its employees whilst perceptual abilities of an counterparty provide opportunity for humans behave differently towards them than they would with analogues....

10.2139/ssrn.4309643 article EN SSRN Electronic Journal 2022-01-01

Strengthening the EU AI Act: Defining Key Terms on AI Manipulation

OPENALEX - Publications

Matija Franklin Philip Moreira Tomei Rebecca Gorman

The European Union's Artificial Intelligence Act aims to regulate manipulative and harmful uses of AI, but lacks precise definitions for key concepts. This paper provides technical recommendations improve the Act's conceptual clarity enforceability. We review psychological models define "personality traits," arguing should protect full "psychometric profiles." urge expanding "behavior" include "preferences" since preferences causally influence are influenced by behavior. Clear provided...

10.48550/arxiv.2308.16364 preprint EN cc-by arXiv (Cornell University) 2023-01-01

An International Consortium for Evaluations of Societal-Scale Risks from Advanced AI

OPENALEX - Publications

Ross Gruetzemacher Alan Chan Kevin Frazier Christin Manning Štěpán Los and 8 more

Given rapid progress toward advanced AI and risks from frontier systems (advanced pushing the boundaries of capabilities frontier), creation implementation governance regulatory schemes deserves prioritization substantial investment. However, status quo is untenable and, frankly, dangerous. A gap has permitted labs to conduct research, development, deployment activities with minimal oversight. In response, system evaluations have been proposed as a way assessing development systems. Yet,...

10.48550/arxiv.2310.14455 preprint EN cc-by-nc-sa arXiv (Cornell University) 2023-01-01

Beyond Preferences in AI Alignment

OPENALEX - Publications

Tan Zhi‐Xuan Micah Carroll Matija Franklin Hal Ashton

The dominant practice of AI alignment assumes (1) that preferences are an adequate representation human values, (2) rationality can be understood in terms maximizing the satisfaction preferences, and (3) systems should aligned with one or more humans to ensure they behave safely accordance our values. Whether implicitly followed explicitly endorsed, these commitments constitute what we term a preferentist approach alignment. In this paper, characterize challenge approach, describing...

10.48550/arxiv.2408.16984 preprint EN arXiv (Cornell University) 2024-08-29

Coming Soon ...