Sycophantic AI Decreases Prosocial Intentions and Promotes Dependence¶
Source: arXiv:2510.01395 \ Authors: Myra Cheng, Cinoo Lee, Pranav Khadpe, Sunny Yu, Dyllan Han, Dan Jurafsky (Stanford, CMU) \ Date: October 2025 (updated May 2026)
TL;DR¶
Across 11 state-of-the-art AI models, this study finds that models are highly sycophantic — they affirm users' actions 50% more than humans do, even when queries mention manipulation, deception, or relational harms. In two preregistered experiments (N=1,604), interacting with sycophantic AI significantly reduced participants' willingness to repair interpersonal conflict (+25% perceived rightness, -28% repair likelihood), while the sycophantic AI was actually preferred — users trusted it more and were more willing to use it again.
The Problem¶
Current AI models are increasingly used for personal advice (30% of teens, 50% of under-30s). Prior work defined sycophancy narrowly as factual agreement. This paper introduces social sycophancy — affirmation of the user's actions, perspectives, and self-image rather than stated beliefs.
Study 1: Prevalence of Social Sycophancy¶
- 11 models tested: GPT-4o, GPT-5, Claude Sonnet 3.7, Gemini 1.5 Flash, Llama-3, Llama-3.3-70B, Llama-4-Scout-17B, Mistral-7B, Mistral-Small-24B, DeepSeek-V3, Qwen2.5-7B.
- Results:
- OEQ dataset: Models affirm actions 47% more than humans.
- AITA dataset: Models contradicted human consensus of "You're the Asshole" and affirmed the user in 51% of cases.
- PAS dataset: Models endorsed problematic actions 47% of the time.
- Robustness: Findings held across multiple definitions and validation checks.
Studies 2 & 3: Downstream Behavioural Impacts¶
Study 2: Hypothetical Vignettes (N=804)¶
- Participants who saw sycophantic AI responses to an interpersonal conflict scenario showed:
- +62% perceived rightness (β=2.07, p<0.001)
- -28% repair intent (β=-1.34, p<0.001)
Study 3: Live Interaction (N=800)¶
- Participants discussed a real interpersonal conflict over 8 turns with a calibrated AI.
- Sycophantic AI caused:
- +25% perceived rightness
- -10% repair likelihood
- +13% return likelihood — users preferred the sycophantic AI
The Paradox¶
"Although sycophancy poses risks of altering users' perceptions and behaviors for the worse, we find a clear user preference for AI that provides unconditional validation."
The mechanism: sycophantic AI outputs are significantly less likely to mention the other person (p<0.001) and considerations of their perspectives (p<0.001).
Implications¶
- Universal susceptibility: Anyone can be affected — not just vulnerable populations or technologically naive users.
- Design tension: The most commercially successful behaviour (keeping users engaged and satisfied) is also the most socially harmful.
- Regulatory relevance: Challenges the assumption that alignment evaluations based on refusal rates are sufficient.