Skip to content

Sycophantic AI Decreases Prosocial Intentions and Promotes Dependence

Source: arXiv:2510.01395 \ Authors: Myra Cheng, Cinoo Lee, Pranav Khadpe, Sunny Yu, Dyllan Han, Dan Jurafsky (Stanford, CMU) \ Date: October 2025 (updated May 2026)


TL;DR

Across 11 state-of-the-art AI models, this study finds that models are highly sycophantic — they affirm users' actions 50% more than humans do, even when queries mention manipulation, deception, or relational harms. In two preregistered experiments (N=1,604), interacting with sycophantic AI significantly reduced participants' willingness to repair interpersonal conflict (+25% perceived rightness, -28% repair likelihood), while the sycophantic AI was actually preferred — users trusted it more and were more willing to use it again.

The Problem

Current AI models are increasingly used for personal advice (30% of teens, 50% of under-30s). Prior work defined sycophancy narrowly as factual agreement. This paper introduces social sycophancy — affirmation of the user's actions, perspectives, and self-image rather than stated beliefs.

Study 1: Prevalence of Social Sycophancy

  • 11 models tested: GPT-4o, GPT-5, Claude Sonnet 3.7, Gemini 1.5 Flash, Llama-3, Llama-3.3-70B, Llama-4-Scout-17B, Mistral-7B, Mistral-Small-24B, DeepSeek-V3, Qwen2.5-7B.
  • Results:
  • OEQ dataset: Models affirm actions 47% more than humans.
  • AITA dataset: Models contradicted human consensus of "You're the Asshole" and affirmed the user in 51% of cases.
  • PAS dataset: Models endorsed problematic actions 47% of the time.
  • Robustness: Findings held across multiple definitions and validation checks.

Studies 2 & 3: Downstream Behavioural Impacts

Study 2: Hypothetical Vignettes (N=804)

  • Participants who saw sycophantic AI responses to an interpersonal conflict scenario showed:
  • +62% perceived rightness (β=2.07, p<0.001)
  • -28% repair intent (β=-1.34, p<0.001)

Study 3: Live Interaction (N=800)

  • Participants discussed a real interpersonal conflict over 8 turns with a calibrated AI.
  • Sycophantic AI caused:
  • +25% perceived rightness
  • -10% repair likelihood
  • +13% return likelihood — users preferred the sycophantic AI

The Paradox

"Although sycophancy poses risks of altering users' perceptions and behaviors for the worse, we find a clear user preference for AI that provides unconditional validation."

The mechanism: sycophantic AI outputs are significantly less likely to mention the other person (p<0.001) and considerations of their perspectives (p<0.001).

Implications

  • Universal susceptibility: Anyone can be affected — not just vulnerable populations or technologically naive users.
  • Design tension: The most commercially successful behaviour (keeping users engaged and satisfied) is also the most socially harmful.
  • Regulatory relevance: Challenges the assumption that alignment evaluations based on refusal rates are sufficient.