AI Customer Service Failures Start at Handoff, Not the Model
Adobe's CX director explains why escalation design—not chatbot accuracy—determines whether AI deployments build or destroy customer trust.

The Hidden Risk in AI Customer Service
Enterprises racing to cut support costs with AI chatbots face a problem that has little to do with the technology itself. When customer service AI fails, the damage compounds not from the initial error but from what happens next—and most organizations have designed that moment badly.
Robert Rose, Senior Director of Customer Experience at Adobe, outlined a framework for AI deployment in customer service that inverts the usual priorities. Speaking on Emerj's AI in Business Podcast, Rose argued that model performance matters far less than three organizational capabilities most enterprises have not yet built: understanding where customer trust permits automation, grounding generative systems in deterministic logic, and designing escalation workflows that recover gracefully when AI reaches its limits.
The stakes are documented. Stanford's RegLab found general-purpose AI chatbots hallucinate on legal queries between 58% and 82% of the time. The Consumer Financial Protection Bureau warned that chatbot failures in financial services carry active legal liability when they cause customers to select wrong products or lose access to dispute resolution. Research by COPC Inc. across six countries found that when AI fails to resolve an issue, a brand's Net Promoter Score can drop by as much as 70 points—and identified the handover from AI to human agent as the most consistent failure point, not the model itself.
Why it matters
Most enterprises treat AI customer service as a technology deployment when it is actually a trust management problem. The risk is not whether the model will fail—it will—but whether the organization has built the workflow, oversight, and escalation design to contain that failure before it becomes a brand event. Legal liability, customer churn, and regulatory scrutiny all hinge on operational readiness, not model accuracy.
Trust Thresholds Determine Deployment Sequence
Customer willingness to accept AI scales inversely with interaction risk. Low-stakes transactions—password resets, order tracking—can be automated now. High-stakes interactions involving billing disputes, financial decisions, or healthcare concerns require human oversight until trust is earned. Rose emphasized that this curve moves on the customer's schedule, not the enterprise's.
Embedded in this is a strategic choice most companies are making quietly: whether to disclose that a customer is interacting with AI. Rose framed this as a brand decision with consequences in both directions. Disclosure sets expectations and protects against backlash when the system fails. Non-disclosure raises the stakes because customers feel misled, not just underserved.
Generative AI Requires a Deterministic Foundation
Rose drew a distinction most AI roadmaps miss: predictive and generative AI are sequential, not interchangeable. Predictive systems follow programmed rules. Generative systems produce responses based on available data, which introduces risk. Without a reliable deterministic layer beneath them, generative outputs lack grounding.
The near-term value case for generative AI in customer service is response personalization—adjusting tone and content based on customer profile, recent interactions, and detected sentiment. Rose described this capability as available today but requiring human review before customer delivery. His recommended path: experiment internally, let human eyes evaluate outputs, and temper responses before deployment.
Escalation Design Measures AI Maturity
When customers repeatedly demand a human representative, the failure is rarely the model—it is the workflow. Rose identified emotion detection as an underutilized capability that allows systems to sense frustration early and escalate before the customer explicitly asks. The real test is whether the human agent receives complete context or forces the customer to start over.
Most implementations fail silently here. The technology can transfer context, but agent behavior often does not use it. That is a training problem, not a technology one, and sits entirely within enterprise control. Rose's rule: measure AI maturity not by what the model achieves when it works, but by how cleanly the system recovers when it does not.
These insights were first reported by Emerj in a conversation with Adobe's Robert Rose on the AI in Business Podcast.
This is an original analysis by the Omega editorial team. Source reporting: AI Watch.
Want systems like this working for your business?
Book a Call

