Amazon Customer Service — AI by Design, Christopher Lien

AI · Amazon Customer Service · 2024

Amazon Customer Service — AI by Design Four AI tools. Built for every CS team. Each one earned through research.

Every day, millions of customers contact Amazon about problems, a package that never arrived, a charge they don't recognize, a device that stopped working. The people who help them are called Customer Service Associates, or CSAs. My team designed and owned the UX platform those associates work on, the Amazon Customer Care Center (AC3). When Amazon decided to invest in AI to make that work faster and more consistent, my team was the natural owner. That framework powers every business unit at Amazon CS: retail, devices, digital, shipping, and more. When leadership decided to invest in AI, my team was the natural owner, we had the relationships, the research, and the platform to make it real.

~78% faster resolution 80% automation rate 30 sec saved per contact 4 AI features shipped

Role

Sr. UX Design Manager

Timeline

2023–2025

Scope

Global · All Amazon CS markets

Before this story

The tool before AC3

A tool that made an already difficult job harder

Being a Customer Service Associate is hard. You're taking contact after contact from frustrated people — a package that didn't arrive, a charge they don't recognize, a device that stopped working. The emotional weight is real. Imagine getting the resolution wrong and a customer yells at you for it. Imagine taking too long because you're hunting through five different screens — and the customer loses patience. That's not an edge case. That was the daily experience.

The tool made it worse. Associates navigated a fragmented internal system mid-conversation — dense tabs of raw data, multiple disconnected screens, no clear path to resolution. Getting anything done required context-switching while someone was waiting on the other end. New hires needed weeks of training just to become functional. Attrition was high — not because the work was impossible, but because the tools made it feel that way.

Legacy Amazon CS tool — dense tab-based interface with raw data fields

The AC3 redesign

One platform. Built around the contact, not the database.

My team redesigned the entire associate experience from the ground up — bringing order history, customer context, policy access, and resolution tools into a single coherent workspace. No more hunting across screens. No more context-switching mid-conversation. AC3 covered every Amazon CS vertical: retail, devices, digital, shipping, and more.

The job didn't get easier. But the tool stopped getting in the way.

AC3 — redesigned contact center platform with unified workspace

↓ 35%

Reduction in average handle time after full rollout

Weeks → days

New associate training time, across 60% of contact volume

100%

Use case coverage across all associates and global marketplaces

Full AC3 case study ↗

THE PROBLEM

A faster tool. But still too much time lost in every contact.

Even with AC3, too much of every contact was still manual work. Associates were stopping mid-conversation to look up policies. Asking customers questions Amazon's own data could already answer. Navigating step by step to resolutions the system could already predict.

I knew this not from dashboards — but from being there.

How we found the problems

Two approaches. Both I had to build.

In-person site visits

North America, Europe, Asia

I had been running regular site visits throughout my time at Amazon — flying to customer service sites across three continents to watch associates handle real contacts. Not recordings. Not metrics. Being in the room, watching what actually happened when a customer called in frustrated and a timer was running.

What you see in the room doesn't show up in a dashboard. An associate quietly opening a second browser tab mid-conversation. The micro-pause before they respond — reading, not thinking. The apology before a hold they already know is coming. That's the signal.

How I operationalized this

I secured budget for regular travel and set formal goals around site visits — making staying connected to associates a tracked design leadership commitment, not something that happened when schedules allowed.

Virtual side-by-sides

Built with operational partners

Site visits only go so far — travel is expensive, scheduling is hard, and you're always limited to one site at a time. So I worked with my operational peers to stand up virtual side-by-sides: a program that let me and my team observe associates taking live contacts from anywhere in the world, with the same proximity as being in the room.

Between contacts, we'd talk to the associates directly. What slowed you down just now? What did you already know before you looked it up? What questions do you dread? That conversation — unprompted, between real contacts — is where the real problems surfaced.

How I operationalized this

This required partnership — getting operational leaders to allow live observation of contacts, setting up the technical infrastructure, and building trust with site managers globally. I owned that relationship. Without it, this kind of research doesn't happen.

What we found

Associates had no way to get policy answers without stopping the conversation

When a customer asked a policy question, the associate had to pause the contact, open a separate tool, search for the right document, read it, interpret it, and apply it. Every lookup was a gap in the conversation — and different associates interpreted the same policy differently, so customers got different answers depending on who picked up.

Customer Service Associate

"I'm spending half the contact on things the system should already know."

Addressed by CS Helper

Associates arrived at contacts with no context — and customers paid for it

When an associate picked up a contact, they had nothing. No summary of prior contacts, no knowledge of what had already been tried, no sense of why this customer was frustrated. They had to start from scratch — asking questions the customer had already answered, to a bot, to a previous agent, sometimes multiple times. For customers with recurring issues, every contact felt like the first one.

Customer

"Why do I have to tell you all of this again? I already explained it to the last person."

Addressed by Context Summarizer

When the bot hit its limits, the customer experience broke completely

Amazon's automated chat bot handled a large and growing share of contacts. But bots have limits — ambiguous situations, edge cases, anything requiring real judgment. When the bot couldn't resolve a contact, the only option was a full handoff to a live associate. No context carried over. The customer had to start over from the beginning. The efficiency gains of automation evaporated the moment they were needed most.

Customer

"I already went through all of this with the bot. Now I have to do it again?"

Addressed by Human-in-the-Loop

For routine contacts, associates navigated manually to outcomes the system could already predict

A large category of contacts — missing items, auto-renewal charges, late deliveries — followed entirely predictable patterns. Amazon already had the order data, the delivery status, and the applicable policy. But associates still had to navigate through the same step-by-step workflow for every one of them. Some customers had even learned which answers produced the outcome they wanted, feeding bad data into Amazon's defect tracking and making the real problems harder to find.

Customer Service Associate

"I already know what this resolution is. Why does it take so long to get there?"

Addressed by Predictive Resolution

HOW WE WORKED

Why my team was the right team for this.

My team's position was unusual. We owned the UX framework, the shared design layer and component system that every Amazon CS business unit builds on. If a designer at any CS vertical needed to ship a new feature, they used our patterns. That gave us something most product teams don't have: a clear view across all of them. That gave us something most product teams don't have: a clear view across all of them.

Designers on my team were embedded in retail, devices, digital, shipping, and Amazon Business, close enough to see the real problems their teams were facing. That network gave us a direct path to gather requirements, test early directions, and get signal from business leaders before we'd over-invested in any one approach. When themes emerged from our ongoing site visits, watching associates handle contacts, interviewing customers, we had the context to understand what we were actually seeing and the relationships to act on it quickly. My science counterpart was a close partner throughout. We watched prototype tests together, connected what we observed to what the models needed, and pushed the direction forward together. We set the vision. They helped us make it real.

Ongoing site visits and field research

Regular visits to customer service sites, watching associates handle real contacts, interviewing customers immediately after their experience. A practice I've kept throughout my career and track formally as a design leadership commitment.

Embedded designers across every vertical

Designers on my team sat inside retail, devices, shipping, digital, and Amazon Business. They surfaced the real problems. They helped test early concepts. And they made sure what we built actually worked for the teams they supported.

A close science partnership

My science counterpart was in the room when prototypes were tested. Research shaped what the models did. Model constraints shaped what we designed. It ran both ways, and that's what made it work.

DESIGN PRINCIPLES

The AI had to earn trust before it could ask for it.

Every initiative here lived in a high-stakes context. A wrong AI suggestion could mean the wrong resolution, the wrong policy applied, or a customer commitment Amazon couldn't honor. The easy path is to put a chat interface on everything and call it AI. But a chatbot is just a hammer, and not every problem is a nail. We took a different approach. The associate tool already had a structure associates trusted: every contact was identified, categorised, and resolved through a consistent set of steps and screens. Rather than bolt AI onto the side as a separate chat window, I directed the team to surface AI intelligence through those same familiar patterns. The AI makes a suggestion with its supporting evidence visible. The associate reviews it and decides. Nothing is hidden, nothing is forced, and the manual path is always one click away.

Show the reasoning, not just the answer

Every AI suggestion surfaced its evidence through the match and solve cards, the core UI patterns associates use to identify the customer's issue and take action on it. The AI shows its work using patterns the associate already knows. The AI shows its work. The associate decides.

The human always makes the call

No AI feature executed a high-stakes action without associate confirmation. On routine contacts, complex ones, everything in between, the system surfaces, the human decides.

The manual path is always there

An AI suggestion was never a gate. Dismiss it and the standard flow was right there. Trust couldn't be forced, so we made the cost of not trusting as low as possible.

Only surface what the system is sure about

We defined confidence thresholds with the science team for every feature. When the model wasn't confident enough, no suggestion appeared. A missing suggestion is better than a wrong one.

THE FEATURES

Four design initiatives. Each solving a real problem. All of them working together.

Each of these came from something real we observed, not a product roadmap looking for AI use cases. And because my team owned the framework, each one was built to be used by any team across any business unit, not just the team that asked for it first. They also work together: an associate using Predictive Resolution is better served if they already trust CS Helper. Each layer of AI capability made the next one easier to accept.

Solving problems 1 and 2

CS Helper

Give associates an answer, not a search result.

Research surfaced two problems happening simultaneously. Associates were leaving contacts mid-conversation to search documentation, breaking the human moment at exactly the wrong time. And because different associates interpreted policy differently, customers were getting inconsistent answers depending on who picked up.

Concept 1

Searchable documentation panel

We built a searchable policy repository into the AC3 workspace so associates could find policies without leaving the tool. The hypothesis: if the information is right there, associates will use it consistently.

Dale Cooper

Available

POLICY & CONTENT REPOSITORY

Search policies, procedures, FAQs…

SUBSCRIPTION POLICIES

Prime Video, Auto-renewal & free trial policy

Free trial auto-conversions are covered under CX-POL-2847. Customers contacting within 30 days who did not intend to subscribe qualify for a one-time exception refund.

CX-POL-2847 · Updated March 2024

Prime membership cancellation & refund

Refund eligibility depends on usage since last charge. Partial refunds available within 30 days with no Prime benefits used.

CX-POL-1104 · Updated Jan 2024

Goodwill credits, eligibility & limits

Tier 1 associates may offer up to $10. Tier 2 up to $25. Supervisor approval required for repeat contacts.

CX-POL-0891 · Updated Feb 2024

Escalation guidelines, billing disputes

CX-POL-0342 · Updated Dec 2023

MATCH

prime

video

Amazon Prime Video

$8.99/mo · auto-renewed Mar 14

Charged today

🍳

Cuisinart Food Processor

14-cup · stainless steel

Delivered Mar 12

📚

Atomic Habits

Paperback · James Clear

Delivered Mar 9

Problems with this approach

Still broke the flow

Associates had to context-switch to a search panel, know what to look for, read the result, and apply it. Just as disruptive as before, only slightly faster.

Surfaced in prototype testing feedback

Policy documents required interpretation, which introduced variance

Policy documents are dense. Associates still had to read, understand, and apply the information themselves, which is exactly where inconsistency came from.

No improvement seen after prototype testing rollout

Consistency did not improve — we had just moved the problem

Different associates found different documents and interpreted them differently. Customers still got different answers depending on who picked up.

Concept 2 — Iteration

CS Helper panel alongside match cards

We rebuilt CS Helper as a floating AI chat panel alongside the match cards. Associates could ask questions in plain language and get answers specific to the contact in front of them.

Dale Cooper

In chat · Marcus Webb

MATCH

prime

Amazon Prime Video

$8.99/mo · auto-renewed Mar 14

Charged today

🍳

Cuisinart Food Processor

14-cup · stainless steel

Delivered Mar 12

📚

Atomic Habits

Paperback · James Clear

Delivered Mar 9

prime

Amazon Prime Video

Refund eligible

Charge dateMarch 14, 2025

Amount$8.99

Content streamedNone confirmed

CS Helper

Policy assistant

–

CS Helper. Ask about this contact or Amazon policies.

What's the refund policy for auto-conversion?

Can I offer goodwill on a second contact?

YOU

What's the refund policy for Prime Video free trial auto-conversion?

CS Helper

Eligible for full refund within 30 days of auto-conversion if no content was streamed. Marcus has zero streaming activity, refund appropriate. One-time exception per CX-POL-2847.

Prime Video, Auto-renewal Policy

CX-POL-2847 · Full policy →

Ask CS Helper…

↑

What we learned from this iteration

Prototype participants described the experience as two separate tools

Better, but associates still felt like they were switching tools

The floating chat was more accessible than a search panel, but still felt like a separate tool. Associates had to switch context to open and interact with it.

Associates named this as the biggest improvement in prototype feedback

Proactive suggestions reduced the need to search

CS Helper began suggesting responses based on the contact context — a key step toward reducing the manual effort of policy lookup.

Consistent feedback across all prototype rounds

The floating panel felt like an add-on, not part of the tool

The goal was for CS Helper to feel like it belonged, not like an add-on sitting on top of the interface. That integration still needed refinement.

Final design

Embedded AI assistant with suggested responses

CS Helper became a dedicated panel — always present alongside the contact, never a separate tool to open. It proactively surfaced policy answers cited to source. Associates got the answer without searching for it.

CS Helper final design, full panel with policy documentation

What this solved

Associates stopped switching away during prototype sessions

Always present, never something to open

CS Helper sat alongside the contact as a dedicated panel — accessible without navigating away, always visible at any point in the conversation.

Associates described this as the most valuable change in post-test interviews

Answers in plain language, cited to the source

Plain language question in, policy-grounded answer out. Associates got the exact answer they needed for this contact, cited to the actual policy document.

Measured in cohort testing

The same question gets the same answer, regardless of who picks up

The same question gets the same answer regardless of tenure or experience, eliminating the variance that created unpredictable customer outcomes.

Impact

Before CS Helper

~65%

Of associates regularly navigated away from the contact to look up policy, averaging 2-3 lookups per contact.

After CS Helper

~18s

Average handle time reduction per contact in cohort testing. Associates stayed present and answers became consistent.

Solving problem 2

Context Summarizer

Arrive knowing the situation. Don't make them repeat it.

The previous tool was a full case management system, a dense, data-heavy interface where associates had to navigate through complete contact histories, transcripts, and customer records before they could begin helping. When contacts had long histories, associates either spent significant time reading through everything, or skipped it entirely and started from scratch. Either way, the customer paid for it.

The starting point

Dense case management, built for data — not for people

This was the baseline. Before AC3, associates handled contacts through a full case management system — tabs of raw data, full transcripts, and customer records that had to be read and interpreted before help could begin.

https://amzn-contact.internal/cms/contact?id=LP-2941&agent=DCOOPER&site=SEA-CS-01

Amazon Contact Management

Home

Contacts

Contact: LP-2941

Reports

Admin

DCOOPER | SEA-CS-01 | Logout

Search:

Home › Contacts › Palmer, Laura › Contact LP-2941 — Missing Item (LIVE) Call timer: 0:04:12 | Queue: 7 waiting

Contact History

History

Orders(14)

Account

DATE	REASON	STATUS
Mar 14	Missing Item	LIVE
Mar 11	Late Delivery	Resolved
Feb 28	Billing Inq.	Resolved
Jan 09	Return Req.	Unresolved
Dec 22	Missing Item	Resolved

Customer Record

Cust ID:	C-00449182
Name:	Palmer, Laura
Status:	PRIME 5YR
LTV (12mo):	$4,280.00
Contacts(90d):	4 ⚠
Goodwill(90d):	$22.50/$25.00
Fraud Score:	12 (LOW)
Open Cases:	1

⚠ Account Flag ACCT-FLAG-003: 4 contacts in 90 days. Supervisor approval required for credits over $15. One prior unresolved contact. See policy CX-POL-1204.

Case Details

Transcript

Orders (14)

Policy Ref

Workflow

Contact Classification

Case ID: LP-2941-0314 Channel: Inbound Voice (IVR: Y) Reason L1: Fulfillment Issues Reason L2: Missing Item Reason L3: Not in Delivery Location Workflow: WF-MISS-003-B SLA Tier: T1 Standard

Order & Shipment Details

Order ID: #113-884723-6618 ASIN: B07PQNHPFG Carrier: AMZL · TBA003124812000 Last Scan: 2025-03-14 11:32 PST Dlvry Status: Delivered · Handed to Resident Refund Elig.: Prime · No Return Required Prior Claims: 2 in 90 days ⚠ Return Window: Closes 2025-04-13

⚠ Account Flags & Policy Constraints — ACCT-FLAG-003

Supv. Required: YES — credits > $15 Goodwill Used: $22.50 of $25.00 Policy Note: If issuing replacement, select "Verified missing – Prime" in workflow step 3. Do NOT use WF-MISS-001 for this ASIN in US marketplace. Ref: CX-POL-0412 rev.4.

Agent: DCOOPER | Site: SEA-CS-01 | Role: T1 Associate Contact LP-2941 | LIVE 0:04:12 Queue: 7 waiting ACMS v4.2.1 | 14:26:20 PST

Problems with this approach

Observed immediately when associates opened contacts in prototype walkthroughs

Associates spent the first minute reading before they could say hello

For the associate, this was dead time — eyes down, customer waiting. For the customer, it was silence that signaled the tool was not ready to help them yet.

Associates described this exact problem in prototype feedback sessions

The context arrived at the wrong moment

Showing history inside the contact meant associates were still catching up while the customer was already talking. They needed to arrive prepared — not get prepared during the conversation.

Consistent across all prototype participants

The tool showed what happened, not what it meant

Raw logs and data fields with no synthesis. Associates had to piece together the story themselves. The tool handed them information. It did not hand them understanding.

Iteration

AI summary drawer inside the contact

We built an AI-generated summary into the contact workspace as a collapsible drawer. The summary was right there. But prototype testing revealed we had put it in the wrong place.

AC3 · Customer Care Center

Customer chat

Marcus Webb

Hi, I was charged again for Prime Video. I already contacted you about this 4 months ago. This is unacceptable.

Dale Chen

Hi Marcus, I can see this is the second time this has happened. I'm really sorry about that. Let me take a look at your account right now.

Marcus Webb

Thank you. I just want this sorted out. I never use Prime Video.

Suggested ·

Contact history summary

Customer contacting about a Prime Video charge of $8.99. Second contact, prior contact 4 months ago.

Prior contacts (3)

▾

Nov Prime Video charge · $8.99 Unresolved

Aug Late delivery Resolved

prime

video

Amazon Prime Video

$8.99/mo · auto-renewed Mar 14

Charged today

Cuisinart Food Processor

14-cup · stainless steel

Delivered Mar 12

Sony WH-1000XM5

Noise cancelling · wireless

Delivered Feb 28

What we learned from this iteration

Prototype participants responded positively to the summary format

An AI-generated summary was a clear improvement over raw logs

Associates got a readable narrative instead of raw data and fields. The content was better. But placing it inside the contact still meant they were catching up while the customer was already talking.

Consistent finding across all prototype sessions

The timing was better — but still wrong

The summary was inside the contact, which meant associates were still reading while the conversation had already started. We had improved what they saw. We had not yet fixed when they saw it.

Associates said this unprompted in almost every prototype session

The decisive insight: "I wish I knew this before I picked up"

That single quote drove the change. The summary needed to arrive before the associate accepted the contact — not after. Move it to the accept screen, and the whole dynamic shifts.

Concept

Final design · State 1, AI summary before accepting the contact

Dale Cooper

Available

INCOMING CHAT · BILLING INQUIRY

Marcus Webb

Prime Video billing, second contact about this charge

PREVIOUS CONTACT · 4 months ago

Called about the same Prime Video charge ($8.99). Received a courtesy credit but auto-renewal was not disabled. Left satisfied but the issue persisted.

Unresolved Billing · Inbound call

RECENT ORDER · RELATED

prime

Amazon Prime Video subscription

$8.99 charged March 14 · Auto-renewal from free trial

Before the associate accepts, they see who is calling and the full AI-generated summary of context from prior contacts. No suggested action, just what they need to know.

Concept

Final design · State 2, Summary persists at top during the contact

Dale Cooper

In chat · Marcus Webb

prime

AI SUMMARY

Customer contacting about a Prime Video charge of $8.99. Free trial auto-converted March 14. Second contact, prior contact 4 months ago, unresolved.

MATCH

prime

Amazon Prime Video

$8.99/mo · auto-renewed Mar 14

Charged today

🍳

Cuisinart Food Processor

14-cup · stainless steel

Delivered Mar 12

📚

Atomic Habits

Paperback · James Clear

Delivered Mar 9

After accepting, the summary strip remains at the top above the match cards, always visible, always scannable.

What this solved

Validated consistently across prototype sessions

Associates arrived prepared before the conversation started

By moving the summary to the accept screen, associates knew who the customer was and what had been tried — before they said a word. For the customer, the experience shifted completely.

Associates described it as completely different in post-test interviews

The first word of the contact changed

Instead of asking for an order number, associates could open with something genuinely useful. Customers stopped being asked to repeat themselves. The tone was set from the first line.

Confirmed in cohort testing results

The AI often surfaced a resolution alongside the context

When confidence was high enough, a suggested resolution appeared with the summary. Associates frequently arrived already knowing the answer — the contact became confirmation, not investigation.

Impact

Before

45–90s

Time associates spent reviewing history before they could meaningfully begin — while the customer waited with nothing happening.

After

~30s saved

Per contact in cohort testing. Associates arrived prepared. Customers stopped repeating themselves.

After

~30s saved

Per contact in cohort testing. Associates arrived prepared instead of catching up. Customers stopped repeating themselves.

Solving problem 3

Human-in-the-Loop

Keep the bot in the conversation. Put a human behind it.

Amazon's automated chat bot was handling a growing volume of contacts without a human involved. But bots have limits, ambiguous situations, edge cases, anything that requires real judgment. When the bot hit those limits, the only option was to hand the contact off entirely to a live associate, starting over. The customer experienced it as a disruption. The efficiency gains evaporated.

How the contact flow changed

Before

Customer

→

Chat bot

→

Hits a limit

Hard handoff to CSA

Customer experience breaks

After

Customer

→

Chat bot

→

CSA approves silently

→

Bot continues

Customer never knows

Initial concept

Bot requests approval inside the chat thread

When the bot reached a decision requiring human judgment, it surfaced an approval request directly in the chat thread. Simple to build — but the approval looked like just another chat message.

Dale Cooper

In chat · Marcus Webb

MATCH

prime

Amazon Prime Video

$8.99/mo · auto-renewed Mar 14

Charged today

🍳

Cuisinart Food Processor

14-cup · stainless steel

Delivered Mar 12

📚

Atomic Habits

Paperback · James Clear

Delivered Mar 9

CUSTOMER CHAT

Marcus Webb

MARCUS

I was charged again for Prime Video. This happened 4 months ago.

YOU

I can see this is the second time. Really sorry about that.

Bot requesting approval

I'd like to issue a full refund of $8.99 and disable auto-renewal. Approve?

Type a message…

↑

Problems with this approach

Associates raised this immediately in the first prototype session

The approval request got lost in the chat thread

The bot's proposed action looked like just another message. Associates were not sure if they were responding to the customer or confirming the bot. That ambiguity created hesitation.

Observed across all timed prototype walkthroughs

Speed and clarity pulled in opposite directions

To keep the bot seamless, the associate had to act fast. But acting fast meant not enough time to evaluate. Slowing down broke the continuity the feature was built to preserve.

Associates described this tradeoff unprompted

Associates were not sure who was making the decision

Participants regularly paused to figure out whether they were approving something the bot had already done, or something it was about to do. That uncertainty undermined the whole premise.

Concept

Final design, dedicated approval surface, same pattern as predictive resolution

Dale Cooper

In chat · Marcus Webb

CUSTOMER CHAT

Marcus Webb

MARCUS

I was charged again for Prime Video. This already happened before.

YOU

I can see this is the second time. I'm really sorry about that.

MARCUS

I just want this sorted out.

Type a message…

↑

Bot requesting approval

Not visible to customer

Customer said

Charge dispute

Amount

$8.99 · Mar 14

Confidence

High · 94%

Prior contacts

Yes · same issue

Issue full refund · $8.99

Free trial auto-conversion · no content streamed · second contact

MATCH

prime

Amazon Prime Video

$8.99/mo · auto-renewed Mar 14

Charged today

🍳

Cuisinart Food Processor

14-cup · stainless steel

Delivered Mar 12

📚

Atomic Habits

Paperback · James Clear

Delivered Mar 9

What this solved

Associates acted in under three seconds in prototype sessions

A dedicated surface meant associates knew exactly what they were looking at

By pulling the approval out of the chat thread and giving it a dedicated space at the top of the screen, the action was unmissable and unambiguous. No hunting. No second-guessing.

Associates reported zero learning curve — they recognized the pattern immediately

The same pattern as Predictive Resolution made it instantly familiar

Contact data, proposed action, confirm or override. Associates had already internalized that structure. No explanation needed. Recognition was instant.

Confirmed in cohort testing across 7 global marketplaces

The customer never knew a human was involved

The conversation continued uninterrupted. The judgment stayed human. Efficiency gains stayed intact. 80%+ time savings across all markets.

Impact

Before

Hard break

When the bot hit a limit, the customer experienced a full handoff — starting over with a live associate, no context carried over.

After

80%+

Time savings across 7 global marketplaces. The conversation stayed seamless. Leadership restructured roadmaps to scale across more use cases immediately.

After

80%+

Time savings across 7 global marketplaces. The customer experience stayed seamless. Leadership prioritized scaling across more use cases immediately.

Solving problem 4

Predictive Resolution

Everything the associate needs, before the contact starts.

For a large category of contacts, Amazon already had all the information needed to resolve them before the associate even picked up. The item was missing. Amazon knew what it was, when it shipped, where it was delivered, and what the policy said. But the associate still had to navigate through a manual series of questions to reach a resolution that was already predictable. Some customers had even learned which answers produced the outcome they wanted, which generated inaccurate data and made it harder for Amazon to understand and fix the real underlying problems.

The starting point

Manual workflow — step-by-step radio buttons, no AI

This was the baseline. For every contact — regardless of how predictable the outcome — associates navigated the same step-by-step sequence. Amazon already had the data to predict the resolution. The workflow made no use of it.

Step 1, Navigate to contact type

Step 2, Work through radio button questions

Problems with this starting point

Associates described this frustration in research sessions

Associates were navigating to conclusions the system could already predict

For routine contacts, the outcome was already predictable. Amazon had the order data, delivery status, and policy match. The associate still navigated step by step to a conclusion the system already knew.

Surfaced through contact data analysis and defect reporting

Customers had learned how to game the workflow

Because outcomes depended on the answers given, customers — and some associates — learned which selections produced the desired result. This generated unreliable data and made it nearly impossible to identify the real underlying problems causing repeat contacts.

Pattern visible across all design reviews and prototype sessions

Every contact started from zero, regardless of what Amazon already knew

No prior contacts, no order data, no delivery history — none of it surfaced. For the associate, unnecessary work. For the customer, questions that should not need to be asked.

First AI concept

AI pre-fills the match and solve cards

The first direction: let AI pre-fill the match and solve cards with its best prediction so associates could skip the navigation. It proved the AI could predict correctly. But the presentation created a trust problem we did not anticipate.

Dale Cooper

In call · Laura Palmer

MATCH

🔌

Amazon 5W USB Charger

iOS and Android · OEM

Delivered today

🐕

Logical Dog Harness

Full grain · medium

Delivered today

🕶️

Polarized Sunglasses

UV400 · unisex

Delivered today

🔌

Amazon 5W USB Wall Charger

Replacement eligible

Workflow: Request a replacement

1. How was the item damaged or affected?

Item was missing from the delivery

Item arrived physically broken

Item has visible damage to packaging or product

Item stopped working within 30 days of delivery

2. Can the customer provide evidence of damage?

Yes, customer has photos or video

No, item was missing, no evidence possible

Customer unwilling to provide evidence

3. Is a return required before issuing replacement?

Yes, return label required

No return required, Prime eligible

Escalate to supervisor for decision

What we learned from this concept

Associates reacted negatively in the first prototype sessions

Pre-filled forms felt like the AI had already made the decision

When the form arrived pre-filled, associates felt bypassed rather than supported. They questioned the suggestion more, not less — and had no way to understand why the AI chose what it chose.

The gap became clear in prototype debrief sessions

There was an answer but no evidence — and that created doubt

The AI confidence was not visible. No receipts, no supporting data. Associates were being asked to trust a conclusion without access to the reasoning behind it.

Identified in the working session after testing

The AI suggestion needed its own surface, separate from the standard workflow

Pre-filling the existing form mixed the AI output with the associate navigation. To build trust, the suggestion needed its own space, its own evidence, and the standard flow still available below it.

Final design

Dedicated widget with contact data, resolution, and receipts

Instead of pre-filling the form, we built a purpose-built AI widget at the top of the workspace. It showed the suggested resolution alongside the supporting evidence. Associates verified, not navigated.

Dale Cooper

In chat · Marcus Webb

CUSTOMER CHAT

Marcus Webb

MARCUS

I was charged again for Prime Video. This already happened before.

YOU

I can see this is the second time. I'm really sorry about that.

MARCUS

I just want this sorted out.

Type a message…

↑

AI resolution

High confidence · 96%

Contact reason

Subscription charge dispute

Charge · date

$8.99 · March 14

Prior contact

Yes · same issue

Issue full refund · $8.99

Free trial auto-conversion · no content streamed · first-time exception eligible

MATCH

prime

Amazon Prime Video

$8.99/mo · auto-renewed Mar 14

Charged today

🍳

Cuisinart Food Processor

14-cup · stainless steel

Delivered Mar 12

📚

Atomic Habits

Paperback · James Clear

Delivered Mar 9

What this solved

Associates acted confidently in all prototype sessions

The widget separated the AI's suggestion from the associate's workflow

By giving the AI its own surface, the suggestion felt like a recommendation rather than a decision already made. Associates retained control — and felt it. The standard workflow was always one click below.

Trust built quickly once associates could see the supporting data

The receipts made the AI's reasoning visible

Contact reason, charge amount, prior contacts, confidence level — all visible at a glance. Associates verified the AI's logic in seconds. Seeing the evidence made the suggestion feel earned rather than imposed.

Measured in cohort testing; VP restructured roadmaps after seeing the result

The associate's job shifted from navigating to confirming

Same outcome, a fraction of the time. In cohort testing, resolution time dropped ~78%. That result did not just validate the design — it became a mandate to scale.

Impact

Before

Step-by-step

Associates navigated the full workflow for every contact — even when the system could already predict the resolution. Time, error, and inconsistency built in by design.

After

~78%

Faster to resolve in cohort testing. VP leadership restructured roadmaps to scale it. The POC result was not just a metric — it was a mandate.

After

~78%

Faster to resolve in cohort testing. VP leadership restructured roadmaps around scaling it. The POC result wasn't just a metric, it was a mandate.

SCALING WHAT WORKED

Built once. Available to everyone.

With AI experiments running across retail, devices, digital, shipping, and Amazon Business simultaneously, there was a real risk of fragmentation, the same problem solved five different ways, five separate times, none of them able to learn from the others. Because my team owned the UX framework, we were in a position to prevent that. Any design pattern that proved out in one business unit could be evaluated for whether it belonged in the shared platform, available to every team, not just the one that built it first. I set up a structured cross-team review process across four organizations. Designers from each vertical brought their work. We identified what was genuinely reusable, refined it, and graduated it into the framework. The result: teams didn't have to reinvent. They could onboard onto proven patterns and focus on what was unique to their context.

20+ AI interaction patterns identified, validated across verticals, and built into the shared platform, so every team onboarding to AC3 could use them without starting from scratch.

— Cross-org UX sync outcome

OUTCOMES

Results that moved roadmaps.

Every initiative here was measured against real contacts with real associates, not simulated scenarios or internal demos. The results were strong enough that leadership didn't just note them. They restructured roadmaps around them.

~78%

Faster to resolve contacts, predictive resolution

80%+

Bot automation time savings, 7 global marketplaces

~30s

Per-contact time savings, context summarizer

20+

Reusable AI patterns shipped to platform

SEE IT IN ACTION

See it in context.

Two interactive prototypes built to the level of fidelity we used for associate testing. Each one walks through a contact type that happens thousands of times a day at Amazon.

Scenario 1

Missing item

A customer calls about a USB charger that was marked delivered but isn't there. See the context summarizer, match cards, and solve flow in action.

Open demo

Scenario 2

Repeat charge dispute

A frustrated customer contacts about a Prime Video charge they've raised before. CS Helper surfaces the policy, the summarizer provides context, and the AI suggests a resolution.

Open demo

Next case study

AC3 Scaling, how we built the team behind the platform

View case study →