18.06, Webinar From SQL scripts to production data platform on AWS
6 min read

Automation: from bottleneck to advantage

Learn how we built a serverless AWS architecture for an entertainment company that handles thousands of requests daily.




As digital products grow, customer support often becomes the first system to break under scale. When tens of thousands of messages arrive each month, even well-designed workflows collapse into queues, workarounds, and frustrated users trying to reach a human at any cost. The real issue isn’t response speed — it’s misallocated human effort. Skilled agents handle repetitive requests while complex, high-risk cases wait longer than they should. This creates rising costs, declining experience, and an operation that feels reactive rather than scalable.

In this article, we explore a practical approach to automating customer support for high-volume environments using AI-driven intent recognition and backend action execution. You’ll see how automation can resolve common requests instantly while routing sensitive cases to humans with the right context. The patterns described here come from building production systems designed to handle large message volumes with strict accuracy requirements.

Why this problem matters

At scale, customer support is shaped by three compounding pressures that traditional workflows can’t absorb:

A less obvious but critical challenge emerges at this point: users learn how to bypass guided flows. Free-text inputs and generic options like “Other” introduce noise into queues, pulling agents into avoidable back-and-forth and delaying resolution for high-priority cases.

Solving this requires more than a chatbot. It demands an intelligent automation layer that can protect support flows, triage intent accurately, handle routine actions end-to-end, and escalate complex or sensitive cases to humans — fast, safely, and with full context.

Our use case & objectives

We framed the initiative around four progressive capabilities for our customer support AI:

These goals map tightly to business outcomes: lower cost per resolution, faster responses, better agent focus, and higher consistency across policies.

High-level flow

High-level architecture flow diagram

The starting point is a button-based support interface built on Intercom. Users are guided toward predefined categories intended to structure demand and reduce ambiguity. This approach works well for many — but at scale, cracks emerge. Some users shortcut the system by selecting generic options or entering free-text to reach an agent faster, creating noise and misaligned routing. To address this, the architecture introduces AI in two distinct but complementary roles.

The first is an AI firewall that sits between the incoming user message and human agent routing. Every interaction is evaluated before it reaches the support queue:

The second AI role operates alongside humans as a conversational support agent. This component handles low-complexity, high-frequency cases independently, following deterministic, validated workflows. It resolves simple requests end-to-end or hands off to a human when conditions exceed defined boundaries.

Together, these layers protect agent focus, reduce queue noise, and allow automation to grow safely — one category at a time.

Architecture overview

Detailed architecture diagram

This architecture illustrates how AI is layered onto an existing helpdesk system. While the full platform includes monitoring, secrets management, and security controls, those elements are deliberately omitted here to keep the diagram focused on core data and decision flows.

The user interacts with a helpdesk interface through a primarily button-based flow, with limited free-text input at the end. All user interactions are sent via a REST API to Amazon API Gateway, which triggers a single, central AWS Lambda function acting as an AI agent router. This Lambda is responsible for orchestrating the system’s behaviour and contains the application logic that determines how each request is handled.

The Lambda communicates directly with a large language model hosted on Amazon Bedrock, where the core reasoning and intent interpretation occur. Depending on the request, the same underlying model operates in two distinct modes. In AI firewall mode, it validates intent, detects bypass attempts or misuse, redirects users to appropriate paths, or escalates urgent cases. In AI support agent mode, it resolves low-complexity issues autonomously or prepares structured hand-offs to human agents.

Both modes rely on retrieval from Amazon OpenSearch, which indexes curated knowledge stored in Amazon S3. A dedicated “data-puller” Lambda, triggered daily via EventBridge, synchronizes selected internal sources to keep this knowledge current. When needed, the agent Lambda also integrates with backend services to perform lightweight checks or actions.

This is the kind of project where the architecture appears simple, but the underlying solution is complex and hard to fine-tune. The real effort sits in Lambda code and LLM behaviour design, while parts of conversation handling remain in helpdesk workflows — yet despite its simplicity on paper, it delivers real impact by reducing costs and freeing agent time.

Data & knowledge: what we used

We curated these sources for clarity and minimal ambiguity, because early in the PoC the chosen model favored concise, essential instructions; richer context could cause inconsistent outputs. A more capable model would let us reintroduce detail without losing stability.

Early results & adoption signals

In early validations and reviewer tests:

We also captured human feedback on specific misclassifications (e.g., “ok vs wrong-path”), which is essential for the next iteration.

Final thoughts

AI-driven support automation isn’t about replacing agents — it’s about protecting their time. Start by defending flows and prioritizing safety, then grow automation where the evidence is strongest. The combination of serverless AWS, Amazon Bedrock RAG, and tight documentation governance gives you a stack that’s scalable, explainable, and adaptable to future channels and models.

If you’re planning a similar initiative, anchor the work in your current pain points (bypass, latency, costs), commit to evaluation discipline, and treat content quality as seriously as code. The payoff is a support operation that moves from cost center to competitive advantage — faster for users, calmer for agents, and clearer for leadership.

Let's talk about your project

We'd love to answer your questions and help you thrive in the cloud.