How We Built an AI Sales Agent for a Saudi manufacturing Company
In early 2026, a mid-sized Saudi manufacturing company came to us with a familiar pain point: their sales team was spending 60% of their time on cold outreach, lead qualification, and follow-up — leaving only 40% for actual closing. With 15 sales reps covering three product lines across five regions, they were losing leads in the cracks and struggling to scale. Their ask was simple but audacious: "Build us an AI sales agent that can do the grunt work so our people can focus on selling."
This is the story of how we designed, built, and deployed a fully autonomous AI sales agent for a real B2B manufacturing business — the architecture, the challenges, the results, and the lessons learned along the way.
The Challenge: Industrial B2B Sales at Scale
Manufacturing B2B sales in Saudi Arabia comes with unique complexity. The client — let's call them "GulfMetals" — manufactures steel components, aluminum extrusions, and specialized fasteners. Their buyers range from small construction contractors to large government infrastructure projects. Every segment requires different outreach, different technical knowledge, and different pricing models.
The core problems we needed to solve:
- Cold outreach inefficiency: Reps manually researched prospects, drafted emails, and made calls — producing 5-10 qualified leads per week per rep at best.
- Multilingual complexity: The sales cycle spans Arabic and English, and some technical documents are only available in Arabic. Reps needed translation support for every interaction.
- Long follow-up cycles: B2B deals in manufacturing take 45-90 days with multiple touches required. Human reps often forgot to follow up at the right intervals.
- Inconsistent qualification: Without a standardized lead scoring framework, reps qualified leads based on gut feel rather than data.
- CRM data decay: Contact information, company details, and past interactions lived in spreadsheets or outdated CRM entries.
The client wanted a system that could autonomously research prospects, initiate outreach in both Arabic and English, qualify leads against a scoring model, schedule meetings, and hand off only high-intent opportunities to human reps.
The Architecture: Building the Agentic Sales Stack
We designed the AI sales agent as a multi-agent orchestration system with five specialized sub-agents, each responsible for a distinct part of the sales process. Here's how the stack is laid out:
Orchestrator Agent (Central Brain)
The orchestrator manages the overall sales workflow. It receives leads from the CRM or import pipeline, decides which sub-agent to invoke, tracks conversation state across channels, and escalates to human reps when the lead reaches "hot" status. Think of it as the conductor of an orchestra — it doesn't play every instrument, but it ensures everyone plays in sync.
Research & Enrichment Agent
This agent autonomously researches every incoming lead. It scrapes company websites, checks LinkedIn profiles, reads recent news, and pulls financial data (where available). It enriches the CRM record with company size, industry, recent projects, decision-maker names, and any "trigger events" (new funding, expansion announcements, new leadership hires) that signal buying intent. For a Saudi construction contractor, this might mean detecting a new infrastructure project awarded by the government — a perfect time to pitch steel components.
Communication Agent (Multilingual Outreach)
The communication agent handles all outbound messaging across email, WhatsApp Business, and SMS. It generates personalized messages in Arabic or English depending on the prospect's language preference (detected from public profiles or prior interactions). The agent maintains conversation context across channels — a prospect who replies on WhatsApp after receiving an email gets a coherent, context-aware response. It also handles objection handling autonomously: "We already have a supplier" triggers a pre-approved competitive differentiation script, while "Send me a quote" triggers the quotation engine.
Lead Scoring & Qualification Agent
This agent maintains a dynamic scoring model that evaluates every lead across five dimensions: intent signals (recent project announcements, RFQs), fit score (company size, industry match), engagement score (email open rate, reply sentiment, call pickup rate), budget proxy (company revenue band, project value estimates), and timeline (urgency signals like lease expiry or project deadline). Leads scoring above 85 points are auto-escalated to a human rep with a complete brief including conversation history, qualification summary, and suggested next steps.
Voice Call Agent (AI Cold Calling)
The most ambitious component: an AI voice agent that makes cold calls using natural-sounding speech synthesis and real-time conversation AI. Powered by a voice-to-voice pipeline (speech-to-text, LLM reasoning, text-to-speech), the agent can handle common call flows: gatekeeper deflection ("Who's calling?"), elevator pitch delivery, objection handling, and meeting scheduling. It operates in both Arabic and English and adapts its tone based on the prospect's language and speaking style. If the prospect asks a question the agent can't answer, it escalates to a human immediately.
Technical Deep Dive: How the Agent Works
Under the hood, the system runs on a modular agentic framework built on top of Hermes Agent as the runtime orchestrator. Each sub-agent is a self-contained process with its own tools, memory, and prompt instructions.
Tool-Augmented Reasoning
Every sub-agent uses function-calling LLMs (we used DeepSeek V4 and GPT-4o) to decide which tools to invoke. The Research Agent has tools for web scraping, LinkedIn API queries, and CRM reads. The Communication Agent has tools for send_email, send_whatsapp, and schedule_meeting. This tool-augmented approach means the LLM never guesses — it always consults real data before acting.
Persistent Memory & State Management
Each lead's journey is tracked as a state machine: New → Researched → Contacted → Engaged → Qualified → Hot (escalated). The orchestrator persists state in a PostgreSQL database (using Shayntech TimeTravel for audit trail, naturally) and uses a Redis-backed conversation cache for real-time context. If the system crashes mid-call, it picks up exactly where it left off.
Multilingual Natural Language Pipeline
For Arabic-language interactions, we built a pipeline that: (1) detects language from the incoming message, (2) uses a dedicated Arabic LLM fine-tune for understanding nuanced Saudi dialect and business Arabic, (3) generates responses in the detected language with culturally appropriate phrasing (Islamic greetings, respect titles, local business idioms), and (4) maintains a bilingual thread so a conversation that starts in Arabic and switches to English remains coherent. This was critical — our client's buyers commonly switch between the two languages mid-conversation.
Results: What Happened After 90 Days
After three months of production deployment, the numbers speak for themselves:
3.2x More Qualified Leads Per Week
The AI agent consistently produced 32 qualified leads per week (scored above 70) versus 10 per week from human-only outreach. The autonomous research and 24/7 outreach capability were the primary drivers.
68% Reduction in Time to First Follow-Up
The agent responded to inbound inquiries and triggered first follow-ups within 2 minutes on average, versus 4.6 hours for human reps. Faster response directly correlated with 40% higher reply rates.
41% Increase in Demo Conversion Rate
By the time human reps took over, leads were already qualified, researched, and primed with relevant context. Reps could focus on closing rather than qualifying, resulting in a significant jump in demo-to-deal conversion.
75% of Initial Outreach Handled Autonomously
Three-quarters of all first-touch outreach (cold emails, WhatsApp intros, initial calls) was handled entirely by the AI agent without human involvement. Humans only stepped in at the "hot lead" stage, negotiation, and closing.
Lessons Learned: What Worked & What Didn't
Building an AI sales agent for a real B2B environment taught us hard-won lessons that shaped our approach to agentic systems:
- Human-in-the-loop is non-negotiable: We started with full autonomy but quickly learned that certain interactions — price negotiations, complex technical questions, sensitive account situations — need a human. The escalation mechanism must be fast and seamless.
- Arabic language quality is hard: Off-the-shelf LLMs handle Modern Standard Arabic (MSA) well, but Saudi dialect, business Arabic, and code-switching are significantly harder. Fine-tuning on domain-specific Saudi business conversations was essential for natural-sounding interactions.
- Data quality matters more than AI quality: Garbage in, garbage out. Clean CRM data, accurate contact information, and up-to-date company profiles had a bigger impact on agent performance than any model upgrade.
- Voice calls are the hardest channel: Email and WhatsApp work reliably with current LLM quality, but voice calls (especially cold calls) are still challenging. Background noise, accents, interruptions, and unexpected questions push the limits of current voice-to-voice pipelines.
- Monitoring and observability are essential: Every agent action needs to be logged, traceable, and reviewable. We built a dashboard showing all agent decisions, conversation transcripts, and escalation logs so the sales manager could audit any interaction.
Enterprise-Grade Security & Compliance
All customer data resides on Saudi-hosted infrastructure (KSA data residency). Conversations are encrypted at rest and in transit. The agent follows SOC 2-aligned data handling policies, and every interaction is logged with tamper-evident SHA-256 hash chains via TimeTravel. We built the system from day one with the understanding that Saudi manufacturing companies handle sensitive commercial data that cannot leave the kingdom.
What's Next: The Roadmap
We're actively extending this platform for GulfMetals and other manufacturing clients. The roadmap includes:
- WhatsApp Business API deep integration: Full product catalog sharing, quotation delivery via WhatsApp, and payment link embedding within the conversation thread.
- Multi-channel conversation merging: Seamlessly connect email, WhatsApp, phone, and even in-person meeting notes into a single lead timeline.
- Predictive lead scoring with ML: Beyond rules-based scoring, training a machine learning model on historical conversion data to predict which leads are most likely to close and recommend optimal outreach sequences.
- Arabic-first document understanding: The agent will be able to read and extract information from Arabic PDFs (BOQs, contracts, technical specs) to automatically populate CRM fields and generate customized proposals.
- Expansion to other Gulf markets: Adapting the agent for UAE, Qatar, and Kuwait, with market-specific compliance, business culture, and dialect handling.
Ready to transform your workflow?
Book a free 15-minute demo and see how Agentic AI works for your business.
Book a Free Demo