If your AI automations are underperforming, the problem almost certainly isn't your prompts. It's the architecture of how your systems work together.
This insight comes from a recent O'Reilly Radar analysis by Nicole Königstein that's reshaping how forward-thinking businesses approach AI implementation. The core argument is compelling: we've moved beyond the era of prompt engineering into an era of organisational systems. For multi-branch estate agencies wrestling with inconsistent AI results across locations, this shift in thinking offers a practical path forward.
The article introduces what Königstein calls the "prompting fallacy"—the mistaken belief that tweaking prompts can fix what are actually systemic coordination failures. If you've spent hours refining your ChatGPT instructions for property descriptions only to get wildly inconsistent results, you've experienced this firsthand. The solution isn't better words; it's better architecture.
For estate agency leaders managing multiple branches, this framework provides a blueprint for building AI systems that actually scale—systems where your Watford office gets the same quality outputs as your St Albans branch, without requiring constant manual oversight.
Understanding the Four Collaboration Patterns
The O'Reilly analysis breaks down multi-agent architecture into four distinct patterns, each suited to different types of work. Understanding which pattern fits which task is the difference between AI that genuinely helps and AI that creates more problems than it solves.
Supervisor-Based Architecture
In this pattern, a single central agent plans, delegates, and decides when work is complete. Think of it as your best branch manager, digitised. This works brilliantly for tightly scoped sequential tasks.
Estate agency applications:
- Compliance document verification workflows
- Anti-money laundering checks across a transaction
- Sequential property listing approval processes
- End-to-end tenant referencing pipelines
The weakness is that everything flows through one point. This creates latency and fills up context windows—the AI equivalent of a branch manager who insists on reviewing every email personally. For high-volume tasks, this bottleneck becomes problematic.
Blackboard-Style Architecture
Multiple specialist agents contribute partial solutions to a shared workspace. Others critique, refine, and build on those contributions. Improvement comes through accumulation rather than command.
Estate agency applications:
- Property valuation synthesis (market data + comparables + local knowledge)
- Marketing copy development where different specialists handle different elements
- Complex negotiation strategy development
- Multi-factor vendor reports combining various data sources
This pattern shines for creative or exploratory work. When you need a property description that incorporates market positioning, emotional appeal, and SEO optimisation, having specialist agents contribute to a shared draft produces better results than any single agent working alone.
Peer-to-Peer Architecture
Agents exchange information directly without central control. This works well for dynamic tasks requiring real-time coordination across multiple touchpoints.
Estate agency applications:
- Cross-branch property matching where buyer enquiries flow between locations
- Real-time availability coordination for viewings
- Multi-step discovery processes (finding suitable properties across your portfolio)
- Distributed lead qualification across channels
The risk is drift. Without aggregation or validation, peer-to-peer systems can fragment or loop. An agent matching buyers to properties might keep circling back to the same unsuitable listings without external oversight flagging the pattern.
Swarm Architecture
Multiple agents work in parallel with deliberate redundancy. Overlap validates signals while divergence avoids blind spots.
Estate agency applications:
- Market analysis where multiple agents examine different data sources simultaneously
- Lead scoring where several models evaluate the same prospect
- Property pricing recommendations from multiple valuation approaches
- Risk assessment across compliance, financial, and operational dimensions
The key risk is generating volume faster than decisions. Swarms can produce runaway token costs if not paired with a consolidation phase. Three agents each generating 2,000-word analysis reports creates more noise than signal without a synthesiser.
Most production systems benefit from hybrid patterns—fast specialists operating in parallel while a slower, more deliberate agent periodically aggregates results and decides whether to continue or stop.
Where Does Your Agency Stand?
Take our free 5-minute Data Maturity Assessment to see how you score across all 6 dimensions.
TAKE THE ASSESSMENTHiring Your AI Team: Matching Models to Roles
A central metaphor throughout the O'Reilly analysis treats agent design like team building. Different model types have different "architectural personalities," and putting the wrong model in the wrong role guarantees poor results.
The Generators and Planners (Decoder-Only Models)
Models like GPT and Claude excel at drafting, coding, and producing candidate solutions. These are your creative producers—the agents who write property descriptions, draft email responses, and generate marketing copy.
In estate agency terms, these are your negotiators and listers. Give them a brief and they'll produce output. They're fast, articulate, and capable of handling nuance. But they're not naturally analytical.
The Analysts and Investigators (Encoder-Only Models)
Models like BERT and ModernBERT are strong at semantic search, filtering, and relevance scoring. Use these to narrow the search space before expensive generators engage.
In estate agency terms, these are your researchers and qualifiers. Before your generator writes a property description, an encoder model identifies which comparable properties are genuinely relevant. Before your email generator drafts a response, an encoder model classifies the enquiry type and urgency.
This filtering step is crucial for cost control. Running an expensive generator model on every incoming enquiry is like having your senior negotiator personally handle every phone call, including wrong numbers.
The Specialist Departments (Mixture of Experts)
These architectures include internal specialist departments where a router activates only relevant experts per token. High capability with selective compute spend.
For estate agencies, think of this as a model that automatically knows when to consult legal expertise versus marketing expertise versus local market knowledge—all within a single system.
The Deliberate Thinkers (Reasoning Models)
Slower but more deliberate, these models pause and check their own reasoning. They prevent expensive downstream mistakes.
Deploy reasoning models for high-stakes decisions: should we reduce the asking price? Is this chain likely to complete? What's the real risk in this transaction? These aren't questions you want answered quickly—you want them answered correctly.
The Critical Insight
Here's what Königstein emphasises that changes everything: if you're writing a 2,000-word prompt to make a fast generator act like a thinker, you've made a bad hire.
You don't need a better prompt. You need a different model.
This single insight could save your agency thousands of pounds in wasted AI costs. Stop trying to force your quick-response model to perform deep analysis through increasingly elaborate prompts. Instead, route analytical tasks to models designed for analysis, and creative tasks to models designed for creation.
Scaling Laws: What Actually Improves AI Performance
The O'Reilly analysis draws an important distinction between two types of scaling that affect AI performance differently.
Neural Scaling (Bigger Models)
Making individual models larger improves their raw capability. This is the approach OpenAI and Anthropic take with each successive model generation. But there are diminishing returns, and cost increases faster than performance.
Test-Time Scaling (More Thinking)
Giving models more time to reason—letting them work through problems step by step—can produce better results than simply using bigger models. This is why Chain-of-Thought prompting and reasoning models have become so effective.
For estate agencies, the practical implication is this: you often don't need the most expensive model. You need adequate models with better reasoning frameworks.
A mid-tier model that takes time to analyse market comparables properly will outperform a premium model that rushes to conclusions. The architecture of thinking matters more than the raw power of the thinker.
Organisational Scaling (More Agents)
This is where multi-agent architecture shines. Rather than making one agent smarter, you create specialist agents that collaborate effectively. A team of focused specialists consistently outperforms a single generalist, provided the coordination overhead doesn't exceed the benefit.
For a multi-branch estate agency, this means building specialist agents for:
- Lead qualification (high volume, fast response required)
- Property matching (needs access to full portfolio, buyer preferences)
- Compliance checking (requires precision, not creativity)
- Marketing content (needs brand voice consistency)
- Market analysis (requires depth over speed)
Each specialist operates within its domain of expertise. A supervisor agent coordinates handoffs and aggregates results when needed.
Practical Implementation: The Hybrid Pattern
Based on the O'Reilly framework, here's what effective multi-agent architecture looks like for a typical multi-branch estate agency:
Layer 1: Fast Specialist Swarm
Multiple lightweight agents handle high-volume, time-sensitive tasks in parallel:
- Enquiry classification and routing
- Availability checking across branches
- Initial lead scoring
- Property alert matching
These agents use smaller, faster models. They make quick decisions on routine matters and escalate anything complex.
Layer 2: Blackboard Collaboration
For tasks requiring multiple perspectives, specialists contribute to shared workspaces:
- Property valuations combine market data, comparable analysis, and local knowledge
- Marketing content incorporates SEO, emotional appeal, and brand guidelines
- Vendor reports synthesise transaction progress, market conditions, and recommendations
Layer 3: Supervisor Aggregation
A more deliberate reasoning model periodically reviews outputs from Layers 1 and 2:
- Catches errors before they reach clients
- Identifies patterns requiring human attention
- Makes decisions about when to continue versus escalate
- Ensures consistency across branches
Layer 4: Human Oversight
Your team focuses on what humans do best:
- Relationship management
- Complex negotiations
- Strategic decisions
- Exception handling
The AI system handles volume; your people handle value.
Avoiding the Prompting Fallacy in Practice
Let's apply this to a common estate agency challenge: inconsistent property description quality across branches.
The prompting fallacy approach: Keep refining the prompt. Add more examples. Make instructions more detailed. When that doesn't work, add more context. Eventually you have a 3,000-word prompt that still produces inconsistent results.
The architectural approach:
- Deploy a filtering agent that extracts and validates property features from listings data before any writing begins
- Use a specialist agent for SEO keyword research specific to the property type and location
- Route to a generator agent with a concise prompt focused on voice and style (not content, which is now structured)
- Add a reviewer agent that checks output against brand guidelines and common errors
- Include a consistency agent that compares new descriptions to recent successful examples from across the network
Each agent has a focused job. No single agent needs a massive prompt because no single agent is trying to do everything. The system produces consistent results because consistency is designed into the architecture, not hoped for through prompt engineering.
Ready to Transform Your Agency's Data?
Our Data Strategy Intensive delivers a complete assessment and roadmap in 6-8 weeks—with guaranteed results or your money back.
BOOK A DISCOVERY CALLNext Steps: Building Your Multi-Agent Foundation
The shift from prompt engineering to organisational systems isn't about abandoning what works—it's about understanding why some things work and others don't, then building accordingly.
For estate agency leaders considering multi-agent architectures, here's where to start:
- Audit your current AI usage. Where are you fighting with prompts? Those are probably architectural problems.
- Identify natural specialist domains. What distinct types of work does your agency perform that could benefit from specialist agents?
- Map the handoffs. How does work currently flow between tasks? This reveals where coordination agents add value.
- Start with one hybrid implementation. Pick a process that's currently problematic and redesign it using the layered approach described above.
- Measure against consistency, not just quality. The real test of multi-agent architecture is whether it performs equally well across all branches, all the time.
The estate agencies that thrive in 2026 and beyond won't be those with the cleverest prompts. They'll be those with the most effective organisational systems for AI—architectures that scale across branches, maintain consistency under pressure, and free human expertise for genuinely human work.
That's not a prompting problem. That's a design opportunity.