AI & ML

How to Incorporate AI into Your Business Today

By
Joel Passen
April 26, 2023
5 min read

What is AI for business?

AI is not a fad. It’s the news. It dominates the talk of every business conference. And it is the number 1 topic of countless leadership meetings. CEOs are challenging their teams to leverage AI in every facet of their businesses to increase their productivity. gain deeper insights into user behavior, automate mundane tasks, and drive deeper insights with which to make critical decisions. It’s here. The big question is: How can you use it in your business right now?

GPT has catapulted AI into the spotlight, but does it translate into measurable productivity and revenue gains? A recent study by the  National Bureau of Economic Research suggests that even in these early innings of AI for business, it’s moving the needle for early adopters.The study’s authors, Erik Brynjolfsson, Danielle Li Lindsey, and R. Raymond, concluded that AI “disseminates the potentially tacit knowledge of more able workers and helps newer workers move down the experience curve.” Our takeaway from the study is that AI is already helping humans do things faster with often better results.

AI provides businesses with numerous automation opportunities, which can help save time and money while increasing accuracy at the same time. Automation technologies such as natural language processing allow machines to interpret human speech or text input without any manual intervention required from humans. This eliminates tedious data entry tasks from employees’ workflows so they can focus their energy on higher-value activities. According to a recent Zapier study, “76% of respondents said they spend 1 - 3 hours a day simply moving data from one place to another. Additionally, 73% of workers spend 1 - 3 hours trying to find information or a particular document.” Additionally, automated processes reduce errors due to human factors like fatigue or distraction, which could otherwise lead to costly mistakes. The same Zapier study confirms this, arguing that “83% of workers said they spend 1 - 3 hours a day fixing errors.” That’s a full workday moving data around, looking for information, and fixing human errors. Imagine what you or your team could do with their day if that were a thing of the past.

The fact is that your company will never generate less data than it does now. The issue is that traditionally, 90% of data generated and collected by businesses is dark—untapped, and often completely unknown. Thanks to applied AI, companies can now use advanced algorithms to easily analyze large unstructured data sets, allowing them to understand consumer and customer behaviors better. This application spans the organization from marketing to engineering, product to customer experience, business intelligence to RevOps, etc. Imagine if these teams had their very own AI teammate that organized every interaction with your prospects and customers and turned it into knowledge and insights to help make everyone’s job easier. Now that’s possible with AI. 

Harness the power of AI today. Simple, smart, and safe.

On a more grounded note, AI will have growing pains along the way. It has problems of its own, especially for businesses that need to leverage it to stay competitive. 

It’s clear that adopting AI is no longer a question of “if” but “when.” It’s an opportunity facing every leader in every industry. And as we confirmed in our research for this report, there are huge incentives to move quickly. But before you leap into AI-powered solutions for your business, you must evaluate them properly—not only from a technical standpoint but also from a functional perspective.

Let’s be honest, millions of dollars will be wasted trying to “roll your own” AI. Before attempting a bespoke AI project, it is important to understand the challenges.

Cross-modal data integration

Cross-modal data integration refers to the process of combining and analyzing data from different modalities or sources, such as email, voice, chat, CRM data, ticketing data, and more. This is one of the biggest blockers for teams trying to build their own AI solutions. Cross-modal data integration aims to extract meaningful insights and knowledge from diverse data sources that may provide complementary or redundant information.

Cross-modal data integration involves several steps, including data preprocessing, feature extraction, alignment, and fusion, or what we call data joining. During data preprocessing, the data from each modality is cleaned, standardized, and prepared for the AI to analyze. Feature extraction involves extracting relevant features or characteristics from each modality, such as text features. Alignment involves mapping the features from each modality onto a common feature space, which enables them to be compared and combined. Finally, fusion involves combining the features from each modality to generate a unified representation of the data. 

​​Cross-modal data integration is integral for AI for Business. For example, when you want to know the most commonly requested feature for a specific cohort of your customers, you need to combine data from multiple inboxes, systems, your CRM, and other modalities to provide a comprehensive and accurate answer.

Data cleansing and normalization

Data cleansing and normalization are critical steps in preparing data for AI applications. In simple terms—garbage in, garbage out. They are also insanely laborious. AI algorithms rely on accurate and reliable data to make predictions or generate insights. Data cleansing and normalization are critical to ensure the data used to train AI models is accurate, consistent, and complete. Think of it this way, you can improve the performance of AI algorithms by reducing noise like duplicate data and other inconsistencies. This can help AI models to make more accurate and reliable predictions. And cleaning and re-structuring data helps to make AI models more interpretable by reducing the complexity and variability of the data. This can help identify the most important features or variables driving the predictions or insights generated by AI models.

One of the most common reasons AI projects fail is due to data cleaning and normalization or, rather, the lack thereof. Business data can come in different formats, such as email, tickets, call transcripts, and more, each with unique characteristics and challenges. Normalizing and cleansing data across these different modalities is a complex task. Now consider that the amount of business data available for analysis is growing rapidly, making it difficult to manage and process data efficiently. This can be particularly challenging when cleansing and normalizing data, which can be time-consuming and computationally intensive.

To compound matters even more, business data is always stored in different systems or databases, which are infrequently compatible with each other. Just think how many different people are emailing your customers. Daunting.

Data cleansing and normalization are important but challenging steps in preparing data for AI applications. It requires domain expertise, technical skills, and a deep understanding of the data and the business problem.

User interface

Let’s say you can get clean, structured data into an AI or large language model. Now what? Now you need a user interface (UI), so business people can inform decisions and workflows with AI-generated insights. This isn’t trivial. You’ll need another team of developers because the people cleaning and preparing the data aren’t the same people who will build an end-user UI. Now you need to build some more software.

A well-designed UI helps users understand an AI system’s capabilities and benefits and increase their willingness to adopt and use it. The UI communicates the outputs and insights generated by the AI system in a way that is easy to understand and interpret. The UI can provide a way for users to input data, provide feedback, or customize the AI system to their needs. This can help improve the AI system’s accuracy and relevance and increase user value.

Data permissions

What next? Sigh. Don’t forget about data permissions. You are going to need them.

Data permissions refer to the rights and permissions required to access and use the outputs from your AI project. They are a critical component of data governance and must ensure that data is accessed per your organization’s policies. Think of this as who can see and use what.

Depending on the nature of the business and the data being used in the AI project, there will likely be requirements around data access and use. Ensuring that the AI project has appropriate data permissions can help to ensure compliance with relevant laws and regulations, such as data protection laws like GDPR.

Simply put, you must build more software to manage who can see what. Data permissions are often reviewed and updated to ensure ongoing compliance with and changing business needs, so your permissioning software has to be commercial grade.

Automations and exhaust

Insights from operational AI systems involve integrating AI models and insights into existing business processes and systems. You must get AI-generated insights to the humans and systems needing the knowledge. This may involve building APIs that allow the model to be called from other applications or systems or integrating the model into your teams’ existing tools like your CRM, email, etc.

Once the AI model is integrated, you have to automate the generation of insights. This may involve setting up automated alerts or reports triggered based on specific events or data conditions or integrating the insights into a dashboard that provides real-time insights (data UI).

By automating insights from AI systems, businesses can gain real-time, actionable insights that can inform decision-making and drive business outcomes. Automating insights can also help improve businesses’ process efficiency and effectiveness by reducing the need for manual analysis and decision-making. Vernon Howard, Co-Founder & CEO at Hallo, exclaims that with AI,

I can automate 20 to 30% of my work now.”

The takeaway is that if you can generate insights, you must autonomously get them to the right place.

Integration requirements

One of the main failure points for AI-related projects is unsustainable methods of extracting and converting data. In the past, this was done manually by interns, business analysts, and data engineers. Technological advancements like APIs make modern data capture processes instantaneously and consistently. This frees data and BI professionals from arduous entry work, focusing their efforts on more rewarding, core business responsibilities.

When selecting any technology solution, it’s important to ensure that it has integration APIs (Application Programming Interfaces) to enable seamless integration with your other systems and applications. Integration APIs allow different software applications to communicate with each other, exchange data, and perform tasks without the need for manual intervention.

Ideally, look for solutions that build direct integrations to platforms instead of using third-party integration platforms. Third-party platforms offer a fast way to connect systems but often lack configurability and data classification controls. Also adding a third-party data integration solution can also introduce another data processor to consider.

Data privacy

The subject matter experts we interviewed for this report agree that privacy concerns are the number one reason AI initiatives fail to launch. Xiaoze Jin, the Lead AI/ML Solution Architect at Rackspace Technology, states,

We’re in a very early inning of cloud AI as a SaaS offering. Privacy, above all else, is most important. Beyond privacy lies responsible AI/ethics, federated learning, and zero-trust framework security.”

Due to privacy concerns and stringent compliance regulations, functional leaders are often stymied by infosec and privacy teams reluctant to allow access to collect or process user data, preventing them from taking full advantage of AI-driven insights. Yacov Salomon, Founder & Chief Innovation Officer at Ketch, states,

Be aware of privacy, the ethical use of data, and the governance of data.”

He continues,

...the world is evolving, and if ML and AI are involved, you need to scrutinize. Governing around data makes a big difference.”

Data privacy laws are designed to protect individuals’ privacy and personally identifiable information (PII). PII refers to any data or information that can be used to identify a specific individual, directly or indirectly. PII can include any information that can be used to identify an individual, such as their name, email address, social security number, phone number, home address, date of birth, driver’s license number, passport number, biometric data, or any other unique identifier.

Any commercial AI solution should have a detailed and thorough approach to privacy. Ask for a copy. Otherwise, look for solutions that automatically de-identify data. The de-identification process can involve removing or masking certain data elements, such as names or addresses. Of course, there are different levels of de-identification, ranging from removing obvious identifiers to more advanced techniques that involve complex algorithms and statistical analysis. The effectiveness of de-identification methods depends on the type of data being processed and the acceptable re-identification risk threshold. Patricia Thaine, Co-Founder & CEO at Private AI, argues,

The easiest thing to do is remove PII as early as possible in the pipeline... At ingest or as soon as you possibly can in your system in order to minimize risk.”

Pro Tip: Sending customer data that includes PII to large language models (LLMs) like ChatGPT is a bad idea and will likely compromise user and confidentiality agreements with your customers.

Security

AI solutions may require access to sensitive data. Ensure that any solution provider maintains a comprehensive Information Security Management program to manage Sturdy’s systems and products’ security, availability, confidentiality, integrity, and privacy risks. The vendor’s program must be independently audited and certified to meet the requirements of Trust Services Criteria SOC2 Type II. You’ll also want to ensure that all data communications into and out of a platform are encrypted-in-transit. That data is stored encrypted-at-rest using industry-standard encryption mechanisms.

Any solution vendor should have a current SOC 2 Type II for your team to review. This is the most comprehensive SOC protocol and attests not only to the suitability of a vendor’s processes and systems but their operational effectiveness of sticking to those controls over a period of time.

Considering these technical requirements when evaluating AI solutions, you can ensure that the solution is compatible with your existing infrastructure and meets your performance needs.

AI is the future wave, and those who do not embrace this ever-evolving technology in their business are already falling behind. Jin argues,

We’ve already begun to see the paradigm shift, creating a new way of living and working with AI as a co-pilot... And yet, we’re still living in the wild west of AI, with land-grabbing occurring left and right.”

Those who do understand the immense power that AI can begin to automate processes, secure insights and increase customer engagement across multiple channels reap immeasurable rewards. As a CEO and business owner, Howard exclaims,

This is a big deal... this is huge.”

Unsurprisingly, AI has become one of the most aggressive sectors in business today. Still, fortunately, there are now plenty of resources and expert advice on how to safely and successfully deploy AI into your company. It’s time to get ahead of the competition and take advantage of this revolutionary force before it zooms us into an entirely new world of business opportunities!

Similar articles

View all
AI & ML

The Context Engine

Joel Passen
May 19, 2026
5 min read
Executive Summary

The Context Engine

The model is not the problem. In every enterprise AI deployment that has hit a production wall in 2026, the failure lives one layer down: in how data is prepared, permissioned, and delivered before the model ever begins reasoning. Model choice has become the wrong question. With Anthropic's Claude surpassing OpenAI in U.S. enterprise adoption (34.4% vs. 32.3%, Ramp AI Index, April 2026), the market has already moved on. The competition has shifted from the Reasoning Engine to the Context Engine.

While nearly every enterprise has deployed frontier models, most are paying a Hallucination Tax they cannot see on their P&L. For an organization with 1,000 knowledge workers, the 4.3 hours per employee per week spent manually verifying AI outputs (Forrester, 2025) equates to approximately $16.8 million in annual salary drain, calculated at a conservative $75 per fully-loaded hour. Multiply that across a global enterprise, and it maps to the $67.4 billion in documented AI hallucination losses recorded in 2024 alone (AllAboutAI, 2025). This is not a failure of the model. It is a failure of architecture.

This paper argues that the next phase of enterprise AI requires a Deterministic Intelligence Layer: infrastructure that normalizes, indexes, and permissions customer data before it reaches the model. Teams replacing token-heavy RAG workflows with deterministic, pre-indexed context are seeing substantial reductions in cost per task while dramatically improving retrieval precision and AI reliability. More importantly, they are crossing the Threshold of Action: the point where AI becomes trustworthy enough to move from surfacing insights to executing workflows.

Section 1

The New Benchmark: Claude's Enterprise Breakout Moment

The AI market just had its crossover moment. As of April 2026, more U.S. businesses pay for Anthropic's Claude than for any other AI model. 34.4% vs. 32.3% for OpenAI, according to the Ramp AI Index, which tracks actual spending across more than 50,000 companies. This isn't a survey about intent. It's purchasing data.

By March 2026, Anthropic was capturing 73% of first-time business AI buyers (Axios, March 2026). A year earlier, one in 25 businesses on Ramp's platform paid for Anthropic. Today, it's nearly one in three.

Enterprise buyers don't switch defaults on a whim. They switch when something is demonstrably working better for the work they actually need done.

The Model Is Not the Problem

Here is the harder truth underneath that adoption story. Despite the crossover, most enterprise AI deployments are not delivering.

Widespread adoption. Widespread underdelivery. Both things are true simultaneously.

The instinct in most organizations is to treat this as a model problem: switch providers, upgrade to the latest version, hire a prompt engineer. None of it moves the needle in any sustained way, because the model is not where the failure lives. Claude is a reasoning engine. A sophisticated one. But a reasoning engine can only reason over what it's given. And in most enterprise deployments, what's given is a mess. Fragments.

The Performance Ceiling

Every technical leader deploying Claude at scale hits the same wall. The demo works. The pilot looks promising. Then it moves toward production, and something breaks. Not catastrophically, but consistently. The AI misattributes an item to the wrong account. It summarizes a customer's history using stale data. It generates an output that sounds authoritative and requires 20 minutes of human verification before it can be trusted.

"Feed a world-class reasoning engine confident, well-structured garbage, and you get the same in return."

This is not a failure of reasoning capability. It is a failure of context architecture. The data required to generate reliable outputs, account history, communications, support activity, call transcripts, and operational metadata typically exists across fragmented systems with inconsistent normalization, disconnected permissions, and no canonical entity resolution layer tying it together.

Context Is the New Infrastructure

The companies pulling ahead in 2026 are not winning because they chose a better model. They are winning because they solved the harder problem underneath it: delivering clean, resolved, permission-aware context before the model ever begins reasoning.

  • IT, Data, and Platform Engineering provide the Engine (Claude): a recurring operating expense. World-class reasoning, rented.
  • RevOps, Data, and AI Teams provide the Map (the Deterministic Data Layer): a long-term asset. Customer intelligence, owned.

Claude is the current catalyst. The model market will keep moving. New releases, new providers, new pricing. What doesn't move is the underlying problem: fragmented, unresolved, improperly permissioned data. Deterministic context is the durable architecture. The organizations building it now will carry that advantage into every subsequent model generation.

Most organizations already have the engine. What they lack is the map.

Section 2

The Hallucination Tax: Why Fragmented Data Kills AI Performance

If the model isn't the problem, why are so many production-grade AI initiatives hitting a performance ceiling? The answer is the Hallucination Tax.

In 2024, hallucinations cost enterprises an estimated $67.4 billion in global losses (AllAboutAI, 2025). By early 2026, the cost has shifted from outright fabrications to "silent hallucinations": outputs that look structurally perfect but are factually untethered from the current state of the business.

For an organization with 1,000 knowledge workers, the 4.3 hours lost per person per week equates to roughly 223,600 hours of wasted annual productivity, approximately $16.8 million in annual salary drain at a conservative, fully loaded rate. It never appears on the P&L as an AI cost. It shows up as underperformance, missed forecasts, and slower deal cycles.

This forces employees to act as "Human Middleware": the bridge between fragmented systems and the AI that was supposed to make them irrelevant. This tax is the direct result of four specific architectural failure modes.

Failure Mode 1: Retrieval Precision (The Token Tax)

Standard RAG is probabilistic. It retrieves semantically similar fragments, not operational truth. When a sales leader asks, "Why did we lose this seven-figure deal?", the system may surface an old QBR deck instead of the pricing objections in email, the procurement concerns buried in Slack, the legal escalation in Jira, and the product gaps discussed in call transcripts that actually determined the outcome.

Because retrieval is imprecise, teams over-index by stuffing the context window with every possible document to ensure the right one is in there. The result: thousands of reasoning tokens spent filtering noise. A world-class reasoning engine doing the work of a search index.

Failure Mode 2: "Lost in the Middle" (Attention Drift)

Research by Liu et al. (TACL, 2024) demonstrated that accuracy on multi-document reasoning tasks drops by more than 30 percentage points when relevant information is buried in the middle of a long context window. This matters enormously in enterprise environments, where critical signals are scattered across support escalations, pricing discussions, call transcripts, Slack threads, and CRM updates. Simply increasing context size does not solve the problem. In many cases, it amplifies it by forcing the model to attend to more noise.

Failure Mode 3: The Identity Crisis (Entity Disambiguation)

In a fragmented environment, identity is a variable, not a constant. "Jane Doe" in a Zoom transcript needs to resolve to the same Jane Doe in Salesforce, Gmail, Zendesk, Slack, and the CRM activity timeline. Without deterministic entity resolution, the model is forced to infer whether those interactions belong to the same person, account, or buying committee.

Without deterministic entity resolution, the model is forced to reconstruct identity probabilistically. A support escalation tied to one stakeholder, a pricing objection raised in a sales call, and an executive concern discussed over email may be incorrectly assembled into the wrong account narrative entirely.

Failure Mode 4: The Permission Ghost (Unauthorized Surface)

This is the silent killer of enterprise AI programs. Most RAG pipelines lack Source-System Parity. If the AI retrieves a snippet from a private executive email because it was "semantically relevant" to an intern's query, the system has failed regardless of whether anyone noticed.

Incidents like EchoLeak show exactly why retrieval-layer permission enforcement matters. In late 2025, researchers demonstrated a zero-click vulnerability in Microsoft 365 Copilot that could exfiltrate sensitive data from Copilot context without user interaction. No prompt injection required. The retrieval layer was the attack surface.

For most organizations, the permission layer isn't just a technical problem. It is an organizational liability that Legal and Security will eventually force you to solve on a deadline, under pressure, after something has already gone wrong.

The Production Wall

These four failure modes create the Production Wall. A curated demo can appear remarkably accurate. But production environments are not curated. They are noisy, fragmented, and constantly changing, with critical signals distributed across emails, calls, support threads, Slack conversations, and operational systems evolving in real time.

"You cannot solve these four problems by tuning the prompt. You have to solve them by fixing the context."
Section 3

The Deterministic Intelligence Layer

To climb over the Production Wall, enterprise architecture must evolve. The solution is not a larger context window or a more complex prompt. It is a fundamental shift in how data is prepared for the model. Enter the Deterministic Intelligence Layer: infrastructure that sits between your raw data silos and Claude, acting as the architectural antidote to the four failure modes in Section 2.

The Four Pillars

1. Precision Indexing (Ending the Token Tax)

Instead of relying on similarity search alone, the context layer resolves entities, removes duplication, and prioritizes high-signal interactions before retrieval. The model receives structured operational context rather than raw fragments competing for attention.

In Sturdy-observed deployments, replacing raw context with pre-indexed, distilled payloads has reduced token consumption by 80 to 90% on comparable workflows. Results vary by source data density and baseline architecture. You stop paying for Claude to be a search filter.

2. Signal Distillation (Solving "Lost in the Middle")

Semantic Pruning strips HTML headers, Slack noise, legal footers, and the RE: FWD: RE: reply chains that bury every actual decision in 40 lines of quoted text, distilling threads into thematic buckets: Bug Reports, Feature Requests, Sentiment Shifts. The most critical insights land at the beginning of the context window, bypassing the 30-point accuracy drop documented in long-context research.

3. Deterministic Entity Resolution (Fixing the Identity Crisis)

A Global Entity Map resolves disparate naming conventions into a single, immutable Customer ID. Claude is no longer guessing whether two conversations belong to the same account. It is being told they do.

4. Parity-Enforced Permissions (Exorcising the Permission Ghost)

The retrieval layer enforces source-system permissions before context assembly, so unauthorized records are excluded from the payload sent to the model. This is not a prompt-level instruction that can be overridden or confused. It is an architectural enforcement point that sits entirely upstream of the model.

Security becomes a structural property of the architecture, not a probabilistic instruction to the model. Incidents like EchoLeak show why this distinction matters: when permission logic lives inside the prompt, the retrieval layer remains an attack surface. When it lives at the data layer, it doesn't.

Reference Implementation: Sturdy + Claude via MCP

While the merits of this architecture are clear, building it internally results in years of maintenance debt (see Section 5). Sturdy leverages the Model Context Protocol to serve as the Context Engine for Claude, normalizing, indexing, and permission-stamping your customer intelligence layer across Salesforce, Gmail, Slack, and Zendesk before Claude ever queries it.

Claude provides the Reasoning Layer. Sturdy provides the Memory and Context Layer. Together, they move an enterprise from AI that reads your business to AI that acts on it.

Section 4

What It Unlocks: From Reading to Acting

In 2026, summarization is a commodity. The competitive advantage lies in moving from AI that reads your business to AI that acts on it. This transition requires a fundamental shift in how leadership views the AI stack and who owns what.

  • IT, Data, and Platform Engineering provide the Engine (Claude): recurring operating expense. World-class reasoning, rented.
  • RevOps, Data, and AI Teams provide the Map (the Deterministic Data Layer): a long-term asset. Customer intelligence, owned, not rented.

When the engine has a perfect map, the Acceleration Gap closes.

RevOps: The Revenue Architect

For the RevOps leader, a deterministic layer turns fragmented operational data into active revenue signals. Instead of building static dashboards that explain why a quarter was missed, RevOps can monitor the commercial signals that actually move deals: pricing hesitation in email, procurement delays, legal friction, competitive mentions, executive disengagement, stalled next steps, and tone changes across active opportunities.

A deterministic context layer resolves those signals to the right person, account, opportunity, and timeline before AI ever reasons over them. That is what turns scattered communication into reliable revenue action.

RevOps stops being a report generator. It becomes the operating system for revenue execution: designing the logic that turns verified commercial signals into coordinated GTM action.

Sales: Instant Account Intelligence

The average sales rep spends roughly 20% of their week on pre-call research. With a deterministic layer, the account briefing is no longer a probabilistic summary. It is a verified snapshot: "The customer's last three support tickets were resolved, but they haven't yet implemented the API update discussed in the March QBR."

Product: The Automated Feedback Loop

Product managers are often the most data-rich but insight-poor employees in the company. A deterministic layer moves PMs from reading feedback to querying insights. Claude analyzes 60 days of feedback across Slack and Zendesk and, with a single prompt, generates a high-fidelity Jira ticket including exact customer quotes, impacted account IDs, and revenue at risk.

Customer Success: Proactive Triage

In CS, latency is the enemy. A deterministic layer allows Claude to perform live triage. When a customer sends a frustrated email, the AI checks contract terms and recent product usage logs before the CSM has finished reading the subject line. It presents a Context-Aware Response ready to send, grounded in verified account data.

"The model you license today is rent. The customer intelligence layer you build is equity. One gets replaced. The other compounds."

Every account signal normalized, every entity resolved, every permission enforced. That accumulates. The organizations building this layer now are building institutional memory that makes every model they run on top of it better.

Section 5

The Build vs. Buy Reality

The instinct for most sophisticated IT and data teams is to build. It is a legitimate impulse. The stack looks deceptively simple: a few API connectors, a vector database, and some chunking logic. In the demo phase, an internal build often feels like the most cost-effective path.

The Four Hidden Engineering Hurdles

1. The Normalization Treadmill

Building a connector to Salesforce is straightforward. Maintaining the logic layer that resolves entity names across Salesforce, Slack, and Zendesk as those systems' schemas evolve is a full-time engineering job. This is Semantic Drift: hundreds of developer hours consumed by maintenance rather than innovation.

2. The Permission Mapping Paradox

Mapping row-level permissions from source systems into an AI context window is one of the most complex security challenges in modern software. Most internal builds rely on prompt-level security, which fails under the weight of incidents like EchoLeak. This isn't a technical trade-off. It is an organizational liability waiting to be forced into crisis.

3. The Latency Wall

A custom RAG pipeline often takes 5 to 10 seconds to fetch and clean data. In Sturdy-observed deployments, pre-indexed deterministic retrieval consistently operates under 1 second on production data volumes, but reaching that benchmark requires specialized search infrastructure expertise that is rarely the core competency of a generalist data team building from scratch.

4. The Token Optimization Tax

Without signal distillation, internal builds routinely pass 3x to 5x more tokens than necessary. Teams save on build costs only to spend twice as much on model API costs.

Where Does Your Engineering Dollar Go?

The strategic question isn't "Can we build this?" It's "Should we own the maintenance of this?"

Competitive advantage does not live in the plumbing. No customer chooses a vendor because their AI has a better Python script for cleaning Slack data.

By offloading the Normalization Treadmill to Sturdy, organizations are promoting their engineering teams from Data Cleaners to AI Product Owners, moving their best people away from the maintenance treadmill and toward the high-value work of building AI that drives revenue.

Buy the plumbing. Build the logic. The teams doing this are shipping revenue-generating AI workflows, while their competitors are still debugging entity-resolution scripts.

Section 6

What to Do Now: The 2026 Roadmap

The Acceleration Gap is not a permanent state. It is a choice of architecture. The move is not to wait for a smarter model. The move is to fix the context. Here are four moves for leadership to take in the next 90 days.

Move 1: Audit Your Retrieval Precision, Not Your Prompts

Most teams spend the majority of their time prompt-tuning errors caused by bad data retrieval. The action: Run a Ground Truth test. Take ten complex customer queries and manually check the data fragments Claude is being fed. If more than 20% of that data is noisy, stale, or misattributed, no prompt engineering will save the deployment. You have a plumbing problem, not a reasoning problem.

Move 2: Isolate a Multi-Source Workflow

The highest ROI for a deterministic layer is found where data is most fragmented. The action: Pick a high-value, closed-loop use case where data lives in at least three systems. For example: the path from customer feedback in Slack and Zendesk to an engineering action in Jira. Solve the context problem here, and you've built a blueprint for the rest of the organization.

Move 3: Enforce Permissions at the Data Layer

Stop treating security as a probabilistic instruction. The action: Move permission enforcement out of the system prompt and into the retrieval infrastructure. Ensure the retrieval layer enforces source-system permissions before context assembly, so unauthorized records never reach the model. The Permission Ghost is exorcised structurally, not instructionally, and the organizational liability is removed before Legal ever has to get involved.

Move 4: Define Where AI Earns the Right to Act

The distance between AI that summarizes and AI that executes is a trust gap, not a technology gap. The action: Build human-in-the-loop approval gates for high-stakes actions. Drafting a renewal contract. Creating a Jira ticket. Sending a support response. Use your deterministic layer to provide the required Confidence Equity. The threshold to target is a sub-5% error rate on AI-generated drafts. That is the point at which approval gates can be safely reduced, and workflows become self-sustaining.

Traditional probabilistic RAG architectures struggle to reach this threshold consistently at enterprise scale. Because probabilistic retrieval introduces entity errors, stale data, and permission noise, error rates on complex multi-source tasks typically stabilize in the 15 to 30% range regardless of prompt quality, even with hybrid retrieval and reranking layers added on top.

A deterministic layer that resolves entities before inference, distills the signal before retrieval, and enforces permissions before the model ever sees the data is the only architecture that makes sub-5% structurally achievable, rather than an occasional lucky outcome.

In Sturdy-observed deployments, teams that reach this threshold have consistently moved to reduced-oversight approval workflows within a quarter. Results depend on workflow complexity and baseline data quality. Reaching the sub-5% Trust Threshold is the definitive signal that an organization has graduated from "AI Experiments" to a Context Engine architecture capable of autonomous action. That is the architectural line between AI that assists and AI that acts.

Conclusion

The Architectural Advantage

Frontier models will continue to improve and commoditize. The durable advantage is no longer the model itself. It is the architecture surrounding it.

The long-term value does not live in another standalone AI interface. Interfaces change too quickly. The durable layer is the operational context infrastructure beneath them.

Organizations that solve deterministic context assembly, entity resolution, permission-aware retrieval, and operational state assembly gain a compounding advantage independent of whichever model, interface, or orchestration layer dominates next year.

Organizations that solve context architecture today are building infrastructure that compounds across model generations. As interfaces evolve and models improve, the operational context layer beneath them becomes increasingly valuable.

"The era of the Context Engine is here. Is your architecture ready for it?"

AI & ML

Your AI isn’t the problem. Your data is.

Joel Passen
May 6, 2026
5 min read

IT leaders may have resisted AI early, but that phase passed quickly. The real concern wasn’t whether to use it. It was how to control it. Governance, security, visibility. In the end, it came down to preventing sensitive work from being done in personal accounts. Reasonable.

So they got comfortable, signed off, and rolled it out. ChatGPT, Copilot, Claude, company-wide, with guardrails.

People are using it. That part worked.

The disappointment

The problem is what revenue leaders are finding now that it’s live.

The data they actually want to use isn’t accessible in any meaningful way. And that matters more than most people realize, because LLMs are only as useful as what you put in front of them. They’re exceptional at reasoning over structured, coherent information. They’re not designed to reconcile fragmented, inconsistent data spread across a dozen systems.

Nobody’s model is.

So instead, people compensate.

They cut and paste. Drop in exports. Upload a batch of emails and call transcripts, and hope coherence comes out the other side.

It doesn’t. They get fragments. Plausible-sounding ones, but fragments.

The diagnosis

What commercial leaders are running into isn’t a model problem. It’s a data problem.

The data they actually care about isn’t unified. It lives across email, Slack, Zoom, support tickets, calls, and CRM notes. Different systems. Different formats. No shared identity. No relationship context.

Even with connectors. Even with MCPs.

Because underneath it all, the data isn’t organized in a way a model can reason on. There’s no canonical view of the world.

The model doesn’t know that the same person shows up in Zoom, Slack, Zendesk, and Salesforce. It doesn’t understand that those interactions belong to the same thread, the same account, the same moment in a relationship.

So it fills in the gaps.

Not because it’s weak. Because it has to keep trying.

The gap

Meanwhile, the models themselves have gotten amazingly powerful. Reasoning is sharper than it’s ever been and getting better daily.

But the data layer most companies are feeding them? Still immature.

According to MIT’s 2025 State of AI in Business, over 80% of companies have explored or deployed LLMs, but only around 5% are seeing meaningful business impact.

High adoption. Low transformation.

That’s not a model problem.

What’s possible

What it looks like when this actually works is different.

Not dashboards. Not reports. Not exports.

A conversation. Like having the best revenue ops analyst you’ve ever worked with on call, one who has read every email, sat in on every call, and never forgets anything.

You ask: “Which accounts have shown signs of churn risk in the last 90 days?”

And instead of a guess, you get a ranked list. Accounts. ARR. The exact messages where the signal showed up. What changed. What triggered it. What to do next.

So you ask a follow-up: “Which of these are new customers?”

Now you’re looking at onboarding breakdowns. Common threads. Where the process is failing.

So you keep going: “Where are we missing expansion opportunities?”

And it surfaces accounts where someone said, “We’re thinking about rolling this out to another team.” But nothing was logged. No opportunity created. No follow-up.

That’s the shift.

You’re no longer stitching together context. You’re interrogating it.

What changes

What changes when you fix the data layer, when your commercial data is normalized, deduplicated, and accessible, isn’t just speed.

It’s the level of questions you can ask.

These aren’t dashboard queries. They’re judgment calls. The kind that used to require a senior operator spending a weekend in spreadsheets and Salesforce. When your data layer is clean and the model has real context to work with, they become a 90-second conversation.

That’s the difference. Not a better model. A better fuel.

The data infrastructure reality

Most teams won’t get there by accident. The infrastructure problem is real: identity resolution across systems, conversation reconstruction across channels, deduplication, and signal enrichment. It’s six to twelve months of plumbing if you build it yourself.

The companies that crack it first won’t just be more efficient. They’ll be operating with a fundamentally different information advantage. They’ll see churn coming, spot expansion signals, catch friction early, before any of it shows up in the numbers.

At that point, the question changes.

It’s not whether AI works.

It’s whether your data is ready for it.

And whether you’re going to build that layer, or keep working around the absence of it.

This is what we're building at Sturdy.ai. The data layer your LLM actually needs.

Insight Updates

The Moment B2B Sales Teams Forget Everything They Learned During the Deal

Joel Passen
May 6, 2026
5 min read

It’s not the close. It’s not the kickoff call. It’s the 48 hours in between — when the contract gets signed, the champagne (metaphorically) gets popped, and everything the sales team learned over months of conversations, negotiations, and relationship-building quietly disappears.

The delivery team inherits a contract and a few CRM notes. Not the story behind the deal.

This is the handoff problem. And it’s costing companies more than they realize.

Why the Knowledge Dies at the Signature Line

Think about what actually happens during a complex B2B sale.

Over weeks or months, a sales team accumulates an extraordinary amount of institutional knowledge. They learn why the buyer is actually moving now — not the official reason, but the real one. The compliance incident that became a board-level conversation. The internal champion who’s been pushing for change for two years and finally got budget. The exec who’s skeptical and needs to see a specific proof point before they’ll get on board.

They learn who matters and how decisions actually get made, which is almost never what the org chart suggests. They learn what got promised in the final stretch: the SLA clause that got added at the last minute, the integration that’s now contractually locked, the go-live date that the CFO has already presented to her board.

None of that lives in the CRM. It lives in emails, call recordings, Slack threads, and people’s heads.

And the moment the deal closes, the sales team moves on to the next one. That’s their job. That’s how they get paid. But the institutional knowledge they spent months building the context that would let an implementation team start informed, instead of starting over, largely evaporates.

Onto the next pipeline review.

The Cost Nobody Is Measuring

Companies measure churn. They measure NPS. They measure time-to-value.

Most don’t measure the cost of the knowledge gap at handoff — because it doesn’t show up as a line item. It shows up as implementation delays. Escalations. Customers who feel like they have to repeat themselves six months into a relationship that should already be mature.

It shows up as promises made during the sale that nobody on the delivery side knew about. Commitments that surface in month three as a nasty surprise. Expectations that were set in a negotiation conversation that never made it into a system anyone on the CS team can see.

The SaaS industry has spent a decade optimizing the top of the funnel. Sophisticated systems for capturing and qualifying demand. Playbooks for every stage of the sales motion. Entire conferences dedicated to pipeline hygiene.

And then we hand a contract and a prayer to the team responsible for actually delivering the value we sold.

What Good Looks Like

I’ll make this concrete.

We recently ran Sturdy against a real deal, a $190K ACV implementation that had just closed. Board-level compliance incident drove the urgency. CFO was the economic decision-maker: analytical, direct, not interested in being charmed. An integration was contractually locked in Exhibit A. Timeline slippage wasn’t just an ops problem; it would retrigger board scrutiny because of the prior incident.

The implementation team knew all of that before the first kickoff call.

Not because someone wrote a perfect handoff email at 11 pm the night before go-live. Because Sturdy read across the entire deal — emails, calls, negotiations — and surfaced the context that actually matters: why they bought, who really matters internally, what was promised, and where the risk lives.

That’s the brief I show in the video. Notice how specific it is. Notice that it doesn’t just describe what happened, it tells the delivery team what to do with it.

That’s what institutional knowledge looks like when it doesn’t get lost.

The Broader Shift

The handoff problem is really a symptom of something larger.

B2B revenue has always been a team sport — sales, CS, implementation, product, and finance all own a piece of the outcome. But the systems we’ve built treat each function as a silo. Data gets entered into the CRM by whoever remembered to do it. Calls get recorded and filed somewhere nobody looks. Emails pile up in inboxes that get searched only when something’s already on fire.

The signals are there. The context exists. It’s just buried, and it disappears at exactly the moments in the customer lifecycle when it’s most needed.

The companies that figure this out and build systems to capture, preserve, and operationalize institutional knowledge across the revenue lifecycle will have an operational advantage over those still relying on heroic individual effort and the hope that someone wrote a good handoff doc.

This isn’t an incremental improvement. It’s a different way of operating.

The moment a deal closes should be the moment an organization puts everything it learned to work.

Right now, for most companies, it’s the moment they forget it.

That’s the problem Sturdy was built to solve. If this resonates, start at sturdy.ai.

Your customers are already telling you what's going to happen.

Connect what customers say to the reasons your numbers move. Contextual revenue intelligence, ready for any LLM - or running natively in Ask Sturdy from day one.

Unlock Your Accounts
How to Incorporate AI into Your Business Today