How can verifiable and explainable AI change coding?

Vibe coding works for tasks you could have solved with a Stack Overflow search; on complex problems, the bottleneck shifts from writing to review — you can't be sure the code handles every edge case. Verifiable AI changes this by formalising requirements and mathematically proving the generated code meets them, including correctness across all inputs. The competitive advantage will shift from writing elegant code to clearly specifying what you want — though mainstream adoption is still a few years away.

What are the risks of spreading AI, and why does TSMC matter?

The biggest constraint on AI development isn't algorithms — it's chip manufacturing. NVIDIA, Google, and AMD all produce their accelerators at TSMC in Taiwan, and the most advanced process nodes are concentrated there. This is a concentration risk by any definition, made worse by geopolitical tensions in the region. TSMC is expanding to Arizona, but for now the world's AI infrastructure depends heavily on one company in one sensitive location.

Why is AI increasingly seen as an integration platform?

Enterprises want vertical, end-to-end task solving that is aware of existing systems, uses and updates them, and works without errors. AI is increasingly a layer that brings together a decade of model development, accumulated domain knowledge, and existing tools to create a digital workforce. A single complex task — say, analyzing a market entry — needs several agents (data retrieval, financial modeling, compliance, reporting) selected, coordinated, and aggregated by an intelligent platform.

Will small local models catch up with frontier models?

The frontier keeps improving, but gains increasingly come from engineering rather than fundamental breakthroughs, so smaller and open-source players are closing the gap. A 7B-70B active parameter range still seems necessary for complex reasoning, but within it, smarter inference engines (continuous batching, Mixture-of-Experts offloading) and hybrid Mamba-Transformer architectures like NVIDIA's Nemotron-3 Nano make serving significantly faster and cheaper.

Why do 95% of enterprise AI pilots fail, and will that change?

A widely cited MIT study found 95% of enterprise AI pilots deliver zero measurable ROI, but that is starting to change. Teams have become more humble and patient: they integrate accumulated knowledge and existing solutions, understand the problem before throwing AI at it, and prioritize projects with realistic success criteria over moonshots. Combined with models that are genuinely better at tool use and multi-step collaboration, the failure rate should drop meaningfully.

How should knowledge workers prepare for agentic AI?

Treat AI as the successor you are training: write skills, build tools, and automate the routine parts of your job. That lets you solve critical (urgent and important) tasks faster and delegate busywork to agents or AI-enabled junior colleagues, freeing you for the non-urgent but important work — finding augmentation opportunities, designing new services, and solving problems where no training data exists. Imagine what you could create with infinite employees, and start building toward that.

2026: AI Won't Take Your Job (It'll Take Your Busywork)

AI can be scary for white-collar office workers — like myself — who wanted to stay professionally active for a few more years. It writes computer programs in languages I’ve never heard of. It makes presentations, phrases letters, writes blog posts, drafts diagrams, analyzes data. But it’s not just scary — it’s exciting and interesting too. So let’s set aside our existential crisis for a moment and appreciate what’s now possible. This post draws an optimistic scenario where we don’t just keep our jobs because TSMC can’t produce enough chips — but where we find ourselves doing more fulfilling work.

It will be uncomfortable to read back in 2027, once it becomes clear how many things I missed and how many predictions I got wrong. Still, I hope you’ll enjoy reading this mid-to-short-term AI roadmap prediction.

Here’s what I’ll cover: why AI is becoming an integration platform rather than a replacement tool, why smaller models are improving faster than frontier ones, how verifiable AI could make “vibe coding” enterprise-ready, and why the 95% enterprise AI failure rate is about to drop significantly. Along the way, some thoughts on security, knowledge graphs, the TSMC bottleneck, and whether you should learn to bake and brew coffee as a plan B.

AI as Integration Platform: 2026 Is the Year of Collaboration

It’s impressive that AI can write code and solve mathematical theorems. But enterprise adoption is still remarkably low. In most organizations, AI isn’t freeing up resources yet — it’s adding extra complexity on top of existing workflows. Everyone feels it should be useful, but the return on investment hasn’t materialized for most.

Enterprises have well-tested, reliable workflows that took years to build. They want vertical, end-to-end task solving — solutions that are aware of existing systems, that use and update them, and that work without errors. Perhaps with some human oversight at the beginning. What they’re building, essentially, is a digital extension of their workforce. (I know — it doesn’t sound optimistic yet. Bear with me. 🐻)

This realization is shifting how companies think about their AI roadmaps. AI is increasingly seen as an integration platform: a layer where you can bring together your past decade of model development, your accumulated domain knowledge, and your existing tools — to create a digital workforce that actually delivers value. And because that workforce is genuinely valuable, it justifies real investment.

Now, consider what a single complex task looks like in practice. Analyzing a market entry opportunity might need a data retrieval agent, a financial modeling agent, a regulatory compliance agent, and a report generator. You need to select these agents from your pool, provide them with relevant data, coordinate their work, and aggregate the results. For that, you need an intelligent platform that handles this orchestration well.

On the supplier side, you can already find third-party agents on Google Cloud Platform and AWS Marketplace. However, these don’t yet solve your problem end-to-end — they can’t automatically select the right agents based on your budget and predicted token usage for a given task. But they will. And if you’re an SME without an internal platform and years of accumulated data, you’ll rely on these external providers. Within these platforms, there will be competition among agents: only well-tested ones will be able to command premium prices, while new providers will need to offer lower rates until their track record grows. The platform will split the payment and take a commission. Some blockchain-based B2B agent platforms already exist, but none has emerged as the standard yet — and most are still far from production quality.

Meanwhile, the human-in-the-loop pattern is maturing (which sounds like the opposite of these fully autonomous agent platforms, but here we’re back in the enterprise environment). AI is not ready for full autonomy in high-stakes settings, and it shouldn’t be. But instead of constant hand-holding, AI systems are learning which tasks you’re comfortable delegating and which ones need a checkpoint. The interaction is becoming adaptive: the AI requests human input when it’s uncertain, then learns from the correction to need less intervention next time. Full autonomy for complex enterprise tasks is probably 3-5 years away. But the path from “supervise everything” to “review the important bits” is already well underway.

For building usable AI platforms, several building blocks need to mature: efficient on-premise models (you can’t share all your data with third-party vendors), verifiable AI (you need to know the problem is actually solved), secure AI (always), and efficient search across an ocean of data and tools. Plus humans — to drive the innovation that can’t be taught yet. And silicon — because everything runs on chips. The next sections cover these pieces.

(A technical note: these collaborative agent platforms likely won’t follow a hierarchical “CEO agent delegates to mid-level managers” pattern. More democratic algorithms — where agents bid on tasks based on their capabilities and pricing — tend to be more effective. Think auction-based orchestration rather than top-down command.)

Local Intelligence and Efficient Small Models

The frontier models keep improving, but the gains increasingly come from engineering rather than fundamental breakthroughs. If no radically new architectures or training methods emerge — let’s see if AMI Labs changes that — the progress at the top will slow, and smaller (and OS) players will gradually close the gap. Meanwhile, the big labs (OpenAI, Anthropic, Google) are increasingly motivated to use smaller models themselves. Cost efficiency matters, and chip access is becoming the real bottleneck.

The next paragraph gets technical. Feel free to skip — a dedicated (and more accessible) post on inference engine internals is coming.

There’s a floor to how little computation you need for solving complex tasks. A 7B to 70B active parameter range seems necessary for complex reasoning. But within that range, you can save significant resources, driven by two forces:

First, inference engines are getting much smarter. Tools like vLLM — the serving infrastructure between your application and the model — are solving the practical hardware utilization challenges that arise in real deployments. A GPU’s processing capacity is only as useful as its memory bandwidth allows. With concurrent users sending requests of wildly different lengths, keeping both optimally busy requires smart scheduling. Continuous batching slots new requests the moment a slot frees, rather than waiting for a whole batch to finish. Memory is equally constrained: modern large models use a Mixture-of-Experts architecture, where only a subset of specialized “expert” sub-networks activates per token. Inference engines exploit this by keeping frequently used experts resident in GPU memory while offloading the rest — effectively running a model far larger than the GPU’s VRAM would otherwise allow. These optimizations are still maturing, with better CPU-GPU hybrid execution and smarter memory hierarchies on the roadmap.
Second, model architectures are evolving beyond pure Transformers. Hybrid State Space Models like NVIDIA’s Nemotron-3 Nano — a Mamba-Transformer hybrid — offer massive context windows with significantly faster and cheaper inference than traditional Transformers. Instead of processing every token against every other token (the quadratic cost that makes long contexts expensive), these hybrids selectively use attention only where it matters.

Local models will also handle multi-modal inputs more fluidly — processing text, images, and structured data together rather than as separate pipelines — which is necessary for wider adoption.

Verifiable and Explainable AI

Vibe coding — asking an AI agent to write code by just describing the task — has made software development more accessible. Anyone can prototype a small application with the help of an agent. But in an enterprise or professional software development environment, vibe coding works for the tasks you could have solved with a Stack Overflow search a few years back (rest in peace). For complex problems — the ones where a good decision makes the solution faster or cheaper by orders of magnitude — engineers still matter.

AI-assisted coding does make development significantly faster for routine, well-structured tasks. The general speedup varies — studies report 30–55% for scoped tasks, though for highly structured work like writing ETL pipelines or containerizing deployments (where coding standards and templates are well-documented), the gains can reach 2–3x. The bottleneck isn’t writing speed; it’s review (and bugfix). You can’t be sure the generated code handles all edge cases. And that review cost eats into the time savings.

This is where verifiable AI enters the picture. By formalizing requirements and mathematically comparing them against the generated code, we can prove that code meets its specification — including properties like termination (it won’t run forever) and correctness for all inputs. Take this further, and you can reverse the logic: given a formal specification, find the optimal implementation automatically. This could make vibe coding genuinely enterprise-ready. The technology is progressing, but mainstream adoption is probably a few years out.

From the user’s perspective, verifiability matters just as much for analytics and dashboards. When an automated tool calculates a result or adds a new KPI to a dashboard, you need to understand how it got there. The queries and logic should be explainable alongside the results.

This leads to a broader shift in what “code” even means. As AI gets better at finding optimal implementations, the competitive advantage shifts from writing elegant code to clearly specifying what you want. Code will increasingly look like structured text: high-level descriptions with drill-down layers where you can inspect the pseudo-code algorithm behind a three-sentence explanation. You won’t need to read the implementation details. You’ll discuss them with your AI assistant.

Secure AI

As AI agents gain more autonomy and access to sensitive systems, security becomes non-negotiable. The vulnerabilities are real: the same prompt injection techniques that make for entertaining LinkedIn posts about “lousy AI providers” are the exact mechanics behind real security breaches - stealing data, executing unauthorized code, or hijacking entire applications. I covered the attack surface and defense strategies in detail in a previous post.

The core principle hasn’t changed: defense requires layers. It starts with architecture - restricting the LLM to calling controlled functions rather than writing arbitrary code, limiting API access to the user’s own permissions, and sandboxing execution so that even when something goes wrong, the blast radius stays contained.

On top of architecture, guardrail solutions are becoming a standard production component. These work as proxies that intercept every message - user to LLM, LLM to user, even agent to agent - and run them through a policy engine combining regex filters, intent classifiers, specialized transformers, and sometimes a smaller LLM acting as a judge. Industry tools include NVIDIA NeMo Guardrails, Azure AI Content Safety, Google Vertex AI Safety Filters, and Amazon Bedrock Guardrails. The built-in safeguards of models like Claude, Gemini, and GPT add another layer, though they shouldn’t be the only one.

Red teaming is moving from best practice to requirement. Automated red teaming tools like Azure PyRIT, Giskard, and DeepEval bombard your system with thousands of adversarial prompts before the first real user ever touches it. All of this adds latency and cost. But the alternative - an unsecured agent with access to your production systems - is not an option worth considering.

Large RAG and Graph AI

Now consider the data landscape. In every industry, you can access a vast ocean of sources: internal documents, external databases, public research, code repositories, methodologies, articles, books, and logs from earlier tool usage. You also have tools — private and public MCPs (a standard protocol that helps agents use external tools), specialized agents, documented workflows, best practices packaged as reusable skills.

Much of this data is contradictory. Some is outdated. Some comes from unreliable sources. Some tools work well; others don’t. And your complex task probably needs several of them together, pulling fragments from across this entire ocean.

This means your agents need memory that improves over time. They need to learn how to navigate a massive knowledge base, how to select the right sub-agents for a given task, how to approach problems systematically. They should record their experiments and learn from the outcomes. And critically, they need to connect the dots — generating synthesized knowledge rather than just retrieving raw sources. You can’t copy 100 books into a single context window (it would be impossibly slow), but you can build a knowledge layer that captures the key concepts, relationships, and patterns across those books.

On top of complexity, there’s speed. Your retrieval system needs to be fast, not just thorough. At Lynx Analytics, we use Graph AI to address these challenges — representing knowledge as connected graphs rather than flat document collections, which lets agents traverse relationships and find non-obvious connections between pieces of information. We even have some tricks to make it fast at inference time (another promised future blog post — stay tuned).

The Human Touch - Reducing Failures

A widely cited MIT study found that 95% of enterprise AI pilots deliver zero measurable return on investment. That number sounds devastating, but it’s starting to change.

People working on AI transformation have gotten more humble and patient. They’ve learned what AI can and can’t do, and they’ve stopped expecting magic — instead, they’re integrating accumulated knowledge and existing solutions. They put more effort into understanding the problem before throwing AI at it. They write better prompts, provide better examples, feed in more relevant data, enable better tools and workflows, and learn more from feedback. Companies are prioritizing projects with realistic success criteria over moonshots that look good in a board presentation. Meanwhile, AI models have genuinely improved at tool use and multi-step collaboration.

The result will be a meaningful drop in that failure rate. And AI-powered customer service - currently the most visible source of user frustration - will finally stop being annoying (my riskiest prediction). The technology is ready. It’s the engineering discipline that’s catching up.

Bottleneck: We Only Have One TSMC

The biggest constraint on AI development right now isn’t algorithms — it’s chip manufacturing. Consider where the high-end AI chips come from:

NVIDIA manufactures its GPUs at TSMC in Taiwan,
Google produces its TPUs at TSMC in Taiwan,
AMD produces its GPUs and CPUs at TSMC in Taiwan.

There are alternatives — Intel Foundry Services, Samsung, and SMIC (Huawei’s manufacturing partner) — but the most advanced process nodes are all at TSMC.

This is a concentration risk by any definition. And since AI accelerators need replacement roughly every 3–4 years to stay competitive, the bottleneck isn’t going away soon. TSMC is expanding capacity (including new fabs in Arizona), and some production is diversifying to the US. But for now, the world’s AI infrastructure depends heavily on a single company in a geopolitically sensitive region.

Some Peripheral Trends

Before we get to the “will I lose my job” section, here are a few more predictions and directions for the next year:

Robotics and AR will produce impressive demos. Not just humanoid robots walking around, but task-specific machines doing useful things in warehouses, hospitals, and farms.
IoT intelligence — smart devices communicating autonomously. Your smart scale advising your fridge on what to stock, so you can never eat a chocolate pudding at 11 PM again. (Whether this is a feature or a bug is left to the reader.)
Specialized AI chips will appear in more consumer devices — though I’m not sure whether the fuzzy PID controller in my rice cooker will finally get an upgrade.
AI regulation is moving. The EU AI Act is now in phased implementation, following the same trajectory as GDPR a decade ago. Other countries will follow with similar frameworks. This sounds like bureaucratic overhead, but clearer rules will actually accelerate enterprise adoption by reducing legal uncertainty.
Quantum computing gets an interesting angle from verifiable AI: translating business problems into quantum-compatible formulations is one of the hard parts, and AI can help with that bridge. New theoretical work on quantum transformers and quantum attention mechanisms is emerging. But practical implementation remains years away — past 2030 for most use cases.
Large consulting firms face an interesting challenge: the language barrier between management and technical teams can now be bridged with a $20/month AI subscription. An oversimplification, sure, but the threat is real. Routine advisory work will move in-house.

Riding the Waves

A former boss of mine used to say:

Always train your successor — that’s the only way you get new responsibilities.

The same principle applies, but your successor is now AI. So start writing skills, building tools, automating the parts of your job that are routine. Hopefully, this helps you solve the critical tasks faster (urgent and important) and delegate the busywork (urgent but not important) to agents — or to less senior colleagues enabled with AI. So finally, you can start focusing on the non-urgent but important tasks: finding augmentation opportunities, designing new services, solving problems where no training data exists — which is where your future work lives. Imagine what you could create if you had infinite employees — and start building toward that.

Where AI shifts your focus - the new Eisenhower Matrix

What are your predictions for 2026? I’d love to hear where you agree or disagree — comment on the LinkedIn post or drop a comment below. And do not forget to SUBSCRIBE (top menu), if you like this post :).