In the rapidly evolving world of AI, major players are racing to build the strongest models. As a result, models keep getting bigger and more complex. But as many specialists point out, we’re approaching a point of diminishing returns, where larger and more complex models deliver smaller performance gains. Many experts now argue that LLMs (Large Language Models) alone won’t lead us to AGI (Artificial General Intelligence), and that new approaches are needed.
At the same time, some companies are investing in Small Language Models (SLMs), sparking the debate: are SLMs worth the investment if LLMs can do so much more?
LLMs (like ChatGPT, Claude, Gemini, Grok, or LLaMA) are powerful, general-purpose models trained on vast datasets. They excel in complex reasoning, broad context understanding, and creative problem-solving. However, this power comes with trade-offs: high computational costs, slower response times, and heavy infrastructure requirements.
SLMs, on the other hand, are nimble, specialized, and resource-efficient. Models like Mistral 7B or Phi-3 are designed for low-latency performance (faster response times) on edge devices, even smartphones. They are fine-tuned for specific tasks and offer more control (less bias and fewer hallucinations), simpler operations and maintenance, and greater transparency. Additionally, due to their smaller size, they are significantly cheaper to train and deploy.
Choosing between LLMs and SLMs depends on the context. But just like in modern software, where we use modules, microservices, and API ecosystems, the real opportunity may lie in combining both approaches. What if the key is not choosing one model, but orchestrating many?
Imagine an AI architecture where the LLM is no longer the sole engine behind every task. Instead, it acts as a context-aware router or orchestrator:
It understands the intent and context of the query
It selects the most appropriate SLM(s) or external tools (e.g. a math library or knowledge base)
It routes the task to the right modules
It aggregates, interprets, or refines the results (and potentially iterates with the same or different SLMs)
It composes a meaningful, final response for the user
This is not science fiction; it is a scalable, efficient paradigm. Just as CPUs hand off work to GPUs or route I/O to dedicated chips, the future of AI could be built on modular intelligence.
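To make the idea concrete, here is a minimal sketch of such an orchestration loop in Python. Everything in it is hypothetical: the intent labels, the specialist registry, and the keyword-based classify_intent function stand in for what would, in practice, be a generalist LLM and real SLM or tool endpoints.

```python
# Minimal sketch of an LLM-as-orchestrator loop (all names are hypothetical).
from dataclasses import dataclass
from typing import Callable

@dataclass
class Specialist:
    name: str
    handler: Callable[[str], str]  # an SLM endpoint or a non-AI tool

# Execution layer: a registry of specialists keyed by the kind of work they do.
SPECIALISTS = {
    "math": Specialist("math-slm", lambda q: f"[math-slm answer to: {q}]"),
    "ocr": Specialist("ocr-slm", lambda q: f"[ocr-slm output for: {q}]"),
    "general": Specialist("general-slm", lambda q: f"[general answer to: {q}]"),
}

def classify_intent(query: str) -> str:
    """Routing layer: in practice this call would go to a generalist LLM;
    a simple keyword check stands in for it here."""
    if any(tok in query.lower() for tok in ("integral", "solve", "sum")):
        return "math"
    if "scan" in query.lower():
        return "ocr"
    return "general"

def orchestrate(query: str) -> str:
    intent = classify_intent(query)          # 1. understand intent and context
    specialist = SPECIALISTS[intent]         # 2. select the right module
    raw_result = specialist.handler(query)   # 3. route the sub-task
    # 4./5. aggregate and compose: the generalist LLM would normally rephrase
    # the raw result into a user-facing answer; a template stands in here.
    return f"(via {specialist.name}) {raw_result}"

print(orchestrate("Solve the integral of x^2"))
```

In a real system the routing step would itself be an LLM call, and the registry would point to network endpoints rather than in-process functions.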
This layered architecture opens the door to a thriving AI ecosystem:
Routing layer: A generalist LLM interprets the request and dynamically selects the best execution path
Execution layer: A mix of SLMs and non-AI components (e.g. logic engines, search APIs) handle specific sub-tasks
Feedback loop: The routing layer improves its strategy based on intermediate results
Composable outputs: A final response is assembled from multiple sources
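Continuing the same hypothetical sketch, the feedback loop and composable outputs could look roughly like this: if an intermediate result looks weak, the router reroutes to another module before assembling the final answer. The score function and the inline handlers are placeholders, not a real API.

```python
# Hypothetical sketch of the feedback loop and output composition.
def score(result: str) -> float:
    """Placeholder for the quality check the routing layer would perform on
    intermediate results (confidence scores, self-critique, validators...)."""
    return 0.3 if "unsure" in result else 0.9

def run_with_feedback(query: str, candidates: list) -> str:
    partial_outputs = []
    for handler in candidates:           # candidates ordered by expected fit
        result = handler(query)
        partial_outputs.append(result)
        if score(result) >= 0.8:         # good enough: stop rerouting
            break
    # Composable output: the final response is assembled from whatever the
    # specialists produced, then polished by the generalist LLM.
    return " | ".join(partial_outputs)

answer = run_with_feedback(
    "Extract the totals from this scanned invoice",
    [lambda q: "unsure: low-resolution scan",    # first SLM struggles
     lambda q: "totals: 1,240.50 EUR"],          # fallback module succeeds
)
print(answer)
```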
Each module can be independently developed by niche players; think of it as a super-app for AI.
These players might specialize in:
Specific tasks, e.g. image generation, audio generation, OCR, speech-to-text conversion, document format conversion, video generation, math problem solving, statistical analysis…
Specific domains, e.g. financial services, particular programming languages, vendor-specific tools with embedded knowledge/manuals (e.g. SAP, Salesforce…)
This approach allows you to keep your preferred LLM while gaining access to best-of-breed specialist models. While some routing already happens internally within LLMs, exposing this functionality to an open ecosystem would be far more powerful.
LLMs could offer default integrations with selected partners, while also enabling users to plug in third-party routing services, potentially as a paid feature. We could even imagine plugging in internal models in a secure, privacy-aware way, allowing companies to combine an internally controlled model (trained on sensitive or classified data) with the rapid evolution of general-purpose LLMs and specialized SLMs.
The analogy to the emerging trend of payment orchestration platforms is useful. Where merchants used to connect to a single PSP (payment service provider), many now use orchestration platforms that route payments to different PSPs depending on method, cost, or availability. This routing also increases reliability, enabling fallback if one PSP is down or a method is temporarily disabled. The same logic applies to AI orchestration: the router could optimize based on model quality, domain expertise, cost per call, or availability.
Given the resource-intensive nature of AI models, SLMs might adopt dynamic pricing based on load (i.e. higher prices during peak usage). This would allow the LLM to reroute intelligently, improving load balancing and reducing the need for overprovisioning.
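A routing policy along those lines might weigh quality, current price, and availability for each candidate model, as in the illustrative sketch below; the model names, numbers, and selection rule are invented purely for illustration.

```python
# Illustrative-only routing policy: pick the cheapest available specialist
# that meets a quality bar, so load spikes (reflected in dynamic prices)
# naturally push traffic toward less busy models.
CANDIDATES = [
    {"name": "finance-slm-a", "quality": 0.92, "price_per_call": 0.004, "available": True},
    {"name": "finance-slm-b", "quality": 0.88, "price_per_call": 0.001, "available": True},
    {"name": "finance-slm-c", "quality": 0.95, "price_per_call": 0.012, "available": False},
]

def pick_model(candidates, min_quality=0.85):
    usable = [c for c in candidates if c["available"] and c["quality"] >= min_quality]
    if not usable:
        raise RuntimeError("no specialist available; fall back to the generalist LLM")
    return min(usable, key=lambda c: c["price_per_call"])

print(pick_model(CANDIDATES)["name"])   # -> finance-slm-b
```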
This architecture enables organizations to combine cost-efficiency with high performance, retain control over sensitive data, and reduce hallucination and bias—flaws often seen in monolithic LLMs.
The next wave of AI innovation might be powered by collaboration, not competition. It’s similar to the evolution of Fintechs: after initially trying to disrupt traditional banks, many are now embedding their services within the customer layers of those same institutions.
In my view, this collaboration will give rise to a landscape of specialized AI models, much like a modular API ecosystem or super-app, with orchestration at its core. The result will be stronger than the sum of its parts, in contrast to the all-in-one approach pursued by some tech giants like OpenAI.
There may even be a unique opportunity for European players. Instead of trying to catch up in the race for massive models (a race hindered by high energy costs and infrastructure gaps), Europe could lead by creating orchestrated ecosystems of best-of-breed, niche models that are focused, efficient, and highly performant.
The real question isn’t whether to use a large or small model; it’s how to combine them intelligently for the best result.
