How to build a money mule detection platform on Elasticsearch

Get hands-on with Elasticsearch: Dive into our sample notebooks in the Elasticsearch Labs repo, start a free cloud trial, or try Elastic on your local machine now.

Five ES|QL queries flag money mule accounts across an entire portfolio, trace laundering networks downstream from a single reported case, and correlate suspicious payments against behavioral signals in a separate cluster without moving any data. This post covers the architecture, data model and ES|QL queries behind the fraud investigation platform we built on Elasticsearch.. It also covers how tiered storage and logsdb index mode bring seven-year regulatory retention costs down by up to 65%. Part 2 of three; Part 1 covered the investigation gap in financial services, Part 3 will cover the AI agents built on top.

How the fraud investigation platform came together

We demonstrated this platform at a recent UK Customer Tech Talk. The key point worth emphasizing: Elasticsearch, Agent Builder, ES|QL, and Kibana are components many financial institutions already run, so the investigation capability described here can be assembled from existing infrastructure rather than purchased as a separate stack.

Fraud investigation platform architecture on Elasticsearch

The platform follows a layered architecture with clear separation between data, intelligence, and presentation.

Elasticsearch fraud investigation platform architecture with Agent Builder, ES|QL queries, cross-cluster search and Kibana across data, intelligence and presentation layers — *Fig 1. High-level Elastic Architecture*

Data layer: two clusters, one search

Fraud investigation data rarely lives in one place. Transaction records, customer profiles, and account metadata typically sit in a banking data cluster managed by payments or core banking teams. Meanwhile, application telemetry (mobile app session logs, online banking events, device fingerprints, and call center interaction records) often lives in an observability cluster managed by engineering or operations teams.

Cross-cluster search (CCS) allows each team to keep ownership of its data. At the same time, investigators can query across both clusters in a single request without migrating into a single cluster and without a separate aggregation layer. The banking cluster holds transactional data; the observability cluster holds behavioral signals. Investigators see one unified result set without either team giving up control of their data.

Cross-cluster search is not just a technical convenience. It addresses the intra-organisational barriers we described in Part 1 - different teams owning different data, concerns about query impact on production systems, and role-based access controls that exist for good reason. CCS respects all of these while still allowing cross-domain investigation.

Intelligence layer: Agent Builder, skills, and workflows

The intelligence layer is where AI agents, ES|QL queries, machine learning jobs, and workflow automation come together. In Elastic Agent Builder, agents route a request to the right skill, and each skill binds the ES|QL tools and workflows it needs; for example, Investigation, Typology & Narrative, Graph & Network, COP/Vulnerability, and SAR Creation. Part 3 covers the agent and skill architecture in detail. In this blog, we focus on the data and query foundations they depend on.

Presentation layer: Kibana and custom applications

The platform includes a custom investigation interface that communicates with the agent service and Elasticsearch APIs. But the same data is also available in Kibana for ad-hoc, analyst-driven exploration. This dual-interface approach is intentional: structured investigation workflows for speed, and Kibana for questions the workflow was not designed to answer.

Kibana dashboards provide operational views (alert queues, case volume trends, and mule account heatmaps), while Discover and ES|QL in the console give analysts the freedom to run exploratory queries without waiting for someone to build a new dashboard.

A note on availability: the data model and ES|QL queries shown here run on any Elasticsearch deployment, but two capabilities in this architecture have specific requirements:

Cross-cluster search with ES|QL is generally available in Elasticsearch 8.19 and 9.1; requires a compatible subscription on both clusters (see cross-cluster search and subscriptions).
Elastic Agent Builder is available on Elastic Cloud Serverless and on the Enterprise tier for cloud-hosted and self-managed deployments (see Agent Builder general availability).

Data model for money mule detection: transactions, accounts and behavioral signals

The data model is the foundation of everything else. Fraud investigation is a cross-domain problem, which means the data model must accommodate different document types while still supporting fast, flexible querying.

Elasticsearch fraud investigation data model showing cross-cluster search connecting banking transaction indices to behavioral telemetry in the observability cluster — *Fig 2. App Fraud Data Model - FCA Sandbox*

Banking transactions

The core index holds payment transaction records. Each document represents a single payment event with fields for the parties involved, amounts, timing, and metadata.

A few things stand out about this structure.

The purpose_code and merchant_category fields carry ISO 20022 enrichment where available, providing signals that a rules-based system can use alongside the AI agent.
Negative amounts denote debits, with is_credit and is_debit flags for fast filtering.
geo_location enables spatial analysis of transaction patterns.

Where ISO 20022 enriched data is available, purpose codes, Legal Entity Identifiers, and structured remittance information provide additional signals. The schema-agnostic nature of Elasticsearch means we can ingest both ISO 8583 legacy formats and ISO 20022 enriched formats into the same index without a rigid upfront schema.

Accounts and people

Customer data is generally hosted in a CDP (Customer Data Platform), but to simplify the architecture, we put it in the same cluster. Account-level data is in a banking-accounts index, while customer identity and profile data live in banking-people. This separation is deliberate: investigators who need to query transaction patterns can do so without accessing personally identifiable information, and RBAC controls can restrict PII access to authorised roles.

The person_id field links to the banking-people index, which holds customer identity, address, credit history, employment, and vulnerability flags (kept in a separate index with its own access controls). A simplified example:

Application telemetry (observability cluster)

The observability cluster holds session-level telemetry from the bank’s own mobile and online banking applications. This is not browsing history or external website data. It is telemetry from the institution’s own services, which payment service providers legitimately have access to.

Session telemetry reveals behavioral patterns that transaction data alone cannot: how long a customer spends in the app before making a payment, which screens they visit (particularly payment limit screens), whether the device fingerprint matches their normal device, and whether the login location is consistent with their history.

Call center logs

Customer service interactions are indexed with structured metadata, making it possible to correlate service calls with subsequent transactions.

How do you detect money mule accounts with ES|QL?

ES|QL is Elastic’s piped query language, designed for exploratory analysis across large datasets. In our platform, ES|QL queries serve two roles: they are the tools that agent skills invoke programmatically, and they are available directly to analysts in Kibana for ad-hoc exploration.

Here are the key queries that power the platform.

Mule detection: finding accounts where money flows straight through

This is the most important query in the platform. It looks at every account, calculates how much money came in versus how much went out, measures how quickly that happened, and flags the accounts where money is flowing straight through.

Sample output:

A turnover_ratio close to 1.0 means almost everything that came in went straight back out; the hallmark of a mule account. The days_active filter focuses on recently opened accounts, and the incoming threshold filters out noise. The result is a ranked list of the most suspicious accounts in the portfolio.

Profiling a suspect account: who is sending money in?

Once an account is flagged, the next step is understanding the inbound pattern. This query identifies every account that sent money to a given destination and calculates the total exposure per source.

Sample output:

This surfaces the full list of source accounts - the potential victims. When an account shows 23 distinct sources over six weeks, all sending large sums to the same destination, the pattern is clear.

Tracing the network downstream

Mule networks are layered. To trace where the money goes after the primary mule, we flip the query to look at outbound flows:

Sample output:

The agent chains these queries together: profiling the primary mule’s downstream accounts, then running the same mule detection query against each of them to assess whether they are also part of the laundering network. This recursive pattern is what turns a single reported case into a full network-level investigation.

Customer behavioral baseline

To assess whether a specific transaction is unusual for a given customer, we calculate their historical baseline and compare:

Sample output:

An amount_ratio of 12 means the flagged transaction is 12 times the customer’s typical payment. The agent uses this, alongside timing and channel data, as input to its fraud likelihood assessment.

Cross-cluster behavioral context

Cross-cluster search allows a single ES|QL query to pull data from the observability cluster. This query retrieves the customer’s app telemetry in the hours leading up to the flagged transaction:

Sample output:

When this reveals that a customer who normally spends five minutes banking at lunchtime instead spent two hours in the app late at night, visited the payment limits screen seven times, and then made an unusually large transfer, the picture becomes much clearer. This behavioral context is often the difference between a confident fraud assessment and an ambiguous one.

Scaling fraud data retention with Elasticsearch tiered storage

A fraud investigation platform is only useful if it can handle real-world data volumes and regulatory retention requirements.

Data volumes

Elasticsearch’s distributed architecture handles this comfortably. The same infrastructure that processes billions of security events daily for enterprise customers is more than capable of handling payment transaction volumes (customers such as BBVA).

Tiered storage for regulatory retention

UK financial regulations typically require seven years of transaction data retention. That is a lot of data. Elastic’s tiered storage architecture makes this economically viable:

Hot tier (NVMe SSDs): The last 7 to 30 days. Sub-second query performance for active investigations and real-time alerting.
Warm tier (SSDs): 30 days to one year. Fast enough for historical case review and pattern analysis.
Cold tier and frozen tier (object storage): One to seven years. Compliance and audit queries, training data for ML models.

Logsdb for cost reduction

Logsdb index mode reduces storage costs by up to 65% for transaction data by optimizing how fields are stored and indexed. For a platform retaining years of data at scale, this is the difference between an affordable deployment and a prohibitively expensive one.

Key lessons building a fraud investigation platform on Elasticsearch

Building the Elasticsearch fraud investigation platform reinforced a few lessons worth sharing for anyone building something similar.

Lesson	Why it matters	What to do
Start with the data model	A well-defined schema gives AI agents the context to interpret query results accurately. Every downstream query, skill and dashboard depends on it.	Define explicit mappings upfront, e.g. field names, types and relationships. Normalise field names across indices, define clear document boundaries, and plan for cross-cluster query patterns.
Behavioral data is the differentiator	Transaction data alone is ambiguous. A £15,000 payment to an unfamiliar account could be a house deposit, a car purchase or fraud. Behavioral context (e.g. two hours of app usage at midnight, seven visits to the payment limits screen, a call center request three days prior) is what turns ambiguity into confidence.	Ingest app session telemetry alongside transaction data and query it via cross-cluster search at investigation time.
Cross-cluster search makes the architecture realistic	CCS allows investigators to query across banking and observability clusters while respecting existing data ownership boundaries. Without it, you need data migration or a separate aggregation layer.	Treat CCS as a structural requirement, not an optimisation. It is what makes this architecture deployable in a real institution.

Next: AI agents and workflow automation

With the data model and query foundations in place, part 3 will cover the AI agents built on top of this architecture: how they are configured, which tools they use, how workflow automation handles SAR report generation and account investigation, and a worked scenario that demonstrates the complete investigation flow from alert to compliance action. The result is a unified, queryable investigation layer built from infrastructure that many institutions already run.

Ready to explore Elastic for fraud investigation? Talk to our financial services team about a proof of concept with your data, an architecture review of your current fraud stack, or a strategic discussion on how Elastic fits your fraud roadmap.

이 콘텐츠가 얼마나 도움이 되었습니까?

도움이 되지 않음

어느 정도 도움이 됩니다

매우 도움이 됨

문제 신고하기

Follow the money: tracing laundering networks with ES|QL and cross-cluster search

How the fraud investigation platform came together

Fraud investigation platform architecture on Elasticsearch

Data layer: two clusters, one search

Intelligence layer: Agent Builder, skills, and workflows

Presentation layer: Kibana and custom applications

Data model for money mule detection: transactions, accounts and behavioral signals

Banking transactions

Accounts and people

Application telemetry (observability cluster)

Call center logs

How do you detect money mule accounts with ES|QL?

Mule detection: finding accounts where money flows straight through

Profiling a suspect account: who is sending money in?

Tracing the network downstream

Customer behavioral baseline

Cross-cluster behavioral context

Scaling fraud data retention with Elasticsearch tiered storage

Data volumes

Tiered storage for regulatory retention

Logsdb for cost reduction

Key lessons building a fraud investigation platform on Elasticsearch

이 콘텐츠가 얼마나 도움이 되었습니까?

관련 콘텐츠

How to instrument your search API with OpenTelemetry and query it with ES|QL

Why Elasticsearch is becoming a columnar database

How to build search analytics on Elastic using OpenTelemetry, no extra pipeline required

One command. Natural language. Your Elasticsearch data, straight to the terminal.

Building a multilingual voice agent with Elastic Agent Builder & Sarvam AI

최첨단 검색 환경을 구축할 준비가 되셨나요?