AI Works—Until the Data Becomes Real

I have been building a real-time signal detection pipeline — a system designed to ingest high-volume event data, classify records against defined criteria, and surface meaningful patterns from raw noise. I built it with an AI collaborator, not as a tool for autocomplete, but as a partner in architecture, implementation, and debugging.

This is not a benchmark or a critique. It is a boundary: where AI collaboration works, and where it breaks. That boundary is not intelligence. It is the moment the system encounters real data.

The System

The pipeline ingests roughly four million event records per trading day from a flat-file feed, enriched by a secondary dataset delivered via API. A critical field is missing from the raw events and must be inferred by aligning each record with a time-matched snapshot of the derived dataset.

The output feeds a pattern detector and a cross-event clustering layer, governed by thresholds for value, duration, and cluster size.

Early on, we replaced a data vendor due to structural issues — delayed feeds, inflated values, and missing capabilities that made the core detection logic impossible to implement. No amount of engineering could compensate.

The constraint was in the data.

Where AI Adds Leverage

AI collaboration was most effective where the system was already defined.

When the classifier began hanging, the solution was architectural: separate ingestion from classification. Data is first loaded, enriched, and stored. Classification runs as a separate phase. This transformed a fragile process into a recoverable system.

The same pattern applied to performance. With millions of daily writes, indexing became the bottleneck. Dropping indexes before ingestion and rebuilding them afterward reduced processing time from 41 minutes to 5 minutes per day of data.

Database issues followed a similar pattern. Concurrency constraints and corrupted write states were resolved once the failure modes were clearly defined.

In each case, the collaboration worked because the problem space was explicit and bounded.

AI is highly effective when the system is already known.

The Constraint: When the Data Changes the System

Failures emerged when decisions interacted with the data rather than the implementation.

To validate output against a reference system, I removed the minimum value threshold to expose all candidate records. In isolation, this was reasonable. In the system, it was not.

The clustering layer operates with quadratic complexity. At normal thresholds, it produces a few thousand clusters per day. Without the threshold, it produced hundreds of thousands — turning the system into tens of billions of comparisons. The pipeline stalled under its own scale.

The threshold was not just filtering output. It was enforcing computational feasibility.

A similar pattern appeared during ingestion. The system repeatedly slowed under load, and attempts to optimize within the existing structure led to the same bottleneck. The constraint was not in the queries or batch sizes. It was in the presence of indexes during heavy writes.

The resolution required restructuring the operation: drop indexes, load data in bulk, rebuild indexes, then apply classification. What had been a bottleneck became a linear process.

The same boundary appeared in how classification logic was executed.

The initial design loaded millions of records into application memory and applied classification through iterative loops. At this scale, processing took tens of minutes per day. Attempts to optimize focused on the same structure — improving loop efficiency, batching, and memory handling — but the performance ceiling remained.

The constraint was not in the code. It was in where the computation was happening.

Moving the logic into the database — operating directly on indexed data rather than in-memory iteration — reduced processing time from minutes to seconds. What changed was not the implementation detail, but the execution model.

In each case, the system improved only when the structure changed, not when the existing path was refined.

AI optimizes for immediacy of implementation, not for alignment with system-level constraints.

Hidden Constraints in Data Alignment

A different class of failure emerged in data alignment.

Historical runs used a current snapshot of the derived dataset to infer missing fields for past records. Any drift between the historical state and the current snapshot produced systematic mislabeling — not noise, but directional error that eliminated entire classes of signal.

The symptom appeared in the output. The cause was in the data relationship.

This constraint is not visible in code. It only emerges when the system is run under real conditions.

Not all constraints are computational. Some are embedded in the data itself.

The Boundary: What the Model Cannot Do

Another failure came from recommendation.

At one point, I upgraded to a higher-tier data service based on a confident suggestion. The reasoning was coherent. In practice, it had no impact on the system.

The issue was not correctness in general. It was misalignment with the system’s objective function. The pipeline did not need more data. It needed specific fields, correct latency, and structural consistency.

The model can recommend improvements. It cannot determine whether those improvements matter.

Over time, key classification parameters were established through direct validation against output. These decisions defined what the system considers signal versus artifact. They were not implementation choices. They were definitions of truth.

AI is a strong collaborator on the how. It is not a collaborator on the what.

Implications for Engineering Practice

The implication is not to use AI less, but to use it within the correct boundary.

Structure, implementation, and debugging benefit from collaboration. Problems that require interpreting data, defining signal, or validating output do not. Those require direct interaction with the system.

There is also a velocity effect. Collaboration accelerates decisions. When direction is correct, this is leverage. When it is not, it compounds mistakes.

The cost is not incorrect answers. It is insufficient friction.

The System Carries the Truth

In AI-assisted systems, the pipeline carries the business logic. The model does not.

What data is captured, how it is aligned, what thresholds are enforced, and how signal is defined determine what the model can meaningfully process. The model operates on what is visible. It cannot account for constraints that are implicit or unmodeled.

AI works. It works well in defined systems.

It begins to fail at the point where the system must be defined.

And in real systems, that is where the hardest problems — and the highest leverage decisions — actually live.

Part of the “Pipeline Is the System” series at aimlworld.com.