Natural language SQL fails when the AI cannot see your schema
English-to-SQL is the easy part.
That sounds wrong until you try it in a real database.
Ask an AI model to write SQL against a simple example table, and it looks impressive. Ask it to answer a question against a production schema with 300 tables, inconsistent naming, legacy fields, and business logic hidden in column conventions, and the magic fades quickly.
The problem is not that the model cannot write SQL.
The problem is that it does not know what your schema means.
Real databases are not tutorial databases
Tutorials use tables like customers, orders, and products. Production databases use tables like acct_evt_log, subscription_v2, tenant_user_map, and billing_shadow_copy_old.
The model can guess. Sometimes it guesses well. Sometimes it confidently joins the wrong table and gives you a beautiful answer to a question nobody asked.
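A minimal sketch of that failure mode, using SQLite in memory. The table names here (a legacy subscription table alongside its subscription_v2 replacement) are invented for illustration; nothing about them comes from a real schema:

```python
import sqlite3

# In-memory database with a legacy table and its replacement.
con = sqlite3.connect(":memory:")
con.executescript("""
    CREATE TABLE subscription (id INTEGER, status TEXT);     -- legacy, no longer written to
    CREATE TABLE subscription_v2 (id INTEGER, status TEXT);  -- the table the business actually uses
    INSERT INTO subscription VALUES (1, 'active'), (2, 'active');
    INSERT INTO subscription_v2 VALUES
        (1, 'active'), (2, 'cancelled'), (3, 'active'), (4, 'active');
""")

# Same natural-language question ("how many active subscriptions?"),
# two syntactically valid queries, two confident answers. Only schema
# context tells the model which table is the source of truth.
legacy = con.execute(
    "SELECT COUNT(*) FROM subscription WHERE status = 'active'"
).fetchone()[0]
current = con.execute(
    "SELECT COUNT(*) FROM subscription_v2 WHERE status = 'active'"
).fetchone()[0]
print(legacy, current)  # prints: 2 3
```

Both queries run without error, which is exactly the problem: the wrong answer does not announce itself.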
That is why natural language SQL needs schema context before it needs clever prompting.
The model needs a map
When someone asks, "Which trials are most likely to convert?", the AI needs to know more than SQL syntax.
It needs to understand:
- Where trial accounts live
- How plans are represented
- Which timestamp means signup, activation, or conversion
- Whether deleted or test accounts should be excluded
- Which usage events actually indicate product value
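One way to make that kind of knowledge explicit is a small, machine-readable schema annotation the AI client receives alongside the question. A sketch under assumed names (every table, column, and event type below is hypothetical):

```python
# Hypothetical schema context for an AI client. All names are invented
# for illustration; a real system would generate this from the database.
SCHEMA_CONTEXT = {
    "accounts": {
        "description": "One row per tenant account, including trials.",
        "columns": {
            "plan": "Plan code; 'trial' marks trial accounts.",
            "created_at": "Signup timestamp (UTC).",
            "activated_at": "First meaningful product use; NULL if never activated.",
            "converted_at": "Trial-to-paid conversion timestamp; NULL if not converted.",
            "is_test": "True for internal accounts; exclude from analytics.",
            "deleted_at": "Soft-delete marker; exclude rows where this is set.",
        },
    },
    "usage_events": {
        "description": "Product usage events; only some types indicate value.",
        "columns": {
            "account_id": "Foreign key to accounts.",
            "event_type": "'report_created' and 'invite_sent' signal real value.",
        },
    },
}

def exclusion_hints(ctx):
    """Collect columns whose descriptions mark them as analytics filters."""
    return [
        (table, col)
        for table, meta in ctx.items()
        for col, desc in meta["columns"].items()
        if "exclude" in desc.lower()
    ]

print(exclusion_hints(SCHEMA_CONTEXT))
# prints: [('accounts', 'is_test'), ('accounts', 'deleted_at')]
```

The point is not this particular format; it is that questions like "should deleted accounts be excluded?" become answerable from data the client can see.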
Without that context, the model is not querying your business. It is querying column names.
MCP makes schema context part of the workflow
Model Context Protocol (MCP) gives AI clients a structured way to interact with external systems. For databases, the key advantage is that schema and tools can be exposed as context instead of improvised inside a prompt.
That matters because the AI client can reason over the tables, fields, and tool definitions that are actually available, rather than over its best guess at them.
Instead of asking the model to hallucinate the database shape, the MCP layer can provide the shape.
This is the difference between "write SQL for a database you have never seen" and "answer this question using the schema and tools available here."
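Concretely, an MCP server advertises tools as named definitions with JSON Schema inputs that the client inspects before calling anything. A rough sketch of what a database server's tool list might look like; the name/description/inputSchema shape follows the MCP tools listing, but the three tools themselves are illustrative, not any vendor's actual API:

```python
# Illustrative MCP-style tool definitions a database server might expose.
# The tools themselves are hypothetical examples.
TOOLS = [
    {
        "name": "list_tables",
        "description": "Return the tables visible to this connection.",
        "inputSchema": {"type": "object", "properties": {}},
    },
    {
        "name": "describe_table",
        "description": "Return columns, types, and comments for one table.",
        "inputSchema": {
            "type": "object",
            "properties": {"table": {"type": "string"}},
            "required": ["table"],
        },
    },
    {
        "name": "run_query",
        "description": "Execute a read-only SQL query and return rows.",
        "inputSchema": {
            "type": "object",
            "properties": {"sql": {"type": "string"}},
            "required": ["sql"],
        },
    },
]

# The client can discover structure before it ever writes SQL:
discovery_tools = [t["name"] for t in TOOLS if t["name"] != "run_query"]
print(discovery_tools)  # prints: ['list_tables', 'describe_table']
```

The design choice worth noticing: discovery and execution are separate tools, so the model's first moves are look-ups, not guesses.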
Schema discovery should be automatic
Manual schema documentation is useful. It is also usually outdated.
Teams change tables, add columns, rename concepts, and ship features faster than internal documentation can keep up. If AI database access depends on someone maintaining a separate explanation of the schema, it will rot.
A better pattern is automatic schema discovery: connect the database, inspect the structure, and generate the context the AI client needs.
That does not remove the need for human judgment. It removes the busywork that prevents the workflow from being used every day.
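The underlying pattern is one databases already support: read the catalog instead of asking a human. A minimal sketch using SQLite's built-in introspection (PostgreSQL or MySQL would query information_schema instead; the example tables are invented):

```python
import sqlite3

def discover_schema(con):
    """Build schema context directly from the database catalog."""
    schema = {}
    tables = con.execute(
        "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
    ).fetchall()
    for (table,) in tables:
        # Table names come from sqlite_master itself, so interpolating
        # them here is safe; never interpolate untrusted identifiers.
        cols = con.execute(f"PRAGMA table_info({table})").fetchall()
        # PRAGMA table_info rows: (cid, name, type, notnull, dflt_value, pk)
        schema[table] = {name: col_type for _, name, col_type, *_ in cols}
    return schema

con = sqlite3.connect(":memory:")
con.executescript("""
    CREATE TABLE tenant_user_map (tenant_id INTEGER, user_id INTEGER);
    CREATE TABLE acct_evt_log (id INTEGER PRIMARY KEY, evt_type TEXT, ts TEXT);
""")
print(discover_schema(con))
```

Run against a live connection, this stays current by construction: add a column, and the next discovery pass sees it, with no documentation to update.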
Where Conexor fits
Conexor.io connects databases such as PostgreSQL, MySQL, and SQL Server to MCP-compatible AI clients.
The workflow is built around the missing context layer:
- Connect your database.
- Discover the schema automatically.
- Expose MCP tools your AI client can use.
- Ask questions in natural language and get answers from live data.
That is what makes the difference between a demo and a workflow. The model is not just generating SQL in isolation. It is working against a discovered structure.
The best prompt cannot fix missing context
You can write a better prompt. You can add examples. You can tell the model to be careful.
All of that helps at the margin. None of it replaces actual knowledge of the schema.
If your AI client cannot see how your data is structured, it will eventually guess. In analytics, a confident guess is worse than no answer.
Start with schema, not dashboards
Dashboards are useful when you know the question in advance. Natural language querying is useful when the question changes.
But changing questions require stable context. The AI needs a reliable map of the database before it can help people explore it.
That is why schema discovery is not a setup detail. It is the foundation of AI-native data access.
Try Conexor free → Connect your database to AI workflows with MCP-first schema discovery.