AI Forecasting

What vendors mean: AI predicts your future. What it actually means: a category where statistical methods still often beat deep learning — and where the foundation models are starting to change that.

Industry Applications

The Technical Definition

AI forecasting predicts future values of a time-varying quantity — demand, revenue, energy load, inventory, headcount need, customer churn rates. The classical methods (ARIMA, exponential smoothing, Prophet from Meta) have dominated production forecasting for decades and still deliver competitive accuracy on most business time series. The 2024-2026 wave introduced foundation models for time series — Nixtla’s TimeGPT, Amazon’s Chronos, Google’s TimesFM — pretrained on millions of time series and capable of zero-shot forecasting on unseen data. These are not the same as a fine-tuned LSTM; they are a new category.

What This Actually Means for Your Business

Forecasting is one of the oldest production AI use cases in any company that ships, sells, or serves at scale. Demand forecasting drives inventory and supply chain. Revenue forecasting drives finance, hiring, and capital plans. Energy forecasting drives grid operations and pricing. Workforce forecasting drives staffing. Forecasting accuracy is one of the highest-leverage operational metrics most companies measure — a 5% accuracy improvement can mean tens of millions in working capital efficiency.

The thing your data science team probably won’t volunteer: most production forecasting still runs on Prophet, ARIMA variants, or gradient-boosted trees, not deep learning. The reason is that for the typical business time series — weekly demand for a SKU, monthly revenue by product line, daily call volume — the simple methods are competitive with deep learning, faster to train, easier to debug, and more interpretable when they’re wrong. The deep learning advantage shows up at extreme scale (Amazon’s full catalog, Walmart’s full store footprint) and on time series with strong cross-sectional structure (related products that should be forecast jointly).

What changed in 2024-2026 is foundation models for time series. TimeGPT and Chronos can produce respectable forecasts on data they’ve never seen, with no training. This is a different value proposition from “build a custom model for each time series.” For companies with thousands of low-volume series (retail SKUs, individual customer churn risk) where building custom models is uneconomic, zero-shot foundation forecasting is reshaping the unit economics. The accuracy is not always state-of-the-art, but the cost-per-forecast is dramatically lower.

The other shift is the integration of forecasting with planning. The traditional separation — forecasting team makes a forecast, planning team makes a plan — is dissolving. Modern systems run forecasting, scenario analysis, and constrained optimization in the same pipeline. The forecast informs the plan, and the plan’s constraints (capacity, budget, contracts) feed back into the forecasting question. This is where AI forecasting actually creates business value, not in incremental accuracy gains.

Reality Check

What the vendor says: “Our AI forecasting platform improves demand accuracy by 30%.”

What that means in practice: Improved against what baseline? Most “30% improvement” claims compare against a naive baseline (last year’s number) or against a poorly tuned in-house model. Against a well-tuned ARIMA or Prophet model on the same data, the improvement is usually 0-10%. Sometimes the new system loses. Run the bake-off before you sign.

What Operators Actually Do

Companies that take forecasting seriously run bake-offs. They take a year of held-out data, run the candidate methods on the same forecast horizon, and measure on metrics that match their business cost (MAPE, RMSE, weighted variants that penalize stockouts more than overstocks). They include a baseline of their existing in-house method and a baseline of “last year’s number.” The results almost always include some surprises, and the surprises usually save the procurement team from buying the most expensive option.

They also instrument the forecast-to-decision chain. A forecast that’s 5% more accurate but arrives a week later is worse than the old forecast. A forecast at the wrong level of granularity (per-SKU when the planning system needs per-category) is useless no matter how accurate. The right benchmark is decision quality, not forecast accuracy in isolation.

The companies getting the most leverage out of foundation models for time series are the ones with long-tail series that nobody had time to forecast properly before. Mid-tier SKUs at retailers, niche customer segments at SaaS companies, secondary energy markets — these are places where the alternative was a default rule, and a zero-shot foundation forecast is a meaningful improvement at low cost.

The Questions to Ask

What’s the accuracy improvement against a properly tuned baseline, on our data? Insist on a back-test that includes ARIMA, Prophet, and a gradient-boosted tree as baselines, not just “last year’s number.” If the vendor refuses or hedges, the improvement isn’t there.
How does the forecast integrate with our planning system? A forecast disconnected from inventory, staffing, or budget systems creates no value on its own. Get the integration architecture spec, not just the model output spec.
What’s the retraining cadence and drift detection? Time series forecasting models degrade as patterns shift. The vendor should have an answer for how the model is retrained, how often, and how drift is detected before it shows up in your business numbers.

The Technical Definition

What This Actually Means for Your Business

Reality Check

What Operators Actually Do

The Questions to Ask

One operator. Every other Wednesday.