Understanding and Detecting Query Performance Regression in Practical Index Tuning
Proceedings of the ACM on Management of Data (SIGMOD 2026) | , Vol 3(6)
Existing index tuners typically rely on the “what if” API provided by the query optimizer to estimate the execution cost of a query on top of an index configuration. Such cost estimates can be inaccurate and may therefore lead to significant query performance regression (QPR) once the recommended indexes are materialized. This becomes a serious problem for cloud database providers, such as Microsoft’s Azure SQL Database, that offer index tuning as an automated service (a.k.a. “auto-indexing”). Previous work has explored use of supervised machine learning (ML) to reduce the likelihood of QPR. However, the trained ML models have limited generalization capability when applied to new databases and workloads. We propose an alternative approach where we analyze the query plan pairs with significant QPRs and look for structural changes due to the new index configuration that could explain the QPR. We perform such study for index tuning data across many benchmark and real-world database workloads, for multiple realistic index tuning scenarios. Our study reveals that most of the significant QPRs can be attributed to a small number of common ”regression patterns” characterizing the structural plan changes, and we further propose a pattern-based QPR detector accordingly. Our experimental evaluation shows that the pattern-based QPR detector can significantly outperform existing ML-based QPR detectors.
Extended version of the paper: [Full Version]
Some of the artifacts used in this work are available: Supplementary Materials (opens in new tab).