Challenge

Production SQL Server databases experiencing performance degradation despite 30% YoY traffic growth. P95 query latency grew from 200ms to 1.2s. Several background jobs failing due to timeouts. Database infrastructure costs ballooning ($85K/month) as team added more compute to scale vertically.

Approach

Diagnostic Phase:

Optimization Strategy:

  1. Index optimization: Added 8 strategic indexes, dropped 12 unused ones (25% storage reduction)
  2. Query rewrites: Refactored 47 slow queries using CTE optimization, window functions, batch operations
  3. Connection pooling: Implemented SQL Server connection pooling reducing connection overhead
  4. Statistics/AUTOUPDATE tuning: Optimized based on workload patterns for query optimizer
  5. Materialized views: 4 heavy analytical queries converted to 2-hour refresh indexed views

Team

Results

Delivery Metrics

Technical Impact

Business Impact

Key Decisions

  1. Profiling before optimization - Data-driven approach prevented wasted effort on wrong queries using SQL Server DMVs
  2. Connection pooling over scaling - Cheaper solution than vertical scaling and better long-term
  3. Indexed views for analytics - Isolated analytical load from OLTP transactions
  4. Scheduled index maintenance - Built sustainable optimization culture vs one-time project