This page provides detailed descriptions of all Δ-Attribution metrics implemented in Delta-Audit.
Definition: Correlation between attribution change magnitude and output change magnitude.
Formula: BAC = Corr(||Δφ||₁, |Δf|)
Range: [-1, 1]
Interpretation:
Use Case: Measures how well attribution changes align with output changes.
Definition: Mean absolute difference between sum of attribution changes and actual output change.
Formula: DCE = E[|ΣΔφ - Δf|]
Range: [0, ∞)
Interpretation:
Use Case: Measures how much the sum of attribution changes differs from the actual output change.
Definition: Mean L1 norm of attribution differences.
Formula: Δ Magnitude L1 = E[||Δφ||₁]
Range: [0, ∞)
Interpretation:
Use Case: Measures the overall magnitude of attribution changes.
Definition: Mean fraction of total magnitude captured by top-10 features.
Formula: Δ TopK10 = E[Σᵢ∈TopK |Δφᵢ| / Σⱼ |Δφⱼ|]
Range: [0, 1]
Interpretation:
Use Case: Measures how concentrated attribution changes are in the most important features.
Definition: Mean entropy of normalized attribution differences.
Formula: Δ Entropy = E[H(|Δφ| / Σ|Δφ|)]
Range: [0, log(n_features)]
Interpretation:
Use Case: Measures the distributional complexity of attribution changes.
Definition: Mean overlap between top-10 features of two attribution sets.
Formula: Rank Overlap @10 = E[|TopK(φ_A) ∩ TopK(φ_B)| / |TopK(φ_A) ∪ TopK(φ_B)|]
Range: [0, 1]
Interpretation:
Use Case: Measures how much the most important features overlap between models.
Definition: Median overlap between top-10 features.
Formula: Median(Rank Overlap @10)
Range: [0, 1]
Interpretation: Robust measure of feature overlap.
Use Case: Less sensitive to outliers than mean rank overlap.
Definition: Distributional shift between attribution sets.
Formula: JSD = 0.5 * KL(φ_A || M) + 0.5 * KL(φ_B || M)
Where M = 0.5 * (φ_A + φ_B)
Range: [0, log(2)]
Interpretation:
Use Case: Measures how much the attribution distributions differ.
Definition: Fraction of attribution changes in relevant features for fixes and regressions.
Formula:
COΔF_fix = E[Σᵢ∈TopM |Δφᵢ| / Σⱼ |Δφⱼ|]
for fixesCOΔF_reg = E[Σᵢ∈TopM |Δφᵢ| / Σⱼ |Δφⱼ|]
for regressionsRange: [0, 1]
Interpretation:
Use Case: Measures how well attribution changes focus on relevant features for performance changes.
Definition: Robustness of attribution changes to input perturbations.
Formula: Δ Stability = E[||Δφ(x+ε) - Δφ(x)||₁ / ||ε||₂]
Range: [0, ∞)
Interpretation:
Use Case: Measures how robust attribution changes are to input noise.
Definition: Classification accuracy of models A and B.
Range: [0, 1]
Use Case: Standard performance comparison.
Definition: Macro-averaged F1 score of models A and B.
Range: [0, 1]
Use Case: Performance comparison for multi-class problems.
Definition: Macro-averaged precision of models A and B.
Range: [0, 1]
Use Case: Performance comparison for multi-class problems.