Blog Post · 11 min read
Attribution Methods: Saliency, Integrated Gradients, LIME, and SHAP
What each attribution method actually computes, where they agree, where they fail, and whether gradient-based and perturbation-based approaches are still relevant for LLMs.