User Tools

Site Tools


nlp:explainability

Explainability

Explainability can be crucial for the adoption of automatic methods. For example, without an explaination for the diagnosis, doctors are highly unlikely to use an automatic diagnosis system. Explainability is an open problem for machine learning and NLP (see open problems). See also Wikipedia - Explainable AI.

Explainability in Neural Networks

Surveys

Papers

Jeff's opinion: I have reservations about the gradient-based methods because a small effect of an infinitesimal change doesn't necessarily mean it isn't important - it could be important but saturate the activation function to produce a flat spot in the gradient. I prefer methods like Li et al 2016 - Understanding Neural Networks through Representation Erasure and Burns et al 2019 - Interpreting Black Box Models via Hypothesis Testing.

Converting Neural Networks to Decision Trees

Explainable NLP

Interpretability and Explainability In LLMs

Natural Language Explanations

Evaluating Explanations

Conferences, Workshops, and Shared Tasks

nlp/explainability.txt · Last modified: 2025/06/01 23:17 by jmflanig

Donate Powered by PHP Valid HTML5 Valid CSS Driven by DokuWiki