TY - GEN
T1 - Interpreting BERT-based Text Similarity via Activation and Saliency Maps
AU - Malkiel, Itzik
AU - Ginzburg, Dvir
AU - Barkan, Oren
AU - Caciularu, Avi
AU - Weill, Jonathan
AU - Koenigstein, Noam
N1 - Publisher Copyright:
© 2022 ACM.
PY - 2022/4/25
Y1 - 2022/4/25
AB - Recently, there has been growing interest in the ability of Transformer-based models to produce meaningful text embeddings, with applications such as text similarity. Despite significant progress in the field, explaining similarity predictions remains challenging, especially in unsupervised settings. In this work, we present an unsupervised technique for explaining paragraph similarities inferred by pre-trained BERT models. Given a pair of paragraphs, our technique identifies the important words that dictate each paragraph's semantics, matches words between the two paragraphs, and retrieves the most important pairs that explain the similarity between them. The method, assessed by extensive human evaluations and demonstrated on datasets comprising long and complex paragraphs, has shown great promise, providing accurate interpretations that correlate better with human perceptions.
KW - Attention Models
KW - Deep Learning
KW - Explainable AI
KW - Interpretability
KW - Self-supervised
KW - Transformers
UR - http://www.scopus.com/inward/record.url?scp=85129821637&partnerID=8YFLogxK
U2 - 10.1145/3485447.3512045
DO - 10.1145/3485447.3512045
M3 - Conference contribution
AN - SCOPUS:85129821637
T3 - WWW 2022 - Proceedings of the ACM Web Conference 2022
SP - 3259
EP - 3268
BT - WWW 2022 - Proceedings of the ACM Web Conference 2022
T2 - 31st ACM World Wide Web Conference, WWW 2022
Y2 - 25 April 2022 through 29 April 2022
ER -