matteosaponati
Geometrical structures in the self-attention matrices of Transformer models
we are preparing the manuscript, work in progress :)
refs: